Gene Cthe_0556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0556 
Symbol 
ID4808231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp681871 
End bp683712 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content43% 
IMG OID640105970 
Productasparagine synthase (glutamine-hydrolyzing) 
Protein accessionYP_001036985 
Protein GI125973075 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0367] Asparagine synthase (glutamine-hydrolyzing) 
TIGRFAM ID[TIGR01536] asparagine synthase (glutamine-hydrolyzing) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.464828 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCGGTA TTGCAGGATG GATAAATCTT AAATCAGACT TGACCAATCA AAAAGAAATA 
TTAAATTCAA TGATTGACAG GCTGACCCCC AGGGGACCCG ATGCATCGGG AAGTTGGATT
TCTCCAAATG CGCTTATTGC CCATAAGCGT TTGATAGTTG TCGACCCGGA AGGCGGAATT
CAACCCATGG TTCGCAGGCA GGGTGAAAAC ACGTACGTTA TTACATACAA CGGGGAACTT
TACAACACTG CGGACCTGCG AAACGAACTT GAATCCAGGG GCCATGAATT TTTAACAAGC
TCTGACACGG AAGTTTTACT TGTTTCCTAT ATTGAATGGG GTGCAAAATG TGTCGAACAT
CTAAACGGAA TATATGCTTT CGGAATTTGG GATGAAGGAA AAAAGAGGCT TTTTCTGGGA
AGGGACAGAT TCGGCGTAAA ACCCCTGTTT TATGCCCAAA GAGGAGATTC TTTGATATTT
GGCTCTGAAT TAAAGGCGCT CCTGGCAAAT CCCCTTGTGG AGCCAAAATT GGATGCCGAA
GGCTTAGCGG AAATCTTTGC GCTGGGACCC GCCCGAACAC CGGGACACGG TATATTCAAA
GATGTTTATG AACTAAAACC CGCTCATTCA ATGATTTTTG ATATAAACGG AATACAGATC
AGAAAATACT GGTCGCTGGA AAGTTATCCC CATACCGACA GTGAAAAAGC CACCATATCC
AAAGTCAGAG ATTTTGTATT GGATGCCATC ACAAGGCAAC TTGTGGCAGA TGTTCCTGTC
TGTACTTTTC TGTCGGGCGG ACTTGATTCA AGTGCCATTA CGGCTGTTGC TTCAAAGACT
TTTGCCTCCC AGGGCAAAGG ACAGCTTAAC ACTTTTTCTG TAGATTATGT AGACAATGAC
ATTTATTTTA AGCCCAGCAT GTTTCAGCCA AACTCCGATG AACCCTGGAT AAAAAAAATG
AGTGAAAGCT TTAATACATG CCATCACTAT GTAAAATTTG ACACACCACA GCTGGTTGAC
GCTCTTATTG ACGCTGTAAA GGCACGGGAC CTGCCGGGAA TGGCAGATAT TGACTCATCC
CTGTTTCTTT TCTGCCGGGA GGTGAAAAAA TTTTCGGTTG TGGCATTATC CGGTGAATGT
GCGGATGAAA TATTCGGAGG ATATCCGTGG TTTCATAATG AGAAAATGCT GTTTGCCGAT
ACTTTCCCCT GGTCCGTTTC GGTACACGAA AGGACAAAAA TACTGTCCCC TGAGATTTTG
AATTTGATAA AGCCTGAAGA ATACATCCAA AGACGTTATA GAGAAACTCT AAGCGAGGTT
CCGCACTTAA AAGGGGAAAA CAGAATTGAA GCCAGAAGAA GAGAAATCTT TTACCTAAAC
ATCAACTGGT TCATGGCCAC CTTGCTTGAC AGAAAAGACC GCATGAGTAT GGCATCGGGT
CTTGAAGTAA GGGTTCCGTT CTGCGACCAC AGACTGGTCC AATACGTATG GAACATACCA
TGGGAGTTGA AAATGTACAA CAAAAGAGAA AAAGGGCTTT TAAGACAGGC TCTAAAAGGC
ATACTTCCAG ATGATATAAT TGAGAGAAAG AAAAGCCCAT ACCCGAAAAC TCATAATCCT
TCATATAAAA AAGCTGTCAG CAAATGGCTG CTCGAAATAT TAAACGACAG CAGTTCACCT
CTTCATCAAT TAATTGACGT AAAAGTGGTA AGGTCCATGG CCGAAGGAAA TTCGGACAAC
ACAGATCCCT GGTTCGGCCA GTTGATGGCA CAGCCCCAAA TGCTTGCCTA TCTTATCCAG
GTGGATTTTT GGCTGCGGGA TAACCATATA TCAATTGTAT AA
 
Protein sequence
MCGIAGWINL KSDLTNQKEI LNSMIDRLTP RGPDASGSWI SPNALIAHKR LIVVDPEGGI 
QPMVRRQGEN TYVITYNGEL YNTADLRNEL ESRGHEFLTS SDTEVLLVSY IEWGAKCVEH
LNGIYAFGIW DEGKKRLFLG RDRFGVKPLF YAQRGDSLIF GSELKALLAN PLVEPKLDAE
GLAEIFALGP ARTPGHGIFK DVYELKPAHS MIFDINGIQI RKYWSLESYP HTDSEKATIS
KVRDFVLDAI TRQLVADVPV CTFLSGGLDS SAITAVASKT FASQGKGQLN TFSVDYVDND
IYFKPSMFQP NSDEPWIKKM SESFNTCHHY VKFDTPQLVD ALIDAVKARD LPGMADIDSS
LFLFCREVKK FSVVALSGEC ADEIFGGYPW FHNEKMLFAD TFPWSVSVHE RTKILSPEIL
NLIKPEEYIQ RRYRETLSEV PHLKGENRIE ARRREIFYLN INWFMATLLD RKDRMSMASG
LEVRVPFCDH RLVQYVWNIP WELKMYNKRE KGLLRQALKG ILPDDIIERK KSPYPKTHNP
SYKKAVSKWL LEILNDSSSP LHQLIDVKVV RSMAEGNSDN TDPWFGQLMA QPQMLAYLIQ
VDFWLRDNHI SIV