Gene Cthe_0202 details


Gene Information

Locus tag: Cthe_0202
Symbol:
ID: 4808620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 244786
End bp: 246108
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 41%
IMG OID: 640105615
Product: L-glutamine synthetase
Protein accession: YP_001036636
Protein GI: 125972726
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0174] Glutamine synthetase
TIGRFAM ID: [TIGR00653] glutamine synthetase, type I
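
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record for Cthe_0202.
length_bp = gene_length(244786, 246108)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1323 440
```

Under these assumptions the computed values agree with the reported 1323 bp and 440 aa.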


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTATA ATATAAAAAT AAAAGAGGTG CTGGAGTTTG TCGAAGAAAA TGACGTGAAA 
TTTATCCGAC TCGCCTTTTG TGATATTCTG GGGATTCCAA AGAATATTTC CATCATGCCC
CAGGAACTTG AAAGAGCGTT TGAACAAGGT ATATCCTTTG ATGCGTCTTC AATTTTGGGA
TTTATGAATG TTGAAAAATC CGATTTGTTT TTACATCCTG ACCCATCGAC TTTAAGCATT
TTGCCCTGGA GACCCCAGCA GGGAAGGGTA ATACGTTTTT TCTGCGACAT AAAGCATCCG
GATGGAAGTG CGTTTGAGGG AGATTCAAGA AATATTCTTA AAAAAGCGGT GGAACGGGCG
GAAAAAATGG GGTATGCGTG CAGAATAGGT TCGGAGTGCG AATTTTATTT GTTTGAAACC
GATGAAAAAG GAAGACCCAC ATATATTCCC CATGATGAGG GGGGATATTT GGATATGGCG
CCCCTTGACA AAGGGGAGAA CGTAAGAAGG GAGATTTGCC TGTCGCTTGA ACAAATGGGA
ATTCAGCCGG AAAGTTCCCA TCATGAACAG GGGCCCGGGC AACATGAAAT AGACTTTAAA
TACAGTGACG CTCTTACAGC TGCAGATGAT TTGATGACTT TCAAGACGGT GGTCAAGGCT
GTTGCATCAA GAAACGGACT TTTTGCCTCC TTTATGCCGA AACCCATTTT GACTGAAAGC
GGCAGCGGAC TTCATATTAA TATATCTCTT TCAAAGGACG GATTTAATAT TTTCAAAGAG
AGGAATTATG ATTCTTCGGC CGCTAAAAGC TTTATTGCCG GGGTTATTGA TAAAATATTG
GATATTACCG CATTTGCAAA TCCGATAACG AATTCTTATG CCCGGCTTGG AAGTTTCAGG
GCGCCGAAAT ACGTATCCTG GTCTCATCAA AATCGTTCCC AGCTTATAAG AATCCCTGCT
GAAACAGGGG AATACAGCAG AATGGAACTT CGTTCTCCGG ATCCGGCCTG CAATCCTTAC
ATTACTTTCG CTCTTATTTT GCATGCGGGA CTTGACGGGA TAGAGAGAAA ATTGGAGCTT
CCCGGGCCGA TTAATCAGAA TTTGTACAAT GCCGGCGCTG ATGAGTTGCA AAATATCAAA
GCTCTTCCGC AGAATTTGAA AGAGGCTTTG GATGTTGCAT CAAAAAGTAG TTTTGTAAGA
AACATTTTAG GCGAGGAAAT GTTAAGCAAG TATTTGGAGA TAAAGCTAAA AGAGTGGAAC
ATGTATTTTG AAAGTGAAGA CAGGGAAAGC GTGGAAAAAC AGATGTATTT TAAAATTATT
TAA
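
A GC content figure like the one in the gene information can be recomputed from the gene sequence. The sketch below is a minimal version that assumes GC content means the fraction of G and C bases over all bases, ignoring the spaces and line breaks used for display. The database may round or compute its value differently, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First displayed row of the gene sequence; paste the full sequence
# to compare against the reported GC content.
first_row = "ATGAGTTATA ATATAAAAAT AAAAGAGGTG CTGGAGTTTG TCGAAGAAAA TGACGTGAAA"
print(f"{gc_content(first_row):.0%}")
```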
 
Protein sequence
MSYNIKIKEV LEFVEENDVK FIRLAFCDIL GIPKNISIMP QELERAFEQG ISFDASSILG 
FMNVEKSDLF LHPDPSTLSI LPWRPQQGRV IRFFCDIKHP DGSAFEGDSR NILKKAVERA
EKMGYACRIG SECEFYLFET DEKGRPTYIP HDEGGYLDMA PLDKGENVRR EICLSLEQMG
IQPESSHHEQ GPGQHEIDFK YSDALTAADD LMTFKTVVKA VASRNGLFAS FMPKPILTES
GSGLHINISL SKDGFNIFKE RNYDSSAAKS FIAGVIDKIL DITAFANPIT NSYARLGSFR
APKYVSWSHQ NRSQLIRIPA ETGEYSRMEL RSPDPACNPY ITFALILHAG LDGIERKLEL
PGPINQNLYN AGADELQNIK ALPQNLKEAL DVASKSSFVR NILGEEMLSK YLEIKLKEWN
MYFESEDRES VEKQMYFKII