Gene Cthe_0198 details


Gene Information

Locus tag: Cthe_0198
Symbol:
ID: 4808616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 240847
End bp: 242352
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 44%
IMG OID: 640105611
Product: glutamate synthase (NADPH) GltB2 subunit
Protein accession: YP_001036632
Protein GI: 125972722
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0069] Glutamate synthase domain 2
TIGRFAM ID:
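
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the unspliced CDS length includes the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
length = gene_length_bp(240847, 242352)
print(length)                     # 1506
print(protein_length_aa(length))  # 501
```

Under these assumptions the coordinates reproduce the listed gene length of 1506 bp and protein length of 501 aa.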


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAATCA ATTTTTTGTA TCCGGAATAT GAAGTTGTAA GAAATATGGA CAGATGCATT 
AACTGCAGGG TTTGCGAAAG ACAGTGCGCC AATGAGGTTC ACAGCTACGA CCAGGAGAAA
AAAATAATGT GCAGTGATGA TTCCAAATGC GTCAACTGCC ATAGATGTGT GGCTCTTTGT
CCTACAAGGG CTCTCAAGAT AGTAAAAACG GATCATACGT TTAAAGAAAA TGCTAACTGG
AGAGGTTCAA TAATTAATGA AATATACAAG CAGGCTGAAA GCGGCGCTGT ACTTTTGTCC
AGCATGGGAA ATCCCAATGA GTATCCTGTG TATTTTGACA AAATTCTGTT AAATGCCTCC
CAGGTTACAA ATCCTTCTAT CGACCCGTTG AGAGAGCCTA TGGAAACAAA AGTGTTTTTG
GGCAGCAAAC CGAGGAGTAT ACAAAGAGAT GAAAACGGAA AGCTTGTTAA CAATTTGTCC
TGCGGTATTG AACTTTCCGT ACCCATTATG TTCTCGGCCA TGAGTTATGG CTCCATCAGC
TACAATGCCC ATGAATCTTT GGCAAGAGCG GCAAAAGAAG CGGGAATCCT GTACAACACG
GGAGAAGGTG GACTTCACAG AGACTTATAT CAATACGGTA GCAATACTAT TGTACAAGTG
GCTTCAGGAA GATTTGGAGT GCATAAAGAC TATCTTGAAG CGGGAGCTGC TATAGAAATA
AAAATGGGTC AGGGTGCAAA GCCCGGAATA GGAGGACATC TTCCGGGAAC AAAGATAGTA
GGGGACATAT CCAGAACCAG AATGGTTCCT GAAGGTTCGG ACGCCATTTC TCCGGCCCCG
CACCATGATA TTTATTCGAT TGAGGACTTA AGGCAGCTGG TTTATTCGCT CAAGGAAGCA
ACAAATTACA CAAAACCCGT TATAGTCAAA ATAGCGGCCG TCCACAATGT GGCAGCCATT
GCCAGCGGAA TTGCAAGAAG CGGAGCGGAC ATTATCGCCA TCGACGGATT CCGCGGAGGT
ACCGGAGCTG CTCCCACAAG AATCAGAGAC AATGTGGGAA TTCCTATTGA ACTTGCTCTG
GCAAGTGTTG ACCAAAGACT TAGAGAAGAA GGTATAAGAG ACAATGTATC CATTGTTGTG
GGCGGAAGTA TCAGAAACAG CAGTGATGTT GTAAAAGCAG TTGCATTGGG AGCCGACTGT
GTTTATATCG GAACGGCTGC GTTGATTGCT TTAGGGTGCC ATCTTTGCAG AAGCTGTCAT
ACAGGAAAGT GCAACTGGGG TATTGCAACC CAGGAGCCTG AGTTGGTAAA GCGCCTTAAC
CCCGACATGG GCTATAAGAG ACTGGTTAAT CTTGTGAATG CCTGGAAGCA TGAAATAAAA
GAAATGATGG GCGGAATGGG AATTAATTCT ATAGAAAGCC TTAGAGGAAA CAGGCTGATG
CTAAGAGGAG TAGGACTTAA TGAAAAAGAG CTTCAAATAT TAGGAATTAA ACATGCGGGG
GAATAG
 
Protein sequence
MGINFLYPEY EVVRNMDRCI NCRVCERQCA NEVHSYDQEK KIMCSDDSKC VNCHRCVALC 
PTRALKIVKT DHTFKENANW RGSIINEIYK QAESGAVLLS SMGNPNEYPV YFDKILLNAS
QVTNPSIDPL REPMETKVFL GSKPRSIQRD ENGKLVNNLS CGIELSVPIM FSAMSYGSIS
YNAHESLARA AKEAGILYNT GEGGLHRDLY QYGSNTIVQV ASGRFGVHKD YLEAGAAIEI
KMGQGAKPGI GGHLPGTKIV GDISRTRMVP EGSDAISPAP HHDIYSIEDL RQLVYSLKEA
TNYTKPVIVK IAAVHNVAAI ASGIARSGAD IIAIDGFRGG TGAAPTRIRD NVGIPIELAL
ASVDQRLREE GIRDNVSIVV GGSIRNSSDV VKAVALGADC VYIGTAALIA LGCHLCRSCH
TGKCNWGIAT QEPELVKRLN PDMGYKRLVN LVNAWKHEIK EMMGGMGINS IESLRGNRLM
LRGVGLNEKE LQILGIKHAG E
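
The relationship between the gene sequence and the protein sequence can be checked with a short translation routine. This is a minimal sketch in plain Python, not the tool that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, copied with its display spacing.
first_line = "ATGGGAATCA ATTTTTTGTA TCCGGAATAT GAAGTTGTAA GAAATATGGA CAGATGCATT"
print(translate(first_line))  # MGINFLYPEYEVVRNMDRCI
```

The first 60 bases translate to the first 20 residues of the listed protein. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the GC content field.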