Gene Cthe_3004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3004 
Symbol 
ID4811152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3526995 
End bp3528476 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content47% 
IMG OID640108425 
Productferredoxin 
Protein accessionYP_001039393 
Protein GI125975483 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAC TAGAAAATCA TAATCGCATC AAAGTAACCG TCAACGGACG CGAAATCGAG 
GTATATGACA ACCTGACCAT CCTCCAGGCA TTGCTTCAGG AGGATATACA CATTCCGCAC
CTGTGCTATG ATATCAGGCT TGAAAGGTCA AACGGCAATT GCGGATTGTG TGTGGTTACC
TTAATCTCCC CTGACGGTGA GAGAGACGTC AAAGCATGTC AGACTCCCAT TAAGGAAGGT
ATGGTAATAT GCACCAACAC TCCAAAACTT GAGAATTATA GAAAAATACG TCTGGAGCAG
CTTCTTTCCG ACCACAACGC AGATTGTGTC GCTCCCTGTG TAATGACATG TCCTGCAAAT
ATTGACATCC AATCATACCT GCGCCATGTA GGCAACGGAG ATTTTGAGGC TGCAATCCGT
GTCATTAAAG AGAGGAATCC TTTCCCGATA GTATGCGGCC GCGTATGTCC TCATACCTGT
GAATCTCAAT GCCGCCGCAA CCTTGTAGAC GCGCCCGTGG CAATAAACTA TGTCAAACGT
TTTGCCGCCG ACTGGGACAT GGCACGGCCT GAGCCATGGA CTCCTGAAAA GAAGCCTCCT
ACAGGCAAAA AAATTGCCAT AGTCGGAGCA GGTCCTTCCG GTCTTTCCGC TGCATATTAC
AGTGCCATCA AGGGTCATGA CGTAACTGTT TTTGAACGTC AGCCTCATCC CGGAGGTATG
ATGAGGTACG GTATCCCTGA ATATCGTCTT CCCAAGGCTA TTCTTGACAA GGAAATCGAG
ATGATAAAAA AACTCGGAGT TAAAATTATG ACAGAAAAGG CTTTGGGAAT CCATATCCGT
CTCGAAGACC TCAGCAAGGA TTTTGATGCC GTTTACCTTG CAATCGGTTC ATGGCAGGCA
ACTCCGATGC ATATTGAAGG TGAAAAACTT GACGGCGTAT GGGCAGGTAT AAACTATCTT
GAACAAGTGG CAAAAAATGT TGATATTCCG TTGGGTGACA ATGTTGTGGT AATCGGAGGC
GGAAACACGG CCATCGACTG CGCCCGTACC GCTCTCAGGA AAGGTGCCAA ATCCGTAAAA
CTCGTATACC GCTGTACCCG TGAAGAAATG CCTGCGGCAC CCTACGAGGT GGAAGAAGCC
ATCCACGAAG GAGTTGAAAT GATTTTCCTG ATGGCGCCCA CAAAGATTAT TGTAAAAGAC
GGCAAAAAGA AACTCGTTTG TATCCGTATG CAGCTTGGAG AGCCTGACCG TTCCGGTCGT
CGCCGTCCGG TTCCCATTGA GGGAAGCGAA GTTGAAATTG ACGCCGACAC AATAATCGGT
GCCATAGGTC AAAGCACCAA CACCCAGTTC CTTTACAACG ACCTTCCGGT AAAACTTAAC
AAATGGGGAG ATATAGAAGT AAACGGTAAA ACCTTGCAGA CTTCTGAATA CAACATATTT
GCCGGCGGTG ACTGCGTAAC CGGTCCTGCA ACGGTAATTT AG
 
Protein sequence
MKTLENHNRI KVTVNGREIE VYDNLTILQA LLQEDIHIPH LCYDIRLERS NGNCGLCVVT 
LISPDGERDV KACQTPIKEG MVICTNTPKL ENYRKIRLEQ LLSDHNADCV APCVMTCPAN
IDIQSYLRHV GNGDFEAAIR VIKERNPFPI VCGRVCPHTC ESQCRRNLVD APVAINYVKR
FAADWDMARP EPWTPEKKPP TGKKIAIVGA GPSGLSAAYY SAIKGHDVTV FERQPHPGGM
MRYGIPEYRL PKAILDKEIE MIKKLGVKIM TEKALGIHIR LEDLSKDFDA VYLAIGSWQA
TPMHIEGEKL DGVWAGINYL EQVAKNVDIP LGDNVVVIGG GNTAIDCART ALRKGAKSVK
LVYRCTREEM PAAPYEVEEA IHEGVEMIFL MAPTKIIVKD GKKKLVCIRM QLGEPDRSGR
RRPVPIEGSE VEIDADTIIG AIGQSTNTQF LYNDLPVKLN KWGDIEVNGK TLQTSEYNIF
AGGDCVTGPA TVI