Gene Cthe_2431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2431 
Symbol 
ID4808147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2904969 
End bp2905934 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content44% 
IMG OID640107845 
Productelectron transport complex, RnfABCDGE type, D subunit 
Protein accessionYP_001038826 
Protein GI125974916 
COG category[C] Energy production and conversion 
COG ID[COG4658] Predicted NADH:ubiquinone oxidoreductase, subunit RnfD 
TIGRFAM ID[TIGR01946] electron transport complex, RnfABCDGE type, D subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000125331 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGAAAGAA GTTTTATAGT ATCATCATCG CCTCATATAA GGGACAATAT AAGTACAAGG 
CGGATAATGC TGGATGTGAT TATTGCCCTT ATTCCGGCAT CTTTGGCAGG AGTCTACTTT
TTCGGTCCCA GAACGCTGCT GGTAATTTTA GTAAGCATTC TGGCCTGCGT GTTGTCAGAA
TATCTCTCAG GTAAGCTGAT GAAAAGAAGC AACACAATTT CAGATTTGAG TGCGGTGGTT
ACAGGACTTA TTTTGGCATT AAACCTTCCT CCCACAGTAC CTCTGTGGAT GGTTGTGGTA
GGAGCGGTTG TGGCAATAGT TGTCATAAAA CAGCTGTTTG GAGGAATGGG ACAAAATTTC
ATCAATCCGG CATTGGGAGC AAGAGTGTTT TTATTTATAT CCTATGCAAA TCGCATGACC
AATTGGGTAA TACCGGGTAC TGACGCAGTG TCTTCGGCAA CTCCCCTTGG GTTGCTTAAG
GCCGAAGATG CCGCACAAGT TGTCCTTCCA TCCTACAAGG ACCTTTTCTT TGGCAACATT
GGAGGATGTA TAGGTGAAGT TTCTGCAGCC GCCCTTTTGA TAGGTGGAAT ATACCTTGTG
GCAAGAAAGG TTATAAGCCC GGAAATACCT TTGACATACA TCGGAACCTT GGGATTGTTT
ACATGGATAT TCGGAGGACC AACACTGTTT AGCGGGGACT TTGTATACCA CATACTTTCA
GGTGGCCTGT TGCTGGGCGC AATTTATATG GCTACGGATT ACACCACTTC GCCCATGACC
ACCAAGGGAC GGATAATTAT GGGTATAGGA TGCGGACTTC TTACCGGAAT TATACGTCTG
TATACCAACT ATCCGGAAGG AGCGTCTTTT GCAATCCTTA TAATGAATGT CATGGTTCCG
TTGATTGACA GATATACCGT TCCAAAAAGT TTTGGAGGTG GAAAAGCCGT TGAAAGATAT
AGTTAA
 
Protein sequence
MERSFIVSSS PHIRDNISTR RIMLDVIIAL IPASLAGVYF FGPRTLLVIL VSILACVLSE 
YLSGKLMKRS NTISDLSAVV TGLILALNLP PTVPLWMVVV GAVVAIVVIK QLFGGMGQNF
INPALGARVF LFISYANRMT NWVIPGTDAV SSATPLGLLK AEDAAQVVLP SYKDLFFGNI
GGCIGEVSAA ALLIGGIYLV ARKVISPEIP LTYIGTLGLF TWIFGGPTLF SGDFVYHILS
GGLLLGAIYM ATDYTTSPMT TKGRIIMGIG CGLLTGIIRL YTNYPEGASF AILIMNVMVP
LIDRYTVPKS FGGGKAVERY S