Gene Cthe_3024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3024 
Symbol 
ID4811096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3546060 
End bp3547970 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content41% 
IMG OID640108445 
Productech hydrogenase subunit A 
Protein accessionYP_001039413 
Protein GI125975503 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000133717 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCAA TTTTAATATT GATATTGTTT CCGCTGTTGG CATCTGTCAC TGTTTTGTCA 
GTAAGAAAAG ATGCCATAAG AAATATAATA GTAAGAATTT TCGCCTTTAT TACCGGCATA
CTGACATTAT TTGTGGTATG TCGGTATTTT AAGGATGGAA TATCTTTGTC CATTGAAAAC
AGGAATATTA TTGACATGAC CATATCTCTG GCGGAGGTCC TTATTGCTGC ATATATAATA
TTTACAGGCA TAAAAAACAA AAAGTTCATT GTATCAATTT TTGCAGCTGT TCAAACCGCT
CTGATTCTTT GGTTTGAATT TACACAAAAA CACGGTATCA ATGTTCATTC GGACATTGTA
TTCGACAGGC TTTCCGCTGT TATGGTCCTC ATTGTGGGAT GTATCGGAAG CCTTATACTG
ATATACACTG TCGGATATAT GAAATGGTAT CACATACACC ACGAAGGATA CAAAGAAAGA
AAGAGTTTCT TTTTTTCTGT AATTTTTCTC TTTCTCTTTG CAATGTTCGG ATTAATTTTC
AGCAACAACC TGATCTGGAT GTATTTTTGC TGGGAACTTA CAACCTTGTG TTCTTACCTT
CTTATCGGTT ACACCCGAAC ACCCGAAGCA GTAAACAATT CATTCCATGC ATTGGCAATC
AATCTTGGCG GCGGACTTGC GTTTGCGTCG GCAATGGTAT ATATAGGAAC GAACTTTAAA
ACTCTCGAGC TTTCGGCATT GACAGCCATG AAACTTGAGC TTGCGGTTCT CATACCGGTT
TTCCTTCTTT GTATTGCAGC CCTTACGAAG TCTGCCCAGA TGCCCTTTTC CTCCTGGCTT
TTGGGGGCAA TGGTAGCACC GACTCCGTCA TCGGCGCTTT TGCACTCGGC AACAATGGTA
AAAGCAGGAG TTTACCTTTT AATAAGACTT GCTCCGCTGC TTGCAGGAAC TACCATAGGA
AAAGTAATTG CTCTTTTGGG AGCGGTTACG TTCCTGGCAA GTTCCATCAT CGCAATCTCC
AAAAGCGACG CAAAGAAAAT TCTGGCTTAT TCAACCATAT CGAATTTAGG ACTTATAGTT
ACCTGCGCAG CCATAGGAAC GCAGGAATCG CTGTGGGCAG CAATACTGCT GTTAATATTC
CACTCCATAT CCAAATCCCT TCTGTTCCTG ACCGGAGGCT CAGTAGAGCA CCAGATAGGA
AGCCGCAATG TTGAGGATAT GGATATTCTT CTGCAGGTGT CAAGAAGGCT GTCTGTATAT
ATGATTGTGG GAATAGCCGG AATGTTCCTT GCCCCCTTTG GAATGCTTAT ATCCAAATGG
GTTGCCATGA AGGCATTTAT TGATTCGAAG AATATACTTA CAGTTATCAT TTTGGGATAC
GGCAGTGCCA CAACACTGTT CTACTGGACA AAATGGATGG GTAAACTCGT AGCCAATGCC
AACAGAAAAG ACCACATAAA GCATACCTTC CACATAGACG AGGAAATTCC TATTTTTATC
CATGCAGTCC TTGTGGTATT GTCCTGCTTT ACTTTTCCTC TGGTATCCCG ATATGTACTT
GTACCGTATC TTTCAGGTCT GTTTGGTCCG GATGTGCCAA TTCCTATCGG AACAAGTGAT
GTAAATATAA TGCTTATAAT GCTAAGTATG CTGTTAATAC TGCCAATAAG CTTTATTCCA
ATATATAAAA GCGACCGGCG CAGGATAGTG CCTATTTACA TGGCCGGGGA GAACACCGGC
GACAATGAGA GTTTTTATGG TGCTTTTGAT GAAAAACGTA AAGTCGAGCT CCACAACTGG
TATATGAAAA ACTTTTTCTC TGTGAAAAAA CTAACCTTCT GGAGTAATTT ACTATGTGCC
GTTGTGATAT TGGTGGGCGT AGTACTTTTA ATAGGAGGAA TTACCAAATG A
 
Protein sequence
MNAILILILF PLLASVTVLS VRKDAIRNII VRIFAFITGI LTLFVVCRYF KDGISLSIEN 
RNIIDMTISL AEVLIAAYII FTGIKNKKFI VSIFAAVQTA LILWFEFTQK HGINVHSDIV
FDRLSAVMVL IVGCIGSLIL IYTVGYMKWY HIHHEGYKER KSFFFSVIFL FLFAMFGLIF
SNNLIWMYFC WELTTLCSYL LIGYTRTPEA VNNSFHALAI NLGGGLAFAS AMVYIGTNFK
TLELSALTAM KLELAVLIPV FLLCIAALTK SAQMPFSSWL LGAMVAPTPS SALLHSATMV
KAGVYLLIRL APLLAGTTIG KVIALLGAVT FLASSIIAIS KSDAKKILAY STISNLGLIV
TCAAIGTQES LWAAILLLIF HSISKSLLFL TGGSVEHQIG SRNVEDMDIL LQVSRRLSVY
MIVGIAGMFL APFGMLISKW VAMKAFIDSK NILTVIILGY GSATTLFYWT KWMGKLVANA
NRKDHIKHTF HIDEEIPIFI HAVLVVLSCF TFPLVSRYVL VPYLSGLFGP DVPIPIGTSD
VNIMLIMLSM LLILPISFIP IYKSDRRRIV PIYMAGENTG DNESFYGAFD EKRKVELHNW
YMKNFFSVKK LTFWSNLLCA VVILVGVVLL IGGITK