Gene Information |
Locus tag | Cthe_3024 |
Symbol | |
ID | 4811096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3546060 |
End bp | 3547970 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108445 |
Product | ech hydrogenase subunit A |
Protein accession | YP_001039413 |
Protein GI | 125975503 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000133717 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAATGCAA TTTTAATATT GATATTGTTT CCGCTGTTGG CATCTGTCAC TGTTTTGTCA GTAAGAAAAG ATGCCATAAG AAATATAATA GTAAGAATTT TCGCCTTTAT TACCGGCATA CTGACATTAT TTGTGGTATG TCGGTATTTT AAGGATGGAA TATCTTTGTC CATTGAAAAC AGGAATATTA TTGACATGAC CATATCTCTG GCGGAGGTCC TTATTGCTGC ATATATAATA TTTACAGGCA TAAAAAACAA AAAGTTCATT GTATCAATTT TTGCAGCTGT TCAAACCGCT CTGATTCTTT GGTTTGAATT TACACAAAAA CACGGTATCA ATGTTCATTC GGACATTGTA TTCGACAGGC TTTCCGCTGT TATGGTCCTC ATTGTGGGAT GTATCGGAAG CCTTATACTG ATATACACTG TCGGATATAT GAAATGGTAT CACATACACC ACGAAGGATA CAAAGAAAGA AAGAGTTTCT TTTTTTCTGT AATTTTTCTC TTTCTCTTTG CAATGTTCGG ATTAATTTTC AGCAACAACC TGATCTGGAT GTATTTTTGC TGGGAACTTA CAACCTTGTG TTCTTACCTT CTTATCGGTT ACACCCGAAC ACCCGAAGCA GTAAACAATT CATTCCATGC ATTGGCAATC AATCTTGGCG GCGGACTTGC GTTTGCGTCG GCAATGGTAT ATATAGGAAC GAACTTTAAA ACTCTCGAGC TTTCGGCATT GACAGCCATG AAACTTGAGC TTGCGGTTCT CATACCGGTT TTCCTTCTTT GTATTGCAGC CCTTACGAAG TCTGCCCAGA TGCCCTTTTC CTCCTGGCTT TTGGGGGCAA TGGTAGCACC GACTCCGTCA TCGGCGCTTT TGCACTCGGC AACAATGGTA AAAGCAGGAG TTTACCTTTT AATAAGACTT GCTCCGCTGC TTGCAGGAAC TACCATAGGA AAAGTAATTG CTCTTTTGGG AGCGGTTACG TTCCTGGCAA GTTCCATCAT CGCAATCTCC AAAAGCGACG CAAAGAAAAT TCTGGCTTAT TCAACCATAT CGAATTTAGG ACTTATAGTT ACCTGCGCAG CCATAGGAAC GCAGGAATCG CTGTGGGCAG CAATACTGCT GTTAATATTC CACTCCATAT CCAAATCCCT TCTGTTCCTG ACCGGAGGCT CAGTAGAGCA CCAGATAGGA AGCCGCAATG TTGAGGATAT GGATATTCTT CTGCAGGTGT CAAGAAGGCT GTCTGTATAT ATGATTGTGG GAATAGCCGG AATGTTCCTT GCCCCCTTTG GAATGCTTAT ATCCAAATGG GTTGCCATGA AGGCATTTAT TGATTCGAAG AATATACTTA CAGTTATCAT TTTGGGATAC GGCAGTGCCA CAACACTGTT CTACTGGACA AAATGGATGG GTAAACTCGT AGCCAATGCC AACAGAAAAG ACCACATAAA GCATACCTTC CACATAGACG AGGAAATTCC TATTTTTATC CATGCAGTCC TTGTGGTATT GTCCTGCTTT ACTTTTCCTC TGGTATCCCG ATATGTACTT GTACCGTATC TTTCAGGTCT GTTTGGTCCG GATGTGCCAA TTCCTATCGG AACAAGTGAT GTAAATATAA TGCTTATAAT GCTAAGTATG CTGTTAATAC TGCCAATAAG CTTTATTCCA ATATATAAAA GCGACCGGCG CAGGATAGTG CCTATTTACA TGGCCGGGGA GAACACCGGC GACAATGAGA GTTTTTATGG TGCTTTTGAT GAAAAACGTA AAGTCGAGCT CCACAACTGG 
TATATGAAAA ACTTTTTCTC TGTGAAAAAA CTAACCTTCT GGAGTAATTT ACTATGTGCC GTTGTGATAT TGGTGGGCGT AGTACTTTTA ATAGGAGGAA TTACCAAATG A
|
Protein sequence | MNAILILILF PLLASVTVLS VRKDAIRNII VRIFAFITGI LTLFVVCRYF KDGISLSIEN RNIIDMTISL AEVLIAAYII FTGIKNKKFI VSIFAAVQTA LILWFEFTQK HGINVHSDIV FDRLSAVMVL IVGCIGSLIL IYTVGYMKWY HIHHEGYKER KSFFFSVIFL FLFAMFGLIF SNNLIWMYFC WELTTLCSYL LIGYTRTPEA VNNSFHALAI NLGGGLAFAS AMVYIGTNFK TLELSALTAM KLELAVLIPV FLLCIAALTK SAQMPFSSWL LGAMVAPTPS SALLHSATMV KAGVYLLIRL APLLAGTTIG KVIALLGAVT FLASSIIAIS KSDAKKILAY STISNLGLIV TCAAIGTQES LWAAILLLIF HSISKSLLFL TGGSVEHQIG SRNVEDMDIL LQVSRRLSVY MIVGIAGMFL APFGMLISKW VAMKAFIDSK NILTVIILGY GSATTLFYWT KWMGKLVANA NRKDHIKHTF HIDEEIPIFI HAVLVVLSCF TFPLVSRYVL VPYLSGLFGP DVPIPIGTSD VNIMLIMLSM LLILPISFIP IYKSDRRRIV PIYMAGENTG DNESFYGAFD EKRKVELHNW YMKNFFSVKK LTFWSNLLCA VVILVGVVLL IGGITK
|
| |
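The length fields in this record can be cross-checked against the coordinates. The sketch below is only an illustration and rests on two assumptions not stated in the record itself: that Start bp and End bp are 1-based inclusive positions, and that the listed gene sequence includes the terminal stop codon (the sequence shown does end in TGA). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields for Cthe_3024.
start_bp, end_bp = 3546060, 3547970

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(f"Gene length: {length_bp} bp")
print(f"Protein length: {length_aa} aa")
```

Under those assumptions the computed values agree with the Gene Length (1911 bp) and Protein Length (636 aa) fields. The strand does not affect the length arithmetic.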