Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2277 |
Symbol | |
ID | 3831388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2384894 |
End bp | 2386789 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637830197 |
Product | thiamine pyrophosphate enzyme |
Protein accession | YP_431107 |
Protein GI | 83591098 |
COG category | [C] Energy production and conversion |
COG ID | [COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits |
TIGRFAM ID | [TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00118672 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000449131 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGGCCCGGA TTACGGCGGG CCTGGAGTAC GGGAGAGGTG AAATCGTGCG GGAACTGTTA ATCGGCAACC ATGCCCTGGC CCGGGGGGCC TGGGAAGCCG GGGTCCGGGT GGCCGCCGCT TACCCGGGGA CGCCGAGTAC AGAGATTATA GAAGCCCTGG CCCGCTATCC GGAGGTATAC GCGGAATGGG CGCCCAACGA AAAGGTGGCC CTGGAGGTAG CCATTGGTGC GGCCATAGGC GGCGCGCGCT CCCTGGCCGC TATGAAACAT GTGGGTGTTA ACGTCGCCGC CGATCCCCTG ATGACCCTGG CCTATACCGG CGTCAATGCC GGGCTGGTCC TGGTTTCGGC CGACGACCCC GGCCTTTTTA GCTCCCAGAA CGAGCAGGAT AACCGCTTTT ACGCCCGCAT GGCCCAGATA CCCTGCTTGG AGCCTGCCGA CAGCCAGGAA GTTAAGGATA TGGTCATGCA GGCTTTTAAT TTAAGCGAGG AATTTGATAC CCCGGTAATT TTACGCCTGA CCACCCGGAT TGCCCATTCT TACAGCCTGG TGGAACTTGG CGACCGCCAG GAAGTGCCGC TTAAAAGTTA TGTCAAACAG CCCGCCAAGT ATGTCATGCT ACCGGCCTTT GGCAAGGCCC GCCATGTAGT GGTGGAAGAA CGGCGTCTCA AGCTGGCGGC CTATGCGGAA ACTGCCCCCT TCAACCGGGT GGAATGGGCC GACCGGCGGG TGGGCATCAT TACCTCCGGT ATTGCCTACC AGTACGTTAA AGAGGCCCTG CCCGGGGTTT CGGTATTGAA GCTGGGGTTA ACTTATCCTC TGCCGGAGAA GCTGATTTCC GATTTTGTAA AGGCAGTAAA AACTTGCTAT GTGGTAGAAG AACTGGAACC CTTCCTGGAA GATCAGATCC GCGCCTGGGG ACTGGGTGTA GTAGGCAAGG AATTGGTGCC AAGGGTAGAT GAATTGAGTA GCGCCATTGT CGCCCGTACG GTGGGCTCGC AGGTGGCGGC CGTTGCCCCG GAGCTGGTTG CTCCTGATCT GACTGCGGTG GCCTTGCCCG GTATGAGGAC CCCGGGCGGT CAAAGGGCAG GCGAAGGATC GGTCCCGGAT GCGAGGGAAA CGACGACTGC CGCGCCGGCT GAACTCCCCG GTCGTCCACC CCTCATGTGC CCCGGCTGCC CCCACCGCGG CGTCTTTTAC GTCCTGAAAA AGCTCCGGCT GGTGGTGGCC GGTGACATCG GCTGCTATAC CCTGGGGGCT ACACCGCCTC TCCAAGCCAT GGATAGCTGT ATTTGCATGG GGGCCAGCCT GGGGGTGGCC ATGGGGCTGG AGAAGGCCCG CGGTGCGGAC TTCGCCCGGC GGGTGGTAGG GGTTATTGGC GATTCAACTT TCCTCCACTC GGGAATGACC GGACTCCTGG ACATGGTCTA CAATGGCGGC ACCGGGACCT TGATTATCCT GGATAATAGC ACCACGGCCA TGACCGGCCA CCAGGACCAT CCCGGCACGG GTTATACTGC TTCCCATCAG CCGGCCCCTA AGGTTGACCT GGAACAGATA GCCCGCGCCC TGGGGGTGCA CCGGGTACAG GTGGTTGATA GTTATAATCT AGAGACCCTC GAGAGGGCGA TCCAGGAAGA AACGGCCGCC AGGGAACCAT CGGTAATCAT TGCCAGGCGG CCCTGCGCCC TCTTGAAGAA AGAAAAAGAA GCCGTTTACG CTGTAAGCCC CGATAACTGC CTGAGCTGCC GTTATTGCCT GGACCTGGGC TGTCCCGCCA TTTCCTTTAG CGACGGGCAC GGAGTGATTG ACCCGGTACT GTGCAACGGC TGCGGCCTTT GTACCCAGGT CTGCCCTGGT GAGGCCATCA GGAAGGCTGG TGAAGAAGAT GAGTAA
|
Protein sequence | MARITAGLEY GRGEIVRELL IGNHALARGA WEAGVRVAAA YPGTPSTEII EALARYPEVY AEWAPNEKVA LEVAIGAAIG GARSLAAMKH VGVNVAADPL MTLAYTGVNA GLVLVSADDP GLFSSQNEQD NRFYARMAQI PCLEPADSQE VKDMVMQAFN LSEEFDTPVI LRLTTRIAHS YSLVELGDRQ EVPLKSYVKQ PAKYVMLPAF GKARHVVVEE RRLKLAAYAE TAPFNRVEWA DRRVGIITSG IAYQYVKEAL PGVSVLKLGL TYPLPEKLIS DFVKAVKTCY VVEELEPFLE DQIRAWGLGV VGKELVPRVD ELSSAIVART VGSQVAAVAP ELVAPDLTAV ALPGMRTPGG QRAGEGSVPD ARETTTAAPA ELPGRPPLMC PGCPHRGVFY VLKKLRLVVA GDIGCYTLGA TPPLQAMDSC ICMGASLGVA MGLEKARGAD FARRVVGVIG DSTFLHSGMT GLLDMVYNGG TGTLIILDNS TTAMTGHQDH PGTGYTASHQ PAPKVDLEQI ARALGVHRVQ VVDSYNLETL ERAIQEETAA REPSVIIARR PCALLKKEKE AVYAVSPDNC LSCRYCLDLG CPAISFSDGH GVIDPVLCNG CGLCTQVCPG EAIRKAGEED E
|
| |