Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1401 |
Symbol | |
ID | 3831688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1448822 |
End bp | 1450120 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637829337 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_430257 |
Protein GI | 83590248 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAAC TGGAAGCAGC CCAGGTGGGC CAGGTGACGC GGGCGATGGA ACAGGTGGCG GCCCGGGAGA AGGTGCGTGT AGAGGACCTA ATGGCAGAGG TAGCCGCCGG CCGGGTGGTG ATACCGGTCA ATAAGAATCA CCATAAACTC CAGCCATGCG GTATTGGCAG GGGGTTAAGG ACCAAGGTTA ACGCCAACCT GGGTACCTCC ACGGACTACC CGGATATCGC CGCCGAGTTG GAGAAGCTCC AGGTGGCCCT GGACGCCGGG GCTGATGCCG TCATGGATCT AAGCACCGGC GGTGACATTA ACGAATGCCG GCGCCAGGTC ATTGCCCGCT CGCCGGCGAC CGTCGGGACT GTGCCCATTT ACCAGGCTAC GGTGGAGGCC CAGGAGAAAT ACGGCGCTCT GGTAAAAATG ACCGTTGACG ACCTCTTCCG GGTTATCGAA ATGCAGGCTG AAGACGGTGT TGATTTTATT ACCGTTCACT GCGGTGTCAC CATGGAGGTA GTTGAGCGCC TGCGCCGCGA GGGCCGCCTG GCGGATATCG TCAGTCGGGG CGGATCTTTC CTGACAGGCT GGATGCTCCA TAATGAACAG GAGAATCCCC TCTACGCCCA TTACGACCGC CTGCTGGAGA TCGCCCGGCG CTATGATGTC ACCTTAAGCC TGGGCGACGG CCTGCGGCCG GGTTGCCTGG CTGACGCCAC CGACCGGGCC CAGATCCAGG AGTTGATTAT CCTGGGAGAG CTGGTGGATC GCGCCCGGGA AGCCGGTGTC CAGGCCATGG TGGAAGGACC CGGACACGTA CCCTTAAACC AGATCCAGGC TAATATCCTC CTGGAGAAAC GCCTTTGCCA CGAAGCGCCC TTCTACGTCC TGGGACCCCT GGTCACTGAC GTCGCGCCGG GATACGATCA CCTTACTGCC GCCATCGGCG GCGCCCTGGC GGCTGCTGCC GGGGCCGATT TTATCTGCTA CGTTACCCCG GCCGAACATC TGGGCCTGCC CACCCTGGCC GATGTGCGGG AAGGAGTGAT CGCCGCCCGC ATTGCCGGCC ATGCCGCCGA CCTGGCCAAA GGCCTTCCCG GGGCCTGGGA ATGGGACCGG GAGATGGCCC GCGCCCGCAA GGCCCTGGAC TGGCAGCGCC AAATAGAGCT GGCCCTGGAC CCGGAAAAGG CCAGGCAGTA CCGCCGGGCC CGCAACGACG AGGGGGCCGT TGCCTGCTCT ATGTGCGGTG ACTTCTGCGC CATGCGCCTC GTCGGAGAGT ACCTGGGGAA ACCGTCAGAA ACGTGTTAA
|
Protein sequence | MTQLEAAQVG QVTRAMEQVA AREKVRVEDL MAEVAAGRVV IPVNKNHHKL QPCGIGRGLR TKVNANLGTS TDYPDIAAEL EKLQVALDAG ADAVMDLSTG GDINECRRQV IARSPATVGT VPIYQATVEA QEKYGALVKM TVDDLFRVIE MQAEDGVDFI TVHCGVTMEV VERLRREGRL ADIVSRGGSF LTGWMLHNEQ ENPLYAHYDR LLEIARRYDV TLSLGDGLRP GCLADATDRA QIQELIILGE LVDRAREAGV QAMVEGPGHV PLNQIQANIL LEKRLCHEAP FYVLGPLVTD VAPGYDHLTA AIGGALAAAA GADFICYVTP AEHLGLPTLA DVREGVIAAR IAGHAADLAK GLPGAWEWDR EMARARKALD WQRQIELALD PEKARQYRRA RNDEGAVACS MCGDFCAMRL VGEYLGKPSE TC
|
| |