Gene Information |
Locus tag | Moth_0917 |
Symbol | |
ID | 3831306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 952244 |
End bp | 953455 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828848 |
Product | GTP cyclohydrolase II |
Protein accession | YP_429777 |
Protein GI | 83589768 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase; [COG0807] GTP cyclohydrolase II |
TIGRFAM ID | [TIGR00505] GTP cyclohydrolase II; [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase |
|
|
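The coordinate and length fields in the gene information above are mutually consistent, which a minimal Python sketch can check. It assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Moth_0917.
length = gene_length(952244, 953455)
print(length, protein_length(length))  # 1212 403
```

Both results agree with the Gene Length (1212 bp) and Protein Length (403 aa) fields.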
Plasmid Coverage Information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00160336 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGAAG GGATAAGCTT TAATACCATT GAGGAGGCTA TCGCCGAGAT CAAGGCTGGG CGGATGGTGG TCGTCGTCGA TGACGAGGAC CGGGAAAACG AAGGTGACCT GGTCATGGCG GCCGCCAGGG TTACGCCGGA GGCCATAAAC TTTATGGCCA CCCATGGTCG CGGCCTGATC TGTGTGCCCA TGGAAGGTCA ACGCCTGGAC GAGCTGGAAC TGGAACCTAT GGTAAACCAA AATACAGAGT CCATGGGAAC GGCCTTTACC GTCTCGGTAG ATGCAGCCGA GGTGACCACT GGTATCTCGG CCTTTGAGCG GGCCGCAACC ATTAAGCGGC TTATCGATCC CTCGACCCGG CCGGAGGATC TGCGCCGGCC CGGGCACATC TTTCCCCTAA GGGCGAAACC GGACGGGGTC CTGCGCCGGG CCGGGCACAC CGAGGCGGCC GTGGACCTGG CCCGCCTGGC CGGGCTCTAC CCGGCAGGGG TCATCTGCGA AATTATGAAC CCCGACGGTA CCATGGCCCG GGTACCCCAG CTCTATAAAT TCTGCCAGGA ACATGGCCTG AAGTTAATCA CGGTTGCCGA CCTGATCGAG TTTCGTCGCC GGCGGGAAAA ACTGGTACGC CGGGTGGCCG AGGCCGATCT GCCCACCAGG TACGGCCACT TTAAAGCCGT TGCCTATGAA GAGATCATGA ACGGGAAGGG CCACCTGGCC CTGGTGAAGG GCGATATAGC CAAGGGAAGG CCGGTCCTGG TCCGGGTTCA TTCGGAATGC CTGACGGGGG ATGTCTTCGG CTCTGAACGC TGCGATTGTG GCGACCAGTT GCAACGGGCT ATGAAGATGA TTGAGGATGA GGGTGCCGGG GTAATCCTCT ATATGCGCCA GGAAGGCCGG GGCATCGGCC TCCTCAACAA GATCAAGGCC TACAAGCTGC AGGAGGAAGG TAAAGACACA GTGGAGGCTA ATGAGGCCCT GGGCTTCCCG CCCGATTTGC GGGACTACGG CATTGGCGCT CAGATCCTGG CCGACCTGGG GGTCCGCCAG ATCCGCCTCC TGACTAATAA CCCCAAAAAG ATCGCCGGCC TGGAAGGATA TGGCCTCCAG GTTGTCGAAA GGGTACCCAT TGAAATCTGC CCAAACAAGG TTAACCGGCG TTACCTGAAG ACTAAAAAGG AAAAAATGGG CCACCTGCTG CATATCAGCT AG
|
Protein sequence | MNEGISFNTI EEAIAEIKAG RMVVVVDDED RENEGDLVMA AARVTPEAIN FMATHGRGLI CVPMEGQRLD ELELEPMVNQ NTESMGTAFT VSVDAAEVTT GISAFERAAT IKRLIDPSTR PEDLRRPGHI FPLRAKPDGV LRRAGHTEAA VDLARLAGLY PAGVICEIMN PDGTMARVPQ LYKFCQEHGL KLITVADLIE FRRRREKLVR RVAEADLPTR YGHFKAVAYE EIMNGKGHLA LVKGDIAKGR PVLVRVHSEC LTGDVFGSER CDCGDQLQRA MKMIEDEGAG VILYMRQEGR GIGLLNKIKA YKLQEEGKDT VEANEALGFP PDLRDYGIGA QILADLGVRQ IRLLTNNPKK IAGLEGYGLQ VVERVPIEIC PNKVNRRYLK TKKEKMGHLL HIS
|
| |
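The gene and protein sequences above can be compared programmatically. The sketch below strips the 10-character grouping, computes GC content, and translates codons using the amino acid assignments of NCBI translation table 11 (these match the standard code; alternative start codons are not handled here). All function names are hypothetical, and only short fragments of the record are used as examples.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (table 11 assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base grouping) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons; unknown codons become 'X', stops '*'."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(s[i:i + 3], "X") for i in range(0, usable, 3))


# First and last codons of the gene sequence in this record.
print(translate("ATGAATGAAG GGATAAGC"))  # MNEGIS
print(translate("CATATCAGCT AG"))        # HIS*
```

The two fragments reproduce the start (MNEGIS) and end (HIS) of the listed protein sequence. Applied to the full gene sequence, `gc_fraction` would be expected to come out near the 59% reported in the GC content field.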