Gene Moth_0917 details


Gene Information

Locus tag: Moth_0917
Symbol:
ID: 3831306
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 952244
End bp: 953455
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 59%
IMG OID: 637828848
Product: GTP cyclohydrolase II
Protein accession: YP_429777
Protein GI: 83589768
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
        [COG0807] GTP cyclohydrolase II
TIGRFAM ID: [TIGR00505] GTP cyclohydrolase II
            [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase
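
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon. Neither assumption is stated in the record, but both reproduce the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(952244, 953455)
print(length)                     # 1212
print(protein_length_aa(length))  # 403
```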


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00160336
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGAAG GGATAAGCTT TAATACCATT GAGGAGGCTA TCGCCGAGAT CAAGGCTGGG 
CGGATGGTGG TCGTCGTCGA TGACGAGGAC CGGGAAAACG AAGGTGACCT GGTCATGGCG
GCCGCCAGGG TTACGCCGGA GGCCATAAAC TTTATGGCCA CCCATGGTCG CGGCCTGATC
TGTGTGCCCA TGGAAGGTCA ACGCCTGGAC GAGCTGGAAC TGGAACCTAT GGTAAACCAA
AATACAGAGT CCATGGGAAC GGCCTTTACC GTCTCGGTAG ATGCAGCCGA GGTGACCACT
GGTATCTCGG CCTTTGAGCG GGCCGCAACC ATTAAGCGGC TTATCGATCC CTCGACCCGG
CCGGAGGATC TGCGCCGGCC CGGGCACATC TTTCCCCTAA GGGCGAAACC GGACGGGGTC
CTGCGCCGGG CCGGGCACAC CGAGGCGGCC GTGGACCTGG CCCGCCTGGC CGGGCTCTAC
CCGGCAGGGG TCATCTGCGA AATTATGAAC CCCGACGGTA CCATGGCCCG GGTACCCCAG
CTCTATAAAT TCTGCCAGGA ACATGGCCTG AAGTTAATCA CGGTTGCCGA CCTGATCGAG
TTTCGTCGCC GGCGGGAAAA ACTGGTACGC CGGGTGGCCG AGGCCGATCT GCCCACCAGG
TACGGCCACT TTAAAGCCGT TGCCTATGAA GAGATCATGA ACGGGAAGGG CCACCTGGCC
CTGGTGAAGG GCGATATAGC CAAGGGAAGG CCGGTCCTGG TCCGGGTTCA TTCGGAATGC
CTGACGGGGG ATGTCTTCGG CTCTGAACGC TGCGATTGTG GCGACCAGTT GCAACGGGCT
ATGAAGATGA TTGAGGATGA GGGTGCCGGG GTAATCCTCT ATATGCGCCA GGAAGGCCGG
GGCATCGGCC TCCTCAACAA GATCAAGGCC TACAAGCTGC AGGAGGAAGG TAAAGACACA
GTGGAGGCTA ATGAGGCCCT GGGCTTCCCG CCCGATTTGC GGGACTACGG CATTGGCGCT
CAGATCCTGG CCGACCTGGG GGTCCGCCAG ATCCGCCTCC TGACTAATAA CCCCAAAAAG
ATCGCCGGCC TGGAAGGATA TGGCCTCCAG GTTGTCGAAA GGGTACCCAT TGAAATCTGC
CCAAACAAGG TTAACCGGCG TTACCTGAAG ACTAAAAAGG AAAAAATGGG CCACCTGCTG
CATATCAGCT AG
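
The GC content field can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, not necessarily how the database derives its figure: it assumes GC content means the percentage of G and C bases over all bases, and `gc_percent` is a hypothetical helper name. Only the first line of the gene sequence is used here. Pasting in the full sequence gives the whole-gene value.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence from the record (60 bases).
first_line = "ATGAATGAAG GGATAAGCTT TAATACCATT GAGGAGGCTA TCGCCGAGAT CAAGGCTGGG"
print(f"{gc_percent(first_line):.1f}% GC over the first 60 bases")
```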
 
Protein sequence
MNEGISFNTI EEAIAEIKAG RMVVVVDDED RENEGDLVMA AARVTPEAIN FMATHGRGLI 
CVPMEGQRLD ELELEPMVNQ NTESMGTAFT VSVDAAEVTT GISAFERAAT IKRLIDPSTR
PEDLRRPGHI FPLRAKPDGV LRRAGHTEAA VDLARLAGLY PAGVICEIMN PDGTMARVPQ
LYKFCQEHGL KLITVADLIE FRRRREKLVR RVAEADLPTR YGHFKAVAYE EIMNGKGHLA
LVKGDIAKGR PVLVRVHSEC LTGDVFGSER CDCGDQLQRA MKMIEDEGAG VILYMRQEGR
GIGLLNKIKA YKLQEEGKDT VEANEALGFP PDLRDYGIGA QILADLGVRQ IRLLTNNPKK
IAGLEGYGLQ VVERVPIEIC PNKVNRRYLK TKKEKMGHLL HIS
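
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below hard-codes the standard amino acid assignments, which translation table 11 shares with the standard code. Table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last lines of the gene sequence are translated.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGAATGAAG GGATAAGCTT TAATACCATT GAGGAGGCTA TCGCCGAGAT CAAGGCTGGG"
print(translate(first_line))      # MNEGISFNTIEEAIAEIKAG
print(translate("CATATCAGCT AG"))  # HIS
```

Both outputs match the start and end of the protein sequence listed above.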