Gene Moth_1997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1997 
Symbol 
ID3832330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2080835 
End bp2082199 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content62% 
IMG OID637829926 
Productdihydropyrimidinase 
Protein accessionYP_430836 
Protein GI83590827 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type
[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTCT TGCTGAAGGG CGGCTGGGTT GTTACCCAGG AGCGGGTGGA GCAGGCCGAT 
ATAGCCGTCG AAGGGGAGAA AATTGCCGCC ATAGGCCCGG ACCTGGAGGC CCGGGCGGCT
GCCGTCAGGG ATGTCACCGG GAAATACATC CTGCCCGGCG CCATCGACGC CCACGTCCAC
TACCAGATGC CCATCGGCGA GCTCCTGACC GCCGACGACT GGTTCACCGG TACCAGACTG
GCAGCCTGCG GCGGGGTTAC CACCGTCATT GATTATGCCG AGCCCGCCGG CCCGGCCGAA
CCCTTGACGG AAGCCCTGGC CAAACGCCTG GAGGAAGCCC GGGAGCAGGC GTGCGTTGAC
TATGGTCTGC ACCAGGTGGT GCTGCCCGGC CAGGAAGAAG ACACCGGTGA GCTGGCAAAG
GTAATAGAGC AGGGAGTCAC CAGCTTTAAA GTCTTTACCA CCTACAAGCA GTGCCTGGGT
TACGCGGCCA TCGGCCGCCT GCTTAAGCAG GCCCGGCGGT TGGGAGCCCT GGTGACGGTT
CACTGTGAGG ATCATGATCT GGTAACGGCC AGGCGGAGGG AACTGGAGGC TACCGGCCAG
ACAGACCCGG CCTACCACGC CAACAGCCGG CCCGCAGCGG CCGAAGTAAA GGCCATAGAA
AAGGTTATCC GCCAGGCAGC CGCGGCCGGG GCGCCAGTTT ATATCGTCCA TGTCTCCACC
GGCGGGGGGG CGGAATTAAT TGCCGCCGCC CGGGCCCGAG GACAGCAGGT CTTTGGTGAA
ACCTGCCCCC ACTACCTCTT ACTAACAGAG GAGAGGTATG CCGGTCCGGA CAGCCGCCTC
TTCCTGATGT GCCCACCCCT GAGGACGGTA AAAGATAACC GGATTCTCTG GCAGCACCTG
GCCAGTGGTG ATCTCCAGGT GGTAGCGACT GACCATTGCA GCTACAGCCC GGAACAGAAG
GCTGCCGGAA CGGCTTTTTA TAACACCCCC TCGGGCGTAC CGGGGACGGA GACCCTTTTA
CCCCTACTTT ATTCTTATGG TGTACGCCAG GGGCGGCTGA CCCTGCCGCA AATGGTCCGG
GTGCTGGCCA CCAACCCGGC CCGCCTTTTC GGTCTTTACC CGCGTAAAGG TTGCCTGGCG
CCGGGCAGCG ATGCCGACCT GGTGGTCTTC GACCCCAGCC AGGAGGTTAT ACTCAAGGCT
TCTGACCTGC ATTCTGCCGC AGCTTATACC ATCTTTGAGG GCTTTGCTCT CCAGGGGTAC
GTGGAAGCAA CCTATCTACG GGGTCGGCTT ATTTATGACC TGGGCCGTTT CCTGGGCCGG
GCCGGTCAGG GAGAGTTTAT CCCTGGAAAA ATTACCGTCC TGTAA
 
Protein sequence
MDLLLKGGWV VTQERVEQAD IAVEGEKIAA IGPDLEARAA AVRDVTGKYI LPGAIDAHVH 
YQMPIGELLT ADDWFTGTRL AACGGVTTVI DYAEPAGPAE PLTEALAKRL EEAREQACVD
YGLHQVVLPG QEEDTGELAK VIEQGVTSFK VFTTYKQCLG YAAIGRLLKQ ARRLGALVTV
HCEDHDLVTA RRRELEATGQ TDPAYHANSR PAAAEVKAIE KVIRQAAAAG APVYIVHVST
GGGAELIAAA RARGQQVFGE TCPHYLLLTE ERYAGPDSRL FLMCPPLRTV KDNRILWQHL
ASGDLQVVAT DHCSYSPEQK AAGTAFYNTP SGVPGTETLL PLLYSYGVRQ GRLTLPQMVR
VLATNPARLF GLYPRKGCLA PGSDADLVVF DPSQEVILKA SDLHSAAAYT IFEGFALQGY
VEATYLRGRL IYDLGRFLGR AGQGEFIPGK ITVL