Gene Moth_0880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0880 
Symbol 
ID3831518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp908922 
End bp910217 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content63% 
IMG OID637828810 
Productdihydroorotase 
Protein accessionYP_429740 
Protein GI83589731 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATTT TAATAAAAGG CGGGCGGGTT ATTGACCCGG CTCGAAACCT GGATGGACGG 
CAGGACATAC TCATCGAAGG CGAGAAAATA ACTACCTTGG CAGCCAATCT CGAGGCCCCG
GCCGGAGCGC GGGTCATCGA CGCGGGAGGC ATGATTGTCA CCCCCGGCCT TATTGATATG
CATGTGCACC TGCGCGAACC GGGCTACGAG CAGAAGGAGA CCATCGCCAG CGGCACCCGG
GCGGCGGCTG CCGGCGGCTT TACGGCCGTG GCCTGCATGG CCAATACCAA CCCGGTGGCC
GACAGCGCCA GTGTTATCTA CTTTATCAAA GAAAAGGCTC GGCAGGAGGG GGTAGTCCGG
GTTTACCCGG TGGGCGCCCT TTCCAAAGGC CTGGAAGGTA AAGAAATCGC CGAGATCGGC
GACCTGGCGG CAGCCGGGGC GGTAGCCATC TCCGACGACG GCCGCCCGGT CATGAACGCC
CTGGTCATGC GCCATGCCCT GGAGTACGCC AAAATGTTCA ACCTGCCGGT AATCAGCCAC
TGCGAAGACG AAGCCCTGGC CAACGACGGC CTGATGCATG AAGGCCTGGT GGCCACCATC
CTGGGCCTCA GGGGCATCCC GGCGGCAGCC GAGGAGGTCA TGGTGGCCCG GGATCTCATC
CTGGCGGAAT TGACCGGGGG AAGGCTGCAC CTGGCCCATG TCAGCACGGC CGGGTCCGTC
CGCCTCCTTA AGGAGGCCCG GGCCCGGGGG GTCAGGGTAA CGGCCGAAGC CACGCCCCAC
CACCTCTGCC TAACGGACAT GCTGGTCCAG AGTTACGATA CCAGCACTAA AGTTAACCCG
CCCCTGCGAC CGGCCGGCGA TGTGGCGGCA GTGGCGGCGG CCCTGGCGGC CGGCGACATC
GACGTCATTG CCTCCGATCA CGCCCCCCAC GCCGACGAGG ATAAGGACGT GGAATACGAT
TATGCACCCT TCGGCATGGT CGGCCTGGAA ACAGCCGTGC CCCTGGTGGT GACGGAACTG
ATCCTACCCG GCAAATTAAC CTGGCAACAG GCCATCAAGT CCTGGACGGC AAACCCGGCC
CGGATTCTCA ACATACCCGG CGGCAGCCTG GTCCCGGGCG GGGTGGCCGA CGTGACCATA
ATCGACCCCG ACATGGAGAA GGAAGTCGAT GTCAACGAGT TCTATTCCCG AGGCCACAAC
TCGCCCCTGC AGGGCCGGAA GCTCAAAGGC TGGCCGGTAT TGACCATTGT AGGCGGCCGG
GTAGTGATGG AGAATGGGAA GATCATTGAG GAATGA
 
Protein sequence
MAILIKGGRV IDPARNLDGR QDILIEGEKI TTLAANLEAP AGARVIDAGG MIVTPGLIDM 
HVHLREPGYE QKETIASGTR AAAAGGFTAV ACMANTNPVA DSASVIYFIK EKARQEGVVR
VYPVGALSKG LEGKEIAEIG DLAAAGAVAI SDDGRPVMNA LVMRHALEYA KMFNLPVISH
CEDEALANDG LMHEGLVATI LGLRGIPAAA EEVMVARDLI LAELTGGRLH LAHVSTAGSV
RLLKEARARG VRVTAEATPH HLCLTDMLVQ SYDTSTKVNP PLRPAGDVAA VAAALAAGDI
DVIASDHAPH ADEDKDVEYD YAPFGMVGLE TAVPLVVTEL ILPGKLTWQQ AIKSWTANPA
RILNIPGGSL VPGGVADVTI IDPDMEKEVD VNEFYSRGHN SPLQGRKLKG WPVLTIVGGR
VVMENGKIIE E