Gene Moth_1302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1302 
Symbol 
ID3831788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1344863 
End bp1346464 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content61% 
IMG OID637829238 
Productdihydroorotase 
Protein accessionYP_430158 
Protein GI83590149 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3653] N-acyl-D-aspartate/D-glutamate deacylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0050942 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.521882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTCCC TGCTGATTAA GGGCGGCACG GTTGTCGACG GTACCGGTCG CCCTCCTTTT 
CGGGCCGATG TAGGTTTGCA AGGAGCAAAG ATCGCCGCCC TGGGCCTGTT TGATGGCGTA
CCCGCGGGGA AGGTCGTGGA TGCCACCGGC CTGGTAGTTG CACCCGGGTT TATTGACTTT
CACAGCCATG CTGACGCCGA ACTCCTGCGG GACCCGGAGG ATCGGGCCAA GCTGGCCCAG
GGAGTCACTA CCGAAGTCAT CGGTAACTGT GGTATGTCCC TGGTCCCGGG GTCCCAGGAT
ACCCGTCCCC TCCTGGCCGC GTATACCAGC CCGGTGCTGG GGGAGATCCC CCCGGATTTT
CAGGCCACCG GCCTGGCTGA ATACCATTCC CTCTTAAGGC GCCAGGGAAT AGCTGTCAAT
GTTGCCACTC TGGCCGGTCA CGGTTCAATC CGCCTGGCGG TCATGGGTAT GGCCGATCGC
CGTGCCCCAA GGGCTGAGCT GGAGGAGATG TGCTCCCTCC TCAGGCAGGC GATGGCGGAA
GGCGCCTGGG GCCTGTCCAG CGGTCTCCTC TACCCGCCGG GATGTTATGC CCCCACTGGG
GAGTTAATCA CCCTCTGCCG GGTAGTCCGC CAGTACGGTG GTTTTTATGT CAGCCATATT
CGTAATGAAT CCGACGGTGT CCTGGAGGCG GTAGAGGAAG CCCTGGAAAT CGGGCGGGAG
GCCGGTGTTC CCGTCCACAT CTCCCACCTT AAAGCCTGCG GCTCCCGCAA CTGGCCCAAA
ATACCCCGGG CTTTGGCCCT GCTGGACGCT GCCCGGGCTA AAGGCCAGGA CGTAAGCTGG
GACGTCTACC CTTATACTGC CGGTTCCACC ACTGCCGCCT CCCTGCTGCC CCCGTGGGCT
GTTGCCGGAG GCACTGCCGC CCTTCAGGAA CGCCTGCATT CACCGGAGGT TCGCCAGGAA
ATTAAGAAAG CCTGGCAGGA AGGCTTGCCG GGCTGGGACA ACATGGTCAG TTCCCTTGGT
TACGACCATT TAATAATTAA TGCTGTGAGC CACCGGGAAA ATAAGGACTG CGTGGGCCTG
AGCCTGGCGC AAATCGGCCA ACAAAGGAGT CTGGACCCAG GTGACGCCCT GCTGGACCTG
TTACAGAGCG AAGGTGGTAA CCTGGCCATT GAAACCTACC ACGCCTGCGA GGAGACCCTG
GGAATGATCT TGCAGCACCC GGTAACGATC ATCGGCAGCG ATGGGATTTA TTCCGGCGAA
CATGCTCATC CCCGCCTATA CGGCACCTTT GCCAGGGTTT TGGGCCGTTA TGTCCGGGAG
CGAAAACTCC TCTCCCTGGA GGAGGCCATA GCCAAAATGA CTTCCAAACC GGCGGCCAGG
CTCGGCCTCC GGTACCGGGG GCGGGTAACG CCAGGCTATT ATGCCGACCT GATCCTCTTT
GACCAGGAAA CTATTGCCGA CCGGGCTACC TTCCAGGAAC CGGCCCGGAC ACCTTCTGGC
ATCAAGGCTG TTATCGTCAA CGGGCGGGTG GCCTACCAGG AAGGGCGGTT TACCGGCGAA
CGGGCCGGAA TTATTCTCAC CAGTCATACT ACCGGTGTTT AG
 
Protein sequence
MFSLLIKGGT VVDGTGRPPF RADVGLQGAK IAALGLFDGV PAGKVVDATG LVVAPGFIDF 
HSHADAELLR DPEDRAKLAQ GVTTEVIGNC GMSLVPGSQD TRPLLAAYTS PVLGEIPPDF
QATGLAEYHS LLRRQGIAVN VATLAGHGSI RLAVMGMADR RAPRAELEEM CSLLRQAMAE
GAWGLSSGLL YPPGCYAPTG ELITLCRVVR QYGGFYVSHI RNESDGVLEA VEEALEIGRE
AGVPVHISHL KACGSRNWPK IPRALALLDA ARAKGQDVSW DVYPYTAGST TAASLLPPWA
VAGGTAALQE RLHSPEVRQE IKKAWQEGLP GWDNMVSSLG YDHLIINAVS HRENKDCVGL
SLAQIGQQRS LDPGDALLDL LQSEGGNLAI ETYHACEETL GMILQHPVTI IGSDGIYSGE
HAHPRLYGTF ARVLGRYVRE RKLLSLEEAI AKMTSKPAAR LGLRYRGRVT PGYYADLILF
DQETIADRAT FQEPARTPSG IKAVIVNGRV AYQEGRFTGE RAGIILTSHT TGV