Gene Mpal_1734 details


Gene Information

Locus tag: Mpal_1734
Symbol:
ID: 7271298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1805027
End bp: 1806295
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 63%
IMG OID: 643570348
Product: dihydroorotase, multifunctional complex type
Protein accession: YP_002466764
Protein GI: 219852332
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
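
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name cds_lengths is hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1805027
END_BP = 1806295


def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, codon_count, protein_length_aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final codon
    is a stop codon that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return gene_length, codons, codons - 1


gene_length, codons, protein_length = cds_lengths(START_BP, END_BP)
print(gene_length, codons, protein_length)  # prints: 1269 423 422
```

Under these assumptions the computed values agree with the listed Gene Length (1269 bp) and Protein Length (422 aa).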


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.409452
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTAGATC TCCTCCTATT AAACCTGACC CTTCCGGACG GAAGGGTTGT CGACCTGCAG 
GTCCGGGACG GGATCGTCGT GCATGCAGGT GCCGGAGCTC CGGCTCATCA GACGCTCGAT
TGCAGGGGAC TGCTCGTCCT CCCGGCCGCG ATCGATATGC ATGTTCATAT GCGGGGCGGC
ACTCAGTCCG TCAAGGAGGA CTGGACCACC GGTTCGCAGA GCGCACTGGC CGGCGGGGTG
ACGGTGGTGG TCGACCAGCC GAACACTGTC CCGCCGATCA CCAACCGGGA ACATTTCAAA
GTCAGGGTCG CCGATGCCAC CGCCCATTCG TACTGCGGGT TCGGGGTGAA CGGGGCCGTG
ACCCGGGATG CGAGAATTGC GGACCTCTGG CAGGGCGGGG CGCTGGCGTT CGGCGAAGTC
TTCATCGCTC CGTCCAGTTA CGGGGAGGCC CTGACACTGG AGGTGCAGCA GCGCACCTTT
GCTGAGATCC ATCGGCTGGG GGGGCTCGTC ACCGTTCATG CTGAGGAGGT CTCCGGTACC
GCGCCGGTCG GGCTCCGCCA GCACAGTCTG CAGCGATCGC CGGCAGGGGA AGAGCGAGCT
GTACAGGCCC TGCGGGCATC GTGCGCACCC GGTCAGCGGG TCCACTGCTG CCACATGAGC
ACGGCAGGAT CGTTGGATGC AGCTCATCGG GCAGGGATGA CGGCCGAGGT GACTCCCCAT
CACCTCCTTC TCTCCATCGA ACGGTTCGCC GATACCGACA CTCACGGGCG TGTGAACCCA
CCGCTCAGGT CGGAACGTCT CCAGAGAGAA CTCTTTCTGG CCTGGGATCG GATCGACCTG
ATCGCTTCGG ACCATGCGCC GCACACATTG AACGAGAAGG CGCAGGCCTT TACGAATGCC
CCTTCCGGGC TGCCGGGCGT CGAGACGATG GTTCCGCTGT TGATGGCGCA TGTCCTCACC
AGCGAACTCT CTCTCGCTTC TGTCGTGCAG AAGACCGCTG TTGCACCGGC GAAAGTTCTG
GGAATCCCAC CGGCCGGGTT CTCACCCGGC GATCGTGCCG ACTTTGCGCT CTATCCCCGT
GAGGCGGTCC CTGTTGAGGC CGCCGACCTG CACAGCAGGT GTACCTGGAC ACCGTATCAA
GGGATGTTGG CGGTCTTTCC TGAACGGGTA ATCATGCGGG GAACGGTCGT CTATGACCAT
GGGGACTTCA CAAGGATCGA CCCCTGCTGG TACAGGGGGA GGGGTTATAT GGAGAGACCA
CAGATATGA
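
The GC content field in this record can in principle be recomputed from the gene sequence block. The following is a minimal sketch, assuming the sequence is pasted as plain text in the ten-base, space-separated layout used above; gc_fraction is a hypothetical helper name. It is demonstrated on the first line of the sequence only, so its output is not expected to equal the whole-gene figure.

```python
def gc_fraction(seq_block):
    """Return the fraction of G and C bases in a whitespace-formatted sequence.

    All whitespace (spaces between ten-base groups, line breaks) is removed
    before counting, so the block can be pasted as displayed.
    """
    seq = "".join(seq_block.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence in this record; paste the full block
# to estimate the GC content of the whole gene.
first_line = "ATGCTAGATC TCCTCCTATT AAACCTGACC CTTCCGGACG GAAGGGTTGT CGACCTGCAG"
print(round(gc_fraction(first_line), 3))
```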
 
Protein sequence
MLDLLLLNLT LPDGRVVDLQ VRDGIVVHAG AGAPAHQTLD CRGLLVLPAA IDMHVHMRGG 
TQSVKEDWTT GSQSALAGGV TVVVDQPNTV PPITNREHFK VRVADATAHS YCGFGVNGAV
TRDARIADLW QGGALAFGEV FIAPSSYGEA LTLEVQQRTF AEIHRLGGLV TVHAEEVSGT
APVGLRQHSL QRSPAGEERA VQALRASCAP GQRVHCCHMS TAGSLDAAHR AGMTAEVTPH
HLLLSIERFA DTDTHGRVNP PLRSERLQRE LFLAWDRIDL IASDHAPHTL NEKAQAFTNA
PSGLPGVETM VPLLMAHVLT SELSLASVVQ KTAVAPAKVL GIPPAGFSPG DRADFALYPR
EAVPVEAADL HSRCTWTPYQ GMLAVFPERV IMRGTVVYDH GDFTRIDPCW YRGRGYMERP
QI
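
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table; translate and CODON_TABLE are hypothetical names. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and it assumes the input begins in frame at the start codon.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Amino-acid assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_block):
    """Translate a whitespace-formatted DNA block, stopping at the first stop codon."""
    seq = "".join(seq_block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
print(translate("ATGCTAGATC TCCTCCTATT AAACCTGACC CTTCCGGACG GAAGGGTTGT CGACCTGCAG"))
# prints: MLDLLLLNLTLPDGRVVDLQ
print(translate("CAGATATGA"))
# prints: QI
```

The two outputs match the first twenty residues and the final two residues of the protein sequence listed above.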