Gene Mmar10_0142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0142 
Symbol 
ID4284149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp143687 
End bp144832 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content68% 
IMG OID638139607 
Productpeptidase U34, dipeptidase 
Protein accessionYP_755376 
Protein GI114568696 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4690] Dipeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTGACA CCCTGGTCGT CCGTGGCGGC GGCGCCGTCT GGTTCGCCAA GAATTCCGAC 
CGTGAGCCGG GCGAGGTCCA GCGGGTCGAG CGGCATGCCG CGGTCGCCGA CGACACCACT
GAAAAGCTCG CCTGCACCCA TATCGAGATC GACCAGATCC CGGATCGTCA GGCTACCATC
CTGTCGCGCC CGTCCTGGAT GTGGGGCGCC GAGATGGGCG TCAATGCGTC CGGCGTGGTG
ATCGGCAATG AAGCCGTGTT CTCGCGCAAG GTGATGAAAC GGGGCAAGGC CCTGCTGGGC
ATGGACCTTG TCCGCCTGGG ACTGGAGCGG GGCAGCTCGG CACATGAATC GGCGGCGATC
ATCATCCATC TGCTGGAAAC CCATGGGCAG GGCGGACCGG CCGGCTGGCG CAATAAGGGG
TTTCGTTATG ACAACAGCTT CCTGATTGCC GACGCTGCCG AGGTGCTGGT GCTGGAGACC
TGCGGCCGCG ACTGGCGGCT GGAGCGCGTG AAACGCCACG CGGCGATCTC CAACGCCTAT
ACCCTTGAAG GCCCGGTGAC CATGGCTTCG GAGGGAGCGC CGATCGAGGG CTTTGGCGCG
AGTGACGAGA CCTGGCTGCG CCCGACATTA GGACGGGCCC GCGAGCGCCG GGCCTGTGCC
CTGGCGGCGC TGGAACGCCT CGACAGGCCG GACTTTGCCA GCCTGGCGAA AATCATGCGC
TCGCATGACC GGGGCGACGG CTTCACCAAG GGCTCCAACC GCGATCTGTG CCTCCATCAT
GGCGGCCTCA TGCGACCCAG CCAAACGACC AATTCCATGC TGGTACGGCT GGCGCCCGGC
GAGGCCCCGG CCGTCGCCAT GACCGGCACC AAGACACCCT GTGTCTCCCT CTTTCGCCCG
GTGGCCTTCG ACGGTGGGTC CAGCCTGTTC TCGGACACGC TCTGGGAGCA GGGCTCAAAG
CGCCACGACG CGCTGGCCCG CGACCCGTCA GCCCGCCAGC AGGTCCGCAA TCGCATCGCC
GCGGCGGAGG CGCATATCCT GCCGGCCATC GAGGCCGGCC GACCGGATGT GGCGGAGGCC
CTGGTCACGG CCTGGGATGA TCATGGACTG GATGCGGGCC GTGCCGGGAC CGAGCCGTCC
GGCTGA
 
Protein sequence
MCDTLVVRGG GAVWFAKNSD REPGEVQRVE RHAAVADDTT EKLACTHIEI DQIPDRQATI 
LSRPSWMWGA EMGVNASGVV IGNEAVFSRK VMKRGKALLG MDLVRLGLER GSSAHESAAI
IIHLLETHGQ GGPAGWRNKG FRYDNSFLIA DAAEVLVLET CGRDWRLERV KRHAAISNAY
TLEGPVTMAS EGAPIEGFGA SDETWLRPTL GRARERRACA LAALERLDRP DFASLAKIMR
SHDRGDGFTK GSNRDLCLHH GGLMRPSQTT NSMLVRLAPG EAPAVAMTGT KTPCVSLFRP
VAFDGGSSLF SDTLWEQGSK RHDALARDPS ARQQVRNRIA AAEAHILPAI EAGRPDVAEA
LVTAWDDHGL DAGRAGTEPS G