Gene Mmar10_0143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0143 
Symbol 
ID4285508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp144984 
End bp147188 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content64% 
IMG OID638139608 
Productpeptidyl-dipeptidase Dcp 
Protein accessionYP_755377 
Protein GI114568697 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.62915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAAC TTATGATGAT CACTGCGAGC ACGGCCGCGC TTTTTGCCGC CGCCTGCACG 
CCTGCCCAAG ACGCCGCCGA GACGGATGAA ACCAGCGCGA TGACCGACAC CACCGACACC
GCAGCCAACG CGGCTGACAC TGCGCCGGCC ACATTCGACA ATCCGCTGCT CGCCGAGTGG
ACCGGCCCCT ATGGCGGAAC GCCGGACTTC GACGCGGCCA GCCTCGATGA TCTGAGCGCC
GCGCTGCGTG AGGGCATGAG GCTGAACCTG GCCGAGATCG ACGCGATCAC CGCCAATCCC
GAGCCGGCCA CGTTCGCGAA CACCATTCTC GAGCTGGAAC GCGCCGGTGC ACCGCTGGGG
CGTGTCTTCA GCCATTGGGG CATCTGGTCG TCCAACATGT CGTCGCCGGA GTTCCGCGAC
ATCCAGCGCG AAATGTCCGG CGAGCTGTCG GCCTTCTCGT CGCAGATAAC CCAGAATGAA
GAGCTCTTCG GCCGCGTTCG TGCGGTCTAT GAAGGCGAGG AAATGGCGAC GTTGACGCCG
GCCCAACAGC GCGTCGTGCA GCTGACCTAT GATGGCTTTG CCCGCAATGG CGCGATGCTG
GAAGGCGAAG CAGCCGAGCG CTATGCCGCG ATCAATGCCC GGCTCGCCGA GCTGCACCGC
ACCTTCGGCA ACAACGTCCT CAATGACGAG GAAAATTACG TCACCTATCT CAGCGAAGAC
CAGCTGGCCG GCCTGCCGGC GGACTTCGTT GCGGCCTCGG CGGCGGCTGC CGCGACACGG
GATCATGAGG GCGAGTACGC CATCATCAAT ACCCGCTCCT CGATGGACCC CTTCCTGACC
TTCTCCGACG AGCGCGATCT GCGCGAGCAG GTCTGGAACA ATTATTATTC GCGCGGCGAT
AATGGCGGCG AGTACGACAA CAATGCCGTC ATCAGCGAGA TCCTGCAGCT GCGCCACGAG
CGCGTCGGTC TGCTTGGCTA TGACAATTAC GCTTCCTGGC GTCTGGAAAA CCGCATGGCC
GGGACGCCGG AGCGCGCCCA GGCCCTGATG GAAGGCGTAT GGACCGCTGC CGTCGCCCGC
ATCGAGGAAG AGGTCGCCGA CATGCAGGCC CTGGCCGATG CCAATGGCGA CGACATCACC
ATCCAGCCCT GGGACTACCG CTACTACGCC GAGCAGGTCC GTTCGGCCCG CTATGACCTC
GACAGTGAAG AGGTGAAGCA ATACCTGCAG CTCGATAACC TGCGCGAAGG CATGTTCTAC
GTCGCTGGCG AGCTGTTCGG CTTTGCCTTC CGCGAACTGC CCGAGGGCGA GGTCTCGGTC
TGGCATCCGA CGGTGCGGGT CTGGGAGGTC ACCAATCGCG AGACCGGCGC CAATGTCGGC
CTGTGGTATC TTGATCCCTT CGCCCGCCAG GGCAAGCGTT CGGGCGCCTG GGCGAATTCC
TTCCGTTCGC ACACCACGAT TGATGGCGAG ACCAATGTCC TGGCCACCAA CAACTCGAAC
TTCGTCGAAG GGGCTCCGGG CGAGCCGGTC CTCGTCTCGT TTGATGATGC GACGACCTTC
TTCCACGAAT TCGGTCACGC CCTGCACACG CTGTCGTCCA ATGTCGATTA TCCGACGCTC
AATGGCGGCG TGCGCGACTA TACCGAATTC CAGTCACAGC TGCTCGAGCG CTGGGTCCTG
ACCGACCCGG TGGTCAACAA TTTCCTGACC CATGTCGAGA CCGGCGAGCC GATGCCGCAA
GCCCTGATCG ACCGCATCCG GGCGGCCGCC AATTTCAACC AGGGTTTCTC GACCGGCGAA
TATCTCGCCT CGGCGCTGAT GGACATGGTC TATCACACGA CCGATCCGGC CGAGATGAGC
GATCCGGACA CGTTCGAGCG TGAAACGCTG GAGCGTCTGG GCATGCCGAG CGAGATCGTC
ATGCGTCACC GCTCGCCGCA TTTCGGGCAT ATCTTCTCGG GTGAGGGCTA TTCGGCCGGC
TATTACGGCT ACATGTGGGC CGATGTGCTG ACCGCCGACG CCGCCGAGGC CTTCCAGGAC
GCGCCGGGTG GTTTCTACGA TCCGGAGGTT TCGGCCCGTC TGGTCGAGTA TCTCTTCGCC
CCGCGCAACT CGATGGACCC GGCCGAGGCC TACCGCCTTT TCCGTGGCCG CGACGCCGAG
GTGTCGGCCC TGATGCGTGA TCGCGGTTTC CCGGTCACCG AATAG
 
Protein sequence
MRKLMMITAS TAALFAAACT PAQDAAETDE TSAMTDTTDT AANAADTAPA TFDNPLLAEW 
TGPYGGTPDF DAASLDDLSA ALREGMRLNL AEIDAITANP EPATFANTIL ELERAGAPLG
RVFSHWGIWS SNMSSPEFRD IQREMSGELS AFSSQITQNE ELFGRVRAVY EGEEMATLTP
AQQRVVQLTY DGFARNGAML EGEAAERYAA INARLAELHR TFGNNVLNDE ENYVTYLSED
QLAGLPADFV AASAAAAATR DHEGEYAIIN TRSSMDPFLT FSDERDLREQ VWNNYYSRGD
NGGEYDNNAV ISEILQLRHE RVGLLGYDNY ASWRLENRMA GTPERAQALM EGVWTAAVAR
IEEEVADMQA LADANGDDIT IQPWDYRYYA EQVRSARYDL DSEEVKQYLQ LDNLREGMFY
VAGELFGFAF RELPEGEVSV WHPTVRVWEV TNRETGANVG LWYLDPFARQ GKRSGAWANS
FRSHTTIDGE TNVLATNNSN FVEGAPGEPV LVSFDDATTF FHEFGHALHT LSSNVDYPTL
NGGVRDYTEF QSQLLERWVL TDPVVNNFLT HVETGEPMPQ ALIDRIRAAA NFNQGFSTGE
YLASALMDMV YHTTDPAEMS DPDTFERETL ERLGMPSEIV MRHRSPHFGH IFSGEGYSAG
YYGYMWADVL TADAAEAFQD APGGFYDPEV SARLVEYLFA PRNSMDPAEA YRLFRGRDAE
VSALMRDRGF PVTE