Gene Mmar10_0463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0463 
Symbol 
ID4284152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp539292 
End bp540287 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content71% 
IMG OID638139926 
Productpeptidase S58, DmpA 
Protein accessionYP_755694 
Protein GI114569014 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3191] L-aminopeptidase/D-esterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATGC CAGCACCGCG AAACAGCCTG ACCGATATCG CCGGTCTGCG CGTCGGACAG 
GTTCATGATG CCGCCGTCCG GACGGGCGTG ACGGTCATTT TGCCGGACCA GCGAGCCGTC
TGTGCGGTCG ATGTGCGCGG TGGCGGGCCG GGCACGCGCG AGACCGATGC GCTGGCCAGC
CACACCCTGG TCGATGCCGT CGACGCGATC GTCCTGTCCG GGGGCTCGTC CTACGGGCTG
GCGGCCGCAG ACGGGGTTGC GGCGGCGCTG GGCGCCCGTG GTGACGGTTT TGCCCTGTTC
GACATGCCGG GCGTGCCCAA ATCCCCGGTC GTGCCCTCGG CCATCCTCTA TGATCTCGCC
AATGGCGGCG ACAAGGCCTG GGGCGAGGAG CCGCCCTATC GGGCACTTGG CAAGGCGGCG
CTGGCGGCCG TCTCGGACAC GGTCGAGCTT GGGGCTTTCG GCGCCGGACA CGGGGCCCGC
GCCGGCCTGC ATGCCGGCGG CACAGGCACG GCCAGCCTCG ACCTTTCCGG CGAAGGCGGG
CGGGTCGCGG CGCTGGTCTG CGTCAACAGT TTCGGCTCGG TGACGCTGCC CGGCGCTGAC
GATGTCTACT GGGCCTGGCC ATACGAGATA GATGGCGAGT TCGGCTGTGG TCGTCCGCCA
GCCGACTGGC GTCCGGCCCC GGAAGACTGG GGCGCGGCCA AGATGCAGCC CGGACCGCGT
GAAAACACCA CCATCGCCGT CGTCGCCACC GACATCGCGC TGACGCCGGC CCAGGCCAAG
CGCCTCGCCA TCATGGCCCA GGACGGGCTG GCCCGCGCCA TCCGCCCGGT CCACACACCC
TTTGACGGGG ATGTGGTCTT TGCCCTCTCA ACCGCCGCCC GGCCGCTGGG CAAGGACGGC
GAAGTACAAC TCGCCCGGCT CGGTTCGGCC GCCGCCGACT GCCTCGCCCG CGCCGTCGCC
CGCGGCGTCC ACGCGGCCCG TCTGGCAACC GGTTGA
 
Protein sequence
MSMPAPRNSL TDIAGLRVGQ VHDAAVRTGV TVILPDQRAV CAVDVRGGGP GTRETDALAS 
HTLVDAVDAI VLSGGSSYGL AAADGVAAAL GARGDGFALF DMPGVPKSPV VPSAILYDLA
NGGDKAWGEE PPYRALGKAA LAAVSDTVEL GAFGAGHGAR AGLHAGGTGT ASLDLSGEGG
RVAALVCVNS FGSVTLPGAD DVYWAWPYEI DGEFGCGRPP ADWRPAPEDW GAAKMQPGPR
ENTTIAVVAT DIALTPAQAK RLAIMAQDGL ARAIRPVHTP FDGDVVFALS TAARPLGKDG
EVQLARLGSA AADCLARAVA RGVHAARLAT G