Gene Mmar10_1103 details

Gene Information

Locus tag: Mmar10_1103
Symbol:
ID: 4284277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 1203872
End bp: 1205287
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 64%
IMG OID: 638140581
Product: peptidase M28
Protein accession: YP_756334
Protein GI: 114569654
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
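
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the last codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 1203872
end_bp = 1205287

# Assumption: coordinates are 1-based and inclusive, so both end points
# belong to the gene.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the final (stop) codon encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1416 0 471
```

Both results match the listed Gene Length (1416 bp) and Protein Length (471 aa).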


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.608477
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCAA TTATCGCGCT TGGAATGACA GTCTTTCTGT CTGCGGAAAT ACTGGCCCAG 
GACGGCTTTG TCCTGCCGGC CGGGGACGCC GATACGGCGC AGGCCCTGGT CGAGACGGCG
CTGGACAGTG ATCTGGCCTG GGACATTGTC GAATCGCTGA CCACCAGCGT CGGTCCCCGT
CTTGCCGGCT CCGAGGCCGA GGCCCGGGCC CGTGCCTGGG GTGAGGAACT GGGTCGCGAG
CTCGGTTTTG ACCGGGTCAG TGTCGAGCCG TTCACGATGG AATTCTGGGA GCGCGGCGAG
ATGGAAATCG TCATGACCGC GCCCTATGAG CAGGCCCTGT ATGGCTCGGC TCTTGGTGGT
TCCGGCCGGT CGCCTTTCCT GGGGGCGGTG AATGCGGAGA TCGTCTATTT CCGCAATATC
GATGCACTGA CCGCCATTGA GGATGGGGCG CTGGACGGCA AGATCGCGTT TGTTGACGGT
GATGCCATGG TGCCCAGCCA GACCGGTGCC GGCTATGGCC CGTCAAACCA GCGCCGCCGT
ATCGGCTGGC AGCACGCCGA GCGTGGCGGT GCCGAGGCCC TCGTGGTGCG CTCGGTCGGC
TCGGACAGCC ATCGCATGCC CCATACCGGC ATGATGAGCT CCATGGATGG CGAGTGGGCC
GATATTCCGG TCATCGCGGT CTCCAATCCG GACGCAGACC ATCTGCGCCG GCTGCACAAT
TCGGGTGAAG CGATCGAGAT GCGGATCCGC TCGACCGCCG GTTGGCGGGG CGAGGTGACC
AGCGGCAATG TCGTGCTGGA TCTGATCGGC CGCGAGAACC CTGACGAGAT CGTCCTGATC
GGCGGTCATC TCGACAGTTG GGACCAGGGT ACAGGCGCGG TCGATGACGG TGCCGGTGTC
GCCATCACGA CGGCTGCCGC AGCGCTGATC GCGCAATTGC CGCAGCGCCC GCGCCGGACC
ATTCGTGTCG TGATGTTCGG GGCCGAGGAA GTCGGCCTTC TCGGTGCCCG CGCCTATGCC
GAACAGCATG CCGACGAGAT CGGCAATCAT GTGCTGGCAA CTGAATCCGA TTTCGGTGCC
CGGACCGTCT GGCAGCTTGT TTCGAATGTC TCGGATGAAG GCACGCCGGC GATCGATGCG
GTGGGCGACA TTATCGGCCC GCTCGGGATT GTCCGCGGGG GATCAAACGT GCCCGGTGGT
GGACCGGACA TCATCCCGCT GGCCATGCAG GGCGTGCCGA CGGTGCGTTT GAGTCAGAAT
GGCAGCGATT ATTTCGACCT GCATCACACG CCGGATGACA CGCTCGACAA GATCGATCCG
GATGAGCTGG CCCAGAACGT CGCGGCCTAT GTGGCGCTGG TTTATCTCGC CGCCGAACTG
GATGTGGATT TCCGGTCGGC GACCGAAGAC GAATAG
 
Protein sequence
MKSIIALGMT VFLSAEILAQ DGFVLPAGDA DTAQALVETA LDSDLAWDIV ESLTTSVGPR 
LAGSEAEARA RAWGEELGRE LGFDRVSVEP FTMEFWERGE MEIVMTAPYE QALYGSALGG
SGRSPFLGAV NAEIVYFRNI DALTAIEDGA LDGKIAFVDG DAMVPSQTGA GYGPSNQRRR
IGWQHAERGG AEALVVRSVG SDSHRMPHTG MMSSMDGEWA DIPVIAVSNP DADHLRRLHN
SGEAIEMRIR STAGWRGEVT SGNVVLDLIG RENPDEIVLI GGHLDSWDQG TGAVDDGAGV
AITTAAAALI AQLPQRPRRT IRVVMFGAEE VGLLGARAYA EQHADEIGNH VLATESDFGA
RTVWQLVSNV SDEGTPAIDA VGDIIGPLGI VRGGSNVPGG GPDIIPLAMQ GVPTVRLSQN
GSDYFDLHHT PDDTLDKIDP DELAQNVAAY VALVYLAAEL DVDFRSATED E
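
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the Python below strips the block spacing, computes a GC fraction, and translates DNA to protein. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling. The codon table is the standard one, whose codon-to-residue assignments are shared by NCBI translation table 11. Only the first row of the gene sequence is used here, so the check covers the first 20 residues rather than the whole protein.

```python
# Standard codon assignments, ordered T, C, A, G at each codon position.
# NCBI translation table 11 uses the same codon-to-residue assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First row of the gene sequence and the first two blocks of the protein
# sequence, copied from the record above.
first_gene_row = "ATGAAATCAA TTATCGCGCT TGGAATGACA GTCTTTCTGT CTGCGGAAAT ACTGGCCCAG"
first_protein_blocks = "MKSIIALGMT VFLSAEILAQ"

dna = clean(first_gene_row)
print(translate(dna))  # MKSIIALGMTVFLSAEILAQ
print(translate(dna) == clean(first_protein_blocks))  # True
```

Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and the complete protein sequence.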