Gene Mmar10_0587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0587 
Symbol 
ID4286875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp678644 
End bp679978 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content66% 
IMG OID638140052 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_755818 
Protein GI114569138 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3591] V8-like Glu-specific endopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.00290812 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCCATC ACATCCGGAT CGCCGTACTG GCACTCTTGG CATCAGGTCT GGGCCTGTCG 
GCTCCACTGG CAGCCCAGGA CGGGCCCGTC CTTGCCGGCG CCCCACCTGT CGAAGCTTCA
GCCGAGCCCG CAAGCGAACG CCTGACCAGC CGCGAGATCG GCAACTGGCT GCGCCAGGTC
GACGTCAAGC CTCCCAAGCC CGGCGGCGGC GCCGCGCCGT CCACGCGCTA TACCGGCAAT
GACACCTGCC AGTGGGCCAA TGATGGCGAG TGCGACGACC CCGGCATCGG CACCGGCGCC
TGCCAGGTCG GCACGGACTA TTCCGATTGC TGGCGCATCG TGGAAGGTGT CGAGGACAAT
ACCTGCCGCT GGGCCAATGA TGGCGAGTGC GACGAGCCGG GCTTCGGCAC CGGTGCCTGT
ACCCAGGGCA CGGACCTGGC TGATTGCGGT GCCATCATCG AGCTGCGCTT TCGCAATGAC
AGCTGCGAGA CCGCCTTTGA CGGTGTGTGC AGCGAGCCGG GCATCGGCGA TGGCCGTTGC
GCCGAGCGGT CGGACCGTGC CGACTGCATC GGCCGCGAGC GCCCGCTGAC CATCAATGAC
CATTATTTCG GCCATGATGA CCGCGTCTTC CACGACACCT CGGTCTTCCC CTGGAACGTC
GTCGGACAGG TCGATTTCGA CAGTGGCGGC GCCTGCACGG CGACCCTGAT CGGTCCCGAC
ATACTGATCA CCGCCTCGCA CTGCATCAAT GGCGATGGCC GCACCGACTC GCGCGGCGTG
TTCCAGACCG CCTACGACCG CCCGGGCGGT CCGCTCTCGG CGCGGGTGAT CGACCATTTC
ATCGACCCGG ACTGGGACGA CCAGCGCTTT TCCTCTGGTG ACGAGATCGA CGGCACCGAC
TGGGCACTGC TGCGCATCGA CCAGCGGCTG GGCGACACGC TCGGTCATGT TGGCGTACGC
GGACTGGTCG ACACGGAAGG TCGTCGCGGC GCCATGCAGG CCGACCTCTA TCAGGGCGGC
TATAGCTGGG ATACCGGCGC CCACCTGTCC GGCAATATCG GCTGTCACAT GGTCGAGATC
GCCAATGACA ACACCATGGC CCATGACTGC GACACCACCC GCGGCGACAG CGGTTCGCCC
TTCATGGTGC GTGAGGGCAA TGAGTATTTC GTCGTCGCCA CCGACAGCAA TTTCCGCTCC
AACCCGCGCG GCCCGATGAT CTACATCGCC GCCCGCTCGG AGCGCTGGAT CCCCTATCTG
GAGGACTTCG CCGCCGGTCG CCTGACAAAC GCCACACAGC GCCCGGCCAG CGGTGGGGTG
AAGCCGCCGA AATAG
 
Protein sequence
MRHHIRIAVL ALLASGLGLS APLAAQDGPV LAGAPPVEAS AEPASERLTS REIGNWLRQV 
DVKPPKPGGG AAPSTRYTGN DTCQWANDGE CDDPGIGTGA CQVGTDYSDC WRIVEGVEDN
TCRWANDGEC DEPGFGTGAC TQGTDLADCG AIIELRFRND SCETAFDGVC SEPGIGDGRC
AERSDRADCI GRERPLTIND HYFGHDDRVF HDTSVFPWNV VGQVDFDSGG ACTATLIGPD
ILITASHCIN GDGRTDSRGV FQTAYDRPGG PLSARVIDHF IDPDWDDQRF SSGDEIDGTD
WALLRIDQRL GDTLGHVGVR GLVDTEGRRG AMQADLYQGG YSWDTGAHLS GNIGCHMVEI
ANDNTMAHDC DTTRGDSGSP FMVREGNEYF VVATDSNFRS NPRGPMIYIA ARSERWIPYL
EDFAAGRLTN ATQRPASGGV KPPK