Gene Mmar10_1159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1159 
Symbol 
ID4285723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1267616 
End bp1272481 
Gene Length4866 bp 
Protein Length1621 aa 
Translation table11 
GC content67% 
IMG OID638140639 
Productalpha-2-macroglobulin domain-containing protein 
Protein accessionYP_756390 
Protein GI114569710 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.376607 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.173941 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCGA CTCGGTATGC AAACTGGCTT GTGGGGGCAT TGGCCCTGAC TGTTGCGGCC 
TGTTCCGGTG GCTCCCAGGA GACCGAGACC GGCCCGCAGC TGGACGCCCG TGAAGCCGGC
GCCCCCCGCG CCGAAGAAAG CACGCCCAGC CAGTTCGCCT TTCTGCGCTA TTCCATCGAT
GTCGATGACG ACGCGCCCCG CCTGTGCCTT GGCTTTACCC ACCCGCTCGA TCCGCAGGCT
GATTACGCGT CCTATGTCGC TGTCACTCCT GAGCGCCCCA TCGCCCTGGA TGCCAGCGGG
CAGAGCCTGT GTGTCGGTGG ACTGGGCTTT GGCGAGGGTC AGTCGCTGAC GCTGCGTGCC
GGTCTCCCGG CCGCCAATGG CGATGGCCTG CTGGTCGATG AGACCGTCGA GGTCGATTTC
GGTGACCGGC CCGCCTATGT CGGTATTGCC GGCGACGGGG TCATCCTGCC GCGCCTCGAT
GCCGACGGCC TGGCGATCGA GACCGTCAAT GTCGACAGCG TCGCGGTCAC CCTTCGCCGC
ATCAATGACC GGGCCCTGGC TTTCCGGTCC ATCACCTCCG GCGCCAATAT TGCCTCGGGG
GACTATTACT GGTCCGGCGA AGAGGAAAAC CCCGACGCGG TCGGCGAGAT CATCTGGCGC
GGTGAGATGG ACACGGCCGG CCTGCTGAAC ACGCCGGTGA CGACCGTCTT CCCGGTCGCC
GAGGCCATCG GCACACTGAC ACCGGGTGCC TATCACATCA CCGTGGTCGA TGCCGCCGAG
GCTGATGATG ACTACCGCAC GCCGGCTCGC GCCAATCGCT GGCTGGTGAT CACCGATCTG
GCCCTGACGG CCTATCGCGG CAATGACGGC GTCGATTTCG TTGTCCGTTC GCTGCAGACC
GCCCAGCCTG TCTCCGGTGT GCGGGTCGAA CTGATCGCCC GCTCCAATGA AGTGCTGGCC
GGGGTGACGA CCGATGCCAG TGGCCGGGCG CGCTTTGAGG GCCCGCTGAC CCGCGGCGAG
GGCGCCATGG CGCCGCGCTT GCTGACAGCC TACGGCCCTG ACAATGACTT CGCCGTGCTC
GACTTTCAGC GCAATCCGGT TGACCTGTCC GGACTGGACA CAACGGGGCG CCAGCGCCCG
GACGGGGCCG ATGGTCTCGT CTATCTCGAC CGTGGCATCT ATCGTCCCGG CGAAACCGTG
CATGTATCAG CCCTGCTGCG CGATGCCGAG GCATTCGCCG TCACCGACCG TCCTGTCGAT
CTGACTGTCT ATGGTCCCAA CGGTATCGAG GCGGCCAATG TGCGTTTCCC CGCTGCTCCT
GATGCGGGCG GTGTCAGCTG GGCCTGGGAT GTGCCGCGCA CAGCGGCACG CGGCGAGTGG
CGAATTGTCG CCGAGATGGA TGGCTATGGC CGCGTCGGCC AGGTCCGCTT CTCGGTCGAG
GACTTCGTGC CGCAACGCGT CGGTCTGACC CTGTCCGGCG ATGACGCGAT GGCCATCGGT
GCCGGCGAAG TCCGCGATAT TGAAGCCAAT GTGCGCTTCC TCTATGGCGC ACCTGGCGCC
GGTCTGGTGG TCGAGGGACG GGTCCGTGTC GAGGTCGATC CGGCTCCCTT CGCCGATTTT
GCCGATTTCC GTTTCGGGCG CGGCGATGAA CCCTTCCGTG AGTTCACCAG TGATCTCGCC
GATACCGTCG CCGATGGCGC CGGCCGCGCG GTCCAGTCGA TCGATCTGGG CGAGGCCGGC
CGCGATGCCA CCCAGCCCTT GCGTGTGCGC GCTGTGATCT CGGCCATTGA GCCCGGTGGC
AGGCCGGTCG CCGACGATCT GCGCATTCCC TACCGCCCGG CCGATCTCTA TCTCGGCTTG
CGCCCGCAAT TTGACGGGCG CGCCCAGCGC AATCGTGAAA CCGCGATCGA TGTCGTGGCT
CTGGACCCCG CAGGTGACCT CGTCGCCACG CCGCTGGAAT GGCAATTGGT GCGCGTGGAC
TGGGAATATG ACTGGTATCG CGTGGGCAGT GGTCGCTGGC AGTGGCGCCG CACCCGCAAT
ATCGTTCTGG TCGAACAGGG TGTTGCGTCG AGTGTTGGCG ATGGTCCGAC ATCCATCGAT
ATCCGGGCTC TCGACTGGGG CAGCTACCGG CTTGTCGTCA CCGGGGCGAG CAGTGGCCAG
AGCGCCTCGA CCGATTTCTG GGTCGGCTGG GGTGCGACGG CAGAAGCGGG CAGCGAAGCG
CCTGACCGTG TCGCCCTGTC GACACCGGAC ACGCCAACGC CGGTTGGTGG CGAAATGACG
CTCAGCCTGC TGCCGCCCTA TGCCGGCGAG GCCGAAATCG TGATCGCCAG CGATCATGTG
ATCGAGACGC GCTCGGTCAC CATTCCGGAA AGCGGCGCCG AATTGTCCTT CCGTGTCACC
GAGGAATGGG GCGCCGGCGC CTATGCCATG GTGTCCCTGT TCACGCCGCG CCATCCGGTC
GACCAGCCAG CCCCGCGACG GGCGGTGGGT GTGGCCTACC TGCCGGTCGA TATGGGGCAG
AGGACATTTG AGCTGACGAT GGACGCGCCT GAACGTGTCC ATCCGCGCCA GACGCTGGAA
TTGGGCGTGA CCCTGGACGG GCCGGTGCGT GAGGGCGCCT GGCTGACCGT CGCGGCTGTG
GATGAGGGCA TTCTGGCCCT GACCCGTTTT GCCTCGCCCG ACCCGGTCAG CTGGTTCTTC
GGTCAGTCCT CGCTCGATGT GGATCTTTAT GATGATTACG GTCGCCTGCT CGATCCCAAC
CAGGGCGCCG CTGCAGCTGT CCGTTCCGGT GGTGACCAGA TCGGCGGCGC CGGCCTGACC
GTGGTGCCGA CCCGCACGGT GGCCCTGTTC AGTGGTCCGG TGTCGGTCGG TCGCAATGGC
CGCGCCACCG TCGCGCTGGA GATTCCCGAC TTCAATGGCG AGCTGCGCCT GATGGCCGTG
GCCTGGAGTG AAAGCGGGGT GGGCGGTCTC TCCCAACCGC TGACGGTGCG CGATGATGTG
CCCGCCGAAC TCATCCTGCC CCGCTTCCTG GCGCCTGGTG ACGTCTCGAC GGCGACGCTG
ACCATCGACA ATGTCGACGG TGCTCCCGGC GACTATCTGA CCACCATGAC AGCCGATGGT
GCCGTGTCCG GTGCCGCATC TGACACCATC CCGCTCGACC AGGGCCAGCG CGCCGACCGT
CGCTATGCGC TGGAAGGCGC CGATGCCGGC CTGGGCTCTG TGGCGCTTGA TGTTGGCGGA
CCGGCTGATT TCGCCGTGTC TCGCAGCTAT CCGATCGAGG TCCGCTCGGC CTGGCTGCCA
TCTTCCACCG TCACCCGTGG TCGGCTGCTG CCGGGTGAAA GCTGGTCGCT GGGCTCGGAC
GCACTTGCGG CCTATCTGCC CGGCGGGTCG GATGTCACGC TGAGCTTCTC GCCGACACCG
CTGGACGAGG ATGCGCTGCT GCGCTCCCTG TCGCGCTATC CCTATGGCTG CACCGAACAG
ATCACCAGTC GCGCCATGCC GCTTCTGATG GCCGACCCGC TGGCCCAGGC CGCCGGCATT
GACGGTGTCG ACGACACGCG GGTGATTATC CAGGATGCCA TCTCGACCCT GCTCAACCGA
CAGGCCAATG ATGGCACGAT CGGCCTGTGG CGCATCGGCG ACCGCGGCAG CCGTCCGTGG
ATCGGGGCCT ATGCCGTCGA TTTCCTCGAA CGGGCCAAGG CGGCCGGCTA CACCGTGCCG
CAGGCAGCGC TGGACCGCGC CTATTCGGGT TTGGAGCATG TTGCCGCGCA GGAGAGCTGG
CGGGTCAGCG GCTACACCAC CACCATCTAT TCCTGGCGCG GCCAGACCGA CACCGCCGAG
CGCCTGTCTG ACCGCAGTGC GGCCTATGCG CTTTACGTGC TGGCCCGGGC CGGCCGGGTC
GACCGCTCGC GCCTGCGCTA CATGCATGAT GAACGTCTCG GCGAGATCGA CAGCCCGCTC
GCCCTGGCGC AATTGGGCGC GGCGCTTCAT CTGATCGGCG ACCGGGCCCG GTCGCTCAGT
GCCTTCGACG CCGCCGAGGC CCTGATCGGT TATGAGAATC CGGGCGACTG GTATCAGTCG
GCCCGTCGTG ACCTGGCCGG CGTGGTCGCC TATGCCGCCG AGGCCGGCGA TGCCGAACGC
GTCGCCCGCC TCGCCGAACA GGTCGTGACC GATCTGCCCG AACCGGCCCG TCTGACCACG
CAGGAAAAGG CTTTCCTCCT GATGGCGGCG CAAGGCTTGT CGGGCGGCGC GGATAGCGTT
GTGATTGAGG CGCCGTTCGC CCCGACGAAC GGTGAGCGTC CCGTCTTCAC CGTGCAGCCG
GACATGCTGG ATAGCGAGCT GACCTTCACC GCAGCAGGCG AGGGACCGGT CTGGGTGACC
CAGCTGGCCC ATGGCGAGAC CCGCCTGGCA CCGGACGCCG CAGCCGAAGG CCTGTCCGTG
CAGAAACGTG TGCTCGGCCT TGATGGCCGG GCAGTTGATC TGGAAGCGCT GGTGCAGGGC
GACCGGCTTG TCATCGACAT TACCGTGTCT CCGCACGAAC AACGCCTGAT CCCGGCCATT
CTGACCGACC TTCTGCCGCC GGGCTTCGAG ATCGAGGCCG AGATCAGTTC GGCCGAGGGC
GCGCCGCGCG GAGCCTATGC CTGGCTCGGC CAGATCGTCT CGCCATCGAT GAGTGAAACC
CGGGATGACC GGTATGCCGC CGCCCTGGAC CTGACCCAGC GCCGACCGCG CCGGCTGGCC
TATATCGTCC GTGCCGTGAC GCCCGGTGAA TACACCCTGC CGGGCGCCGT GGTCGAGGAC
ATGTATCGCA GTGATGTCTA TGCCCGATCG CAAACCCGCC GGGTGGTGAT CGCGCCGCGC
GACTGA
 
Protein sequence
MMATRYANWL VGALALTVAA CSGGSQETET GPQLDAREAG APRAEESTPS QFAFLRYSID 
VDDDAPRLCL GFTHPLDPQA DYASYVAVTP ERPIALDASG QSLCVGGLGF GEGQSLTLRA
GLPAANGDGL LVDETVEVDF GDRPAYVGIA GDGVILPRLD ADGLAIETVN VDSVAVTLRR
INDRALAFRS ITSGANIASG DYYWSGEEEN PDAVGEIIWR GEMDTAGLLN TPVTTVFPVA
EAIGTLTPGA YHITVVDAAE ADDDYRTPAR ANRWLVITDL ALTAYRGNDG VDFVVRSLQT
AQPVSGVRVE LIARSNEVLA GVTTDASGRA RFEGPLTRGE GAMAPRLLTA YGPDNDFAVL
DFQRNPVDLS GLDTTGRQRP DGADGLVYLD RGIYRPGETV HVSALLRDAE AFAVTDRPVD
LTVYGPNGIE AANVRFPAAP DAGGVSWAWD VPRTAARGEW RIVAEMDGYG RVGQVRFSVE
DFVPQRVGLT LSGDDAMAIG AGEVRDIEAN VRFLYGAPGA GLVVEGRVRV EVDPAPFADF
ADFRFGRGDE PFREFTSDLA DTVADGAGRA VQSIDLGEAG RDATQPLRVR AVISAIEPGG
RPVADDLRIP YRPADLYLGL RPQFDGRAQR NRETAIDVVA LDPAGDLVAT PLEWQLVRVD
WEYDWYRVGS GRWQWRRTRN IVLVEQGVAS SVGDGPTSID IRALDWGSYR LVVTGASSGQ
SASTDFWVGW GATAEAGSEA PDRVALSTPD TPTPVGGEMT LSLLPPYAGE AEIVIASDHV
IETRSVTIPE SGAELSFRVT EEWGAGAYAM VSLFTPRHPV DQPAPRRAVG VAYLPVDMGQ
RTFELTMDAP ERVHPRQTLE LGVTLDGPVR EGAWLTVAAV DEGILALTRF ASPDPVSWFF
GQSSLDVDLY DDYGRLLDPN QGAAAAVRSG GDQIGGAGLT VVPTRTVALF SGPVSVGRNG
RATVALEIPD FNGELRLMAV AWSESGVGGL SQPLTVRDDV PAELILPRFL APGDVSTATL
TIDNVDGAPG DYLTTMTADG AVSGAASDTI PLDQGQRADR RYALEGADAG LGSVALDVGG
PADFAVSRSY PIEVRSAWLP SSTVTRGRLL PGESWSLGSD ALAAYLPGGS DVTLSFSPTP
LDEDALLRSL SRYPYGCTEQ ITSRAMPLLM ADPLAQAAGI DGVDDTRVII QDAISTLLNR
QANDGTIGLW RIGDRGSRPW IGAYAVDFLE RAKAAGYTVP QAALDRAYSG LEHVAAQESW
RVSGYTTTIY SWRGQTDTAE RLSDRSAAYA LYVLARAGRV DRSRLRYMHD ERLGEIDSPL
ALAQLGAALH LIGDRARSLS AFDAAEALIG YENPGDWYQS ARRDLAGVVA YAAEAGDAER
VARLAEQVVT DLPEPARLTT QEKAFLLMAA QGLSGGADSV VIEAPFAPTN GERPVFTVQP
DMLDSELTFT AAGEGPVWVT QLAHGETRLA PDAAAEGLSV QKRVLGLDGR AVDLEALVQG
DRLVIDITVS PHEQRLIPAI LTDLLPPGFE IEAEISSAEG APRGAYAWLG QIVSPSMSET
RDDRYAAALD LTQRRPRRLA YIVRAVTPGE YTLPGAVVED MYRSDVYARS QTRRVVIAPR
D