Gene Mmar10_1609 details


Gene Information

Locus tag: Mmar10_1609
Symbol:
ID: 4283931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 1762193
End bp: 1764247
Gene Length: 2055 bp
Protein Length: 684 aa
Translation table: 11
GC content: 65%
IMG OID: 638141096
Product: TonB-dependent receptor
Protein accession: YP_756839
Protein GI: 114570159
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
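
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1762193
end_bp = 1764247

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 2055
print(protein_length)  # 684
```

Both results agree with the Gene Length and Protein Length fields.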


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.952193
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.492474
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGATGG GTGAACCAGG ACTGGCTGGG CTTGCGGCGA GCGTCATGCT GGCTGCCGGC 
GCGGCTGCCC AGGGCCCGCT GCCGGGCAAT AATCGGGATG ATGTGTTCAA TCGGGATGAT
GTGATCGTGG TCACCGGCGC GCGCAGCGGC GAGGCGGTCG ACGCGACCAT CCTGCCGGTC
ACCCGGATTG ACGGCGACAC GCTCATCCGG ACGGCGGCCC AGCATCCCGC CGAAATTCTG
GCCCGTGTGC CCGGTGTGGC GCTGAACCGC GGCAATGGCG CCGAACATCT TGCCGCCATC
CGCTCGCCGG TCCTGACCGG CGGGGCAGGG GCCGGATCCT TCCTCTATCT GGAGGACGGC
ATCCCGCTGC GCGCCGCCGG CTTTGCCAAT GTGAATGGCC TGTTCGAGGC CCTGAACGGG
CTGTCTGGTG GGATCGAAAT TGTGCGTGGA CCTGGATCGG CCCTGCAGGG TTCGAACGGG
CTTCACGGTG TCATCAACTA CCTGACCCCG ACCGCCGCGG ACACGGCATC GGGGTTCGAG
GCTGAGCTGG GGTCTTTCGG GCGGCGACGC GGACAGCTGA TCATCTCCCG TCCCGGCGAC
GGAGCGTCGG CTCTGCTTGC CATAGCGGGG CAGCACGAAG ATGGCTGGCG TGACGAAGCC
GGCCTCGATC GGGTTGAAAC CCTGGTGCGG CTCGATGGCG CGATGCCGCG CTTTGACTGG
CGGGCCACGG CGGCTCTGGT GAGCTTGCAG CAGGAAACCG CCGGCTATGT CGGCGGTCTC
GACGCCTATC GCAATCTTGA TCTGGCGCGC AGGAATGGAG ATCCGGAAGC CTTTCGCAAC
GCGCAGGCCC TGCGTCTGAC GCTCAGCCTG AGCGGTCAGG CCGACGCCCT CACGCAATGG
CAGGTCACCC CCTATCTGCG CGCCAACGAG ATGGATTTCC TGATGCATTT CCTGCCCTCC
GAGGCTCTCG AGGAAAGCGG CCATGCCAGC GCGGGCGTGC AAACCTCGCT GACCCGGCAG
ACCGGATGGG GCGAGTGGGT GCTGGGCGCC GATGTCGAAG CCACGACCGG CACGCTGCGC
GAAACCCAGA CCCGCCCGAC CCTCTTCTCC TTCGTCCAGG GCGAACATTA TGACTACACC
GTGGATAGCA CGCTGACGGC GGTCTACGGG CAGGGGTATT GGGACGCGTC CGATGCCTTG
CGCCTGCAAG CCGGATTGCG GCTCGAGCGG GTGCAATATG ACTATGATAA CCGGCTCGCC
GACAACGCCA TTGGGCGCTA TCTGCGCGTC GCCGACCGAA GCGACGGCTA TGATCTCGTC
CTGCCTCATG TCGGTGCCAG CTGGCAGATC AATACCTCGA GCCATGTCGT GACCCGCCTC
GCACGCGGGG CCCGGGCGCC GCAGACCGCC GAGCTCTACC GGCTGCAGCC AGGTCAGGTG
ATTGACGGCA TCGAGCCGGA AACACTGGAC AGTTTCGAGA TTGGATACCG GCGCGGCGGC
GCGCGCTTCG ACTGGTCCGT GACCGCTTTT GCCATGAAAA AGCACAATGT TTTTTTCCGC
GATGCCGACG GCTTCAATGT GACGGGTGGC CAGACCCGGC ATCACGGGCT GGAACTGGCG
CTGGACTGGC TGGCTTCATC GACGCTGACA TTCAACCTGG CCGGCAGCTG GGCCCGTCAC
CTCTATGATT TCGACCGGCC GGTCAGCACT GCCAGCGAGA CCATCCGCGC GGGAGCCGAG
ATCGACACCG CCCCGGAATG GTTGTGGAAT GTGCGTGCCC GATGGCAACC CACCGCGCGC
TTCAGTGGCG AGCTGGAATG GTCGCACATG GGTCGCTACT TCACCGATGC TGGCAATCTG
CACAGCTATG ACGGACACGA CCTGTTTCAC CTGCGCGCCC GCTATCAATG GCGCGAGGGC
GCGGAAATTT TCGGCGCTAT TAGAAACCTG CTCAACACCC GCTACGCCGA GCGCGCCGAC
TACGCTTTCG GCTCAGACCG TTACTTTCCG GGCGAGGAGC GGGGAGTGAG CATTGGCGTT
CGGGTCGGGT TTTGA
 
Protein sequence
MRMGEPGLAG LAASVMLAAG AAAQGPLPGN NRDDVFNRDD VIVVTGARSG EAVDATILPV 
TRIDGDTLIR TAAQHPAEIL ARVPGVALNR GNGAEHLAAI RSPVLTGGAG AGSFLYLEDG
IPLRAAGFAN VNGLFEALNG LSGGIEIVRG PGSALQGSNG LHGVINYLTP TAADTASGFE
AELGSFGRRR GQLIISRPGD GASALLAIAG QHEDGWRDEA GLDRVETLVR LDGAMPRFDW
RATAALVSLQ QETAGYVGGL DAYRNLDLAR RNGDPEAFRN AQALRLTLSL SGQADALTQW
QVTPYLRANE MDFLMHFLPS EALEESGHAS AGVQTSLTRQ TGWGEWVLGA DVEATTGTLR
ETQTRPTLFS FVQGEHYDYT VDSTLTAVYG QGYWDASDAL RLQAGLRLER VQYDYDNRLA
DNAIGRYLRV ADRSDGYDLV LPHVGASWQI NTSSHVVTRL ARGARAPQTA ELYRLQPGQV
IDGIEPETLD SFEIGYRRGG ARFDWSVTAF AMKKHNVFFR DADGFNVTGG QTRHHGLELA
LDWLASSTLT FNLAGSWARH LYDFDRPVST ASETIRAGAE IDTAPEWLWN VRARWQPTAR
FSGELEWSHM GRYFTDAGNL HSYDGHDLFH LRARYQWREG AEIFGAIRNL LNTRYAERAD
YAFGSDRYFP GEERGVSIGV RVGF
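
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a GC-fraction calculation and a codon-by-codon translation. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits (this gene starts with ATG, so the difference does not matter here).

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq_text):
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(seq_text.split()).upper()

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence above.
first_blocks = "ATGCGGATGG GTGAACCAGG ACTGGCTGGG"
print(translate(clean(first_blocks)))  # MRMGEPGLAG
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` gives a value that can be compared with the GC content field.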