Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0253 |
Symbol | |
ID | 4284117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 288889 |
End bp | 291924 |
Gene Length | 3036 bp |
Protein Length | 1011 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638139716 |
Product | TonB-dependent receptor |
Protein accession | YP_755484 |
Protein GI | 114568804 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0016718 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGAAT TACGAGGGGA GCGTCGTCTG TCGCGCTCCA GGTTGACCCG TGCTCTGATG TTGAGCTGTG CGGTCACAGC ATTTTCGGCG CCGGCTTTCG CGCAGAATGA TGAAGTCGAA GACGTCGTCG TCATCCAGGG GATTCGGCAG ACCATCCAGG ACTCGATCGA GACCAAGCGC AATTCCGACA CGGTCGTTGA AGCACTGTCT TCCGACGACA TCGGCGACCT GCCGGCCCTG TCGATTGGTG AGGCGCTGGA AACCTTGACC AGTGCGGCCT CCCATATCGA ACAGGGTGGC GCGACCGAGA TTTCCATCCG AGGCCTCGGC CCTTATCTTG GCTCGACTGT CATTAATGGC CGGGAAGCGG CCAATGGCAG TGGCGACCGT TCAGTCAATT TCAGCCAGTT TCCATCGGAA TTGTTCAACT CCCTTGCCGT TTACAAGACG CAGGAAGCGA GCTTCATCGA AGGTGGCGTG TCGGGTCAGA TTTCGCTCTC GACCCTTCGT CCCATTGATT ACGGCCGTAG CCGGATCCAG CTGGAGCTGA AGGGGAATTA CAATCCGGAC AATTCGGATC TGAACATCAG TGAGCGGGGC ATCGGTCACC GGGCGACGCT CAGCTTCGTC GACAGTTGGG AGACCGGCGC TGGCGAGTTC GGCCTGTCCT TCGGTTATCA GAACAACCGC TCGACCAATC CCGAACAGGA AGCGCGCACG ACCAGTTCAT TCCGTGATTG CCGCAATGTT GTCTCCGGAG ATCCCAACAG CGATAATTTC GGCGTAGACA GCCTCGGTGA CCCGGATCAG AACTGCGATA GCGGCGGTGG CGATCTTGTG CTTGAGGTCG ATCCCGCAAC CGGTGTTGCG CCGGACGCGA ACACCCCCTT CGTCTTTGTT CCGAGCCAGC GCCATTTCCG CCAGAACATT ACGGATGATG CGCGTGACTC GATTTTCTTT GCGGCCCAGT GGCAGCCGAA CGCGCGCTGG GACATCAATG TCGATTACCA GCAGTCCGAT CGCGAATTCA CTGAACTGCG CAGTGACCTG ACGATCGACG GCAACTCGGT CCTGAACGTC GGTGAAAGCG GCGAAATCGT CCCGTTGTCC GTGAGCCCGA CCGGCGCTTT TCTGGGTGGC ACGACCTATG ACGGCGCAGA AGTCAGCTCG CACTATATGG AGCGCATCGA GGAATACACC GGCTATGGAT TGCAAGCCGA ATACCAGCTG ACCGACGATC TGACGATTTC GGCCGACTAT TCGTATTCGG AAACGATGCG GCGCGAGAAC ATCATCCAGT CACGCTTGCG CAGTGACACG GACGGTGACT CCGGTTCAGA GGATGTTTTC GTCGGAATTA TCGTCGAGGA CGATGCGCAA CGGTTTGTCT TCGGGGACTT TGATGTCACC GATCCGGCAA ACTTTGACGT CGGCCCCCGT ACCCGGGAAG ACCTCAACCA GTTCCGCAAC AACAGCATCG AAGCCTTCCG GGCCGACTTC GACTATCTGT GGGACAACGG GTTCATCACC AATGTTCGCG GTGGTGTCCG GCATTCAACG CTGGAATATG ATTCCGTGCC TCGCGTGCGC CGCGAGAGCG ATGGCAGCCC GCTCTCTGTG GGTGATGGTG CGGCGGCAAG CGCGGCCTGC ATGAACACTG TCTTCCCGGA AGATGACTTC CTTGCGGACG TCGTGGATGG CCCGCTGATC ACGAATGTCG ACAGTGCCGG GAATGTCATC GCCAACGGCA CCGGTAACTC CTATGTGACT TTCGACCCGC TGTGCCTTGC CGAGGCCATC CTGGGTCGTG CACCGAGTAT CCCGGATGCG AGCGATGTCT TTTTGACCTC GGCTGAAATG GGTACCGGTC AAAACCCGCT CCAGATTGTC GATGTTGCGG AAGAAACGAT CGCTGCCTAT CTGCAGGCCG ACTTTACCGG AGAAATGGGT GAGCTGCCCG TGCGCGGTAA TTTTGGTGTT CGTGTGGTGG ATACGGAAGT CACGTCGAAC GGGTATCGCG GTTCGCTGAC GATTGACCGC GACGGGGCCA ATGTCATCAC CGGCATCGGC GTCGACAACA GCAATCTCGT CGAGATCACG GCCAATCACA GCTATACCGA GGTTCTGCCG AGCGCGAACC TGGTTGTGGA GCTGCGCGAT GACGTGCTGT TGCGCGGTGG CATCTACCGT GCGCTGTCGC GTCCGGACCC GTCCGATCTG GGTGTGGGGC GCAGCTTCTC GAGCAGCATC GACAATGACG GCGGGTCGAC CGACGTGGCC GATGTCATCG CCCAGGTCAC CGGCTTCGGA AACCCGGAGC TCGACCCTCT GATGTCGTGG AATTATGACG CGGCGCTCGA ATGGTATCCC AATGAGGACA CCATTCTGGC CTTCGGGGTC TACTACAAGA GCTTCAATGG TGGTTTCACA AACGTCGGGC AGGTGGAGAC TTTCACCGTC GACGGCCAGG ACCTCCAGGC TGTTGTCACG ACCCAGTCGG TAGACAGCGA GGAGAGCACG ATTTCAGGCT TTGAAATCTC GGCAGCGCAC GCCTTCAGCT ACCTGCCGGG TGCCTGGAGC GGTCTCGGCT TCCGGGTCGG ATACAACTAC GCCGACTCTG ATTTCGAGTT CGAAGATGCC GTCTTTGGTG CGTCGACGAT CATCGCCGCC GATGGCACCG AGATCGAACG GGTCGGAATC GTGGCGCCAG CCAATGTCCC CGGCCTGTCG GAGCATGTCG CATCGGCGCA GCTGTACTAC AGCATCGGGG ATCTCGATCT GCAGGGTGTC TACAAATATC GTAGTGGCTA CTTCCAGCAG TTCATCTCGA CGCCGGGCAA TCTGCGCTAC ATCGACGAAC GCGGCATCTA CGAAGCCCGC GCCTCGTATC AGGTCAATGA TGCGGTGCGT GTCAGCGTTG AGGCGATCAA TATCTTTGAC GAGCCGCGGG TCCAGTACAA TCCGACCCTC GACAACTTCG CTGAAGTCAA TGTCTACGGA CCCCGTATCT ACTTCGGAGT GCGGGGGCGG TTCTAA
|
Protein sequence | MFELRGERRL SRSRLTRALM LSCAVTAFSA PAFAQNDEVE DVVVIQGIRQ TIQDSIETKR NSDTVVEALS SDDIGDLPAL SIGEALETLT SAASHIEQGG ATEISIRGLG PYLGSTVING REAANGSGDR SVNFSQFPSE LFNSLAVYKT QEASFIEGGV SGQISLSTLR PIDYGRSRIQ LELKGNYNPD NSDLNISERG IGHRATLSFV DSWETGAGEF GLSFGYQNNR STNPEQEART TSSFRDCRNV VSGDPNSDNF GVDSLGDPDQ NCDSGGGDLV LEVDPATGVA PDANTPFVFV PSQRHFRQNI TDDARDSIFF AAQWQPNARW DINVDYQQSD REFTELRSDL TIDGNSVLNV GESGEIVPLS VSPTGAFLGG TTYDGAEVSS HYMERIEEYT GYGLQAEYQL TDDLTISADY SYSETMRREN IIQSRLRSDT DGDSGSEDVF VGIIVEDDAQ RFVFGDFDVT DPANFDVGPR TREDLNQFRN NSIEAFRADF DYLWDNGFIT NVRGGVRHST LEYDSVPRVR RESDGSPLSV GDGAAASAAC MNTVFPEDDF LADVVDGPLI TNVDSAGNVI ANGTGNSYVT FDPLCLAEAI LGRAPSIPDA SDVFLTSAEM GTGQNPLQIV DVAEETIAAY LQADFTGEMG ELPVRGNFGV RVVDTEVTSN GYRGSLTIDR DGANVITGIG VDNSNLVEIT ANHSYTEVLP SANLVVELRD DVLLRGGIYR ALSRPDPSDL GVGRSFSSSI DNDGGSTDVA DVIAQVTGFG NPELDPLMSW NYDAALEWYP NEDTILAFGV YYKSFNGGFT NVGQVETFTV DGQDLQAVVT TQSVDSEEST ISGFEISAAH AFSYLPGAWS GLGFRVGYNY ADSDFEFEDA VFGASTIIAA DGTEIERVGI VAPANVPGLS EHVASAQLYY SIGDLDLQGV YKYRSGYFQQ FISTPGNLRY IDERGIYEAR ASYQVNDAVR VSVEAINIFD EPRVQYNPTL DNFAEVNVYG PRIYFGVRGR F
|
| |