Gene Mmar10_2720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2720 
Symbol 
ID4286085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2981653 
End bp2984682 
Gene Length3030 bp 
Protein Length1009 aa 
Translation table11 
GC content59% 
IMG OID638142219 
ProductTonB-dependent receptor 
Protein accessionYP_757944 
Protein GI114571264 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000258905 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.119134 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTACA AAGGGATGAG GGAGCGTCCG ATGCGGGGGC TCCGAACTGG ATTGTGGCGG 
GGGGCGAGCC TCGCAGCCAT ATCCATTGCG ATGGTTTCGA CGAGCGTTTA CGCGCAGGAT
GTCGAGGACG AAGACGATAG CAATACCGAG GTGATCACGG TGCGGGGTAT CCGCTCCAGC
CTGCAAAGCG CCCAGGAGAT CCGCCGCAAT TCCGACGTTC TTGTGGACGC GATCACGGCC
GAGGATATCG GCGCTTTGCC GGACCGTTCG GTCGCGGAAG CGTTGCAGCG CGTTCCGGGT
GTGAACATCA CGCGGTTTGA AGGTGAAAAC GACCCAGACC ATTTCTCGGT CGAAGGGTCC
GGCGCGATCA TTCGTGGCCT CAACTTCGCC CGCAGTGAGC TGAACGGTCG GGACGTGTTC
TCTGCCGATA ATGGTCAGCA AATCGGCTTC AACGATGTTT CGCCGGAAAT GCTTGGATCG
GTTGAAGTAT TCAAGAACCA GTCGGCCGAC ATGATCGAAG GCGGGCTCGC CGGTACGGTG
AACCTCAATA CACGCCTCCC GTTCGACCAG GGTGGTCGCA TGATGGCGTT CACCCTCGAG
GCGAACTACT CGGATTTCGC CGAGGAAGTC ACGCCCACGA TGTCGGGCAT CTACAGCAAT
CGCTGGGAGC TGGGGAATGG GTCTGAATTC GGCCTCATCT TTGGCGCCTC CTATTCCGAG
CTGCAAAGCC GCGCTGATGG TACGCTCGTC GCTGAATGGC TCGATCGGGA CCCTAGTGAC
GGCCAAAGCC TCTACGTGCC GTCTGGCGCT GGGATCCGGA CGCAGTTGTT TGACCGGACA
CGAGACTCGA TTTCCGCTGC CGCTCAGTGG CGCAGTGCAG ACGGTGAATT GGAGGCGACG
GCGCAATTCT TCCGGGCTTC CTATGAAAAT GCCTGGTCGG AACGCGCGGT TGAGCCGTCC
ATCGACAGTG GGCCCGCAAT CACGCCGGCT CCGGGTACAA GCTTCACCTA TGACGACAAT
GGTCTGTTCG AGAGCGGCGT GATCTCGGAG AATGTCGGTT GGCGTTCGAA TGATCCGACT
CATCCGCTGA ACGGCGTTCG CCAGCTCGCC CTTGCACGGG GCAAGACGGA TGAAAGTCTG
ACCACCGACA TCGGATTCAA TCTCCGCTGG TCGCCGACTG ACCGCTTCCG CGCCAGTTTT
GACCTGCAGT TCGTTGAGTC CGAAGTGAAC ATCGCGGATG TTACGGTCCA CGGCGCGTTT
TTCTCGGACA TTGATCTGGA TGTGACCGGT GATGTGCCCA GCGTGATCTA CCGCCGTCCT
GCTGACGGCT CGGATCCGTA TTTCTCTGAC GCCAGCAACT ACTACCTTCG CTCGGCTATG
GACCATCTCT CGGACAATGA AGGCGAATCG GTCGCCTTCC GTGCTGACGC CGAATATGAT
TTCGAAGGCG ACAGCTGGCT GAGGTCCGTG CGCTTTGGTG GCCGTGTTTC CAACCGGGAG
CAGGTTGTAC GCAGCTCGGT CTATAACTGG GGCAATATTT CGGCGACATG GAACTCGCCG
TTCGCTCTGG ACAATCCGGC CATCCCGCCG GGAACATTCG AGCTCTATAA CTTCGATAAT
TACATGCGTG GCATGACAAC CGGCCTTGAG GGCGGCATCC CGATGTATAC TGGCGCTCTG
GGCGAGGATT ACGACCAGGC TGTCGCTGCC ATCATGGCGA TGCGTAACGC GGCTGGAAAC
GGTGGCTGGG TGCCGCTGGC CCAGCGTGGT GGTGTCGTGG CAGGGACGCC ATTCCTGCCG
TCCGAGATTA CCGATGTGAC GCAGGATACG ACGGCATTCT ATGGCCGTCT GGACTTCGAC
AATGAACATC GGTCCGACGG CATTCGCATT TCAGGCAATG TCGGCCTTCG TTATGTCGAA
ACCCAGACCA GTGCCGTTGG TGCGCTCTCA TTCCCGGACC GTGCCCTGGT GTTCAACGGT
ATGACGCCGG CGGCCTATTG TGCCGGTATC GTCGGTACTA CGCCCGGTAT CTGCCTCGAG
GGTGCGGCAA CGCAGACGGC CTTCTATGAC TGGGCGGATG GCAGCAACCA GTCCAACACC
GACAGCCACA CCTATGAGAA TTGGCTTCCC AGCGCCAATG TGGTGTTCGG TATCACGGAC
GAATTCCAGA TCCGCCTGGG TCTCTCCCAG GCCATCTTCC GTCCTGATTT CGGCCTGATG
AAAAGCAATC TGGTCATTCA GCAAGGCGGC GATGACCCGG TGACCGGCGC GTGGCTGGGG
CCGGACGCCT CAACCGGCAA CGTTCAACTT GACCCGATCA CGGCCGACCA GTTTGACCTC
GCATTCGAAT GGTATTTCGA TGATGTTGGC TCGCTGACCT TCTCCTTGTT CCAGAAAAAC
CTGAGCGACT ACATCGTTCC GGCCATCCTG CAACGGGATG TCACAAACAA TGGCGAGACA
TTTGCCGTTG ATGTCGACGG TGTCGGCAAT GCGTCGGACA GTGGCGAGAT CAAGGGCTTC
GAAATTGCCT ATCAGCAAAC CTTCGATGAC CTGCCCGGCA TATGGTCAGG ATTGGGCGTC
CAGGCGAACT ACACCTATAT CGACGCTCAG GGTGTGCCGA ACATCGGTCC CAAGAATGAC
GAGCCGAACG GGCGTGGCTC CGCGCCTAAC TTCGACGTGT CACAGTTGCG CCTGCCGCTC
ATGTCCGAGC ACACGGTCAA CCTGGTCGGC TTCTACGAAA CCGACAGCTG GAGCGCCCGT
CTCGCCTATA ATTGGCGTTC GGAATACACG CTGACTGTTC GTGACGTGAT CTATCCGTTC
ACCCCTGTCG TTCATGAGAG TACGGGTCAG CTTGATGGCT CAATCTTCTA CGACATCACG
GACAGTCTGA CTGTCGGTGT CCAGGCGGTG AACCTGCTGG ATGAGGTTTC TGAAACCAGC
GCGGTCATCA ATTCGAACCT GGTGCAGGCA CCGCGGGCCT ATTTCCGCAA TGACCGGCGC
TTCGCATTCG TCTTGCGCGG CCGCTTCTAA
 
Protein sequence
MFYKGMRERP MRGLRTGLWR GASLAAISIA MVSTSVYAQD VEDEDDSNTE VITVRGIRSS 
LQSAQEIRRN SDVLVDAITA EDIGALPDRS VAEALQRVPG VNITRFEGEN DPDHFSVEGS
GAIIRGLNFA RSELNGRDVF SADNGQQIGF NDVSPEMLGS VEVFKNQSAD MIEGGLAGTV
NLNTRLPFDQ GGRMMAFTLE ANYSDFAEEV TPTMSGIYSN RWELGNGSEF GLIFGASYSE
LQSRADGTLV AEWLDRDPSD GQSLYVPSGA GIRTQLFDRT RDSISAAAQW RSADGELEAT
AQFFRASYEN AWSERAVEPS IDSGPAITPA PGTSFTYDDN GLFESGVISE NVGWRSNDPT
HPLNGVRQLA LARGKTDESL TTDIGFNLRW SPTDRFRASF DLQFVESEVN IADVTVHGAF
FSDIDLDVTG DVPSVIYRRP ADGSDPYFSD ASNYYLRSAM DHLSDNEGES VAFRADAEYD
FEGDSWLRSV RFGGRVSNRE QVVRSSVYNW GNISATWNSP FALDNPAIPP GTFELYNFDN
YMRGMTTGLE GGIPMYTGAL GEDYDQAVAA IMAMRNAAGN GGWVPLAQRG GVVAGTPFLP
SEITDVTQDT TAFYGRLDFD NEHRSDGIRI SGNVGLRYVE TQTSAVGALS FPDRALVFNG
MTPAAYCAGI VGTTPGICLE GAATQTAFYD WADGSNQSNT DSHTYENWLP SANVVFGITD
EFQIRLGLSQ AIFRPDFGLM KSNLVIQQGG DDPVTGAWLG PDASTGNVQL DPITADQFDL
AFEWYFDDVG SLTFSLFQKN LSDYIVPAIL QRDVTNNGET FAVDVDGVGN ASDSGEIKGF
EIAYQQTFDD LPGIWSGLGV QANYTYIDAQ GVPNIGPKND EPNGRGSAPN FDVSQLRLPL
MSEHTVNLVG FYETDSWSAR LAYNWRSEYT LTVRDVIYPF TPVVHESTGQ LDGSIFYDIT
DSLTVGVQAV NLLDEVSETS AVINSNLVQA PRAYFRNDRR FAFVLRGRF