Gene Information |
Locus tag | Mmar10_2720 |
Symbol | |
ID | 4286085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 2981653 |
End bp | 2984682 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638142219 |
Product | TonB-dependent receptor |
Protein accession | YP_757944 |
Protein GI | 114571264 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
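The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon. Both are assumptions, though they agree with the values listed here. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2981653, 2984682)
print(length)                     # 3030
print(protein_length_aa(length))  # 1009
```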
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000258905 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.119134 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTACA AAGGGATGAG GGAGCGTCCG ATGCGGGGGC TCCGAACTGG ATTGTGGCGG GGGGCGAGCC TCGCAGCCAT ATCCATTGCG ATGGTTTCGA CGAGCGTTTA CGCGCAGGAT GTCGAGGACG AAGACGATAG CAATACCGAG GTGATCACGG TGCGGGGTAT CCGCTCCAGC CTGCAAAGCG CCCAGGAGAT CCGCCGCAAT TCCGACGTTC TTGTGGACGC GATCACGGCC GAGGATATCG GCGCTTTGCC GGACCGTTCG GTCGCGGAAG CGTTGCAGCG CGTTCCGGGT GTGAACATCA CGCGGTTTGA AGGTGAAAAC GACCCAGACC ATTTCTCGGT CGAAGGGTCC GGCGCGATCA TTCGTGGCCT CAACTTCGCC CGCAGTGAGC TGAACGGTCG GGACGTGTTC TCTGCCGATA ATGGTCAGCA AATCGGCTTC AACGATGTTT CGCCGGAAAT GCTTGGATCG GTTGAAGTAT TCAAGAACCA GTCGGCCGAC ATGATCGAAG GCGGGCTCGC CGGTACGGTG AACCTCAATA CACGCCTCCC GTTCGACCAG GGTGGTCGCA TGATGGCGTT CACCCTCGAG GCGAACTACT CGGATTTCGC CGAGGAAGTC ACGCCCACGA TGTCGGGCAT CTACAGCAAT CGCTGGGAGC TGGGGAATGG GTCTGAATTC GGCCTCATCT TTGGCGCCTC CTATTCCGAG CTGCAAAGCC GCGCTGATGG TACGCTCGTC GCTGAATGGC TCGATCGGGA CCCTAGTGAC GGCCAAAGCC TCTACGTGCC GTCTGGCGCT GGGATCCGGA CGCAGTTGTT TGACCGGACA CGAGACTCGA TTTCCGCTGC CGCTCAGTGG CGCAGTGCAG ACGGTGAATT GGAGGCGACG GCGCAATTCT TCCGGGCTTC CTATGAAAAT GCCTGGTCGG AACGCGCGGT TGAGCCGTCC ATCGACAGTG GGCCCGCAAT CACGCCGGCT CCGGGTACAA GCTTCACCTA TGACGACAAT GGTCTGTTCG AGAGCGGCGT GATCTCGGAG AATGTCGGTT GGCGTTCGAA TGATCCGACT CATCCGCTGA ACGGCGTTCG CCAGCTCGCC CTTGCACGGG GCAAGACGGA TGAAAGTCTG ACCACCGACA TCGGATTCAA TCTCCGCTGG TCGCCGACTG ACCGCTTCCG CGCCAGTTTT GACCTGCAGT TCGTTGAGTC CGAAGTGAAC ATCGCGGATG TTACGGTCCA CGGCGCGTTT TTCTCGGACA TTGATCTGGA TGTGACCGGT GATGTGCCCA GCGTGATCTA CCGCCGTCCT GCTGACGGCT CGGATCCGTA TTTCTCTGAC GCCAGCAACT ACTACCTTCG CTCGGCTATG GACCATCTCT CGGACAATGA AGGCGAATCG GTCGCCTTCC GTGCTGACGC CGAATATGAT TTCGAAGGCG ACAGCTGGCT GAGGTCCGTG CGCTTTGGTG GCCGTGTTTC CAACCGGGAG CAGGTTGTAC GCAGCTCGGT CTATAACTGG GGCAATATTT CGGCGACATG GAACTCGCCG TTCGCTCTGG ACAATCCGGC CATCCCGCCG GGAACATTCG AGCTCTATAA CTTCGATAAT TACATGCGTG GCATGACAAC CGGCCTTGAG GGCGGCATCC CGATGTATAC TGGCGCTCTG GGCGAGGATT ACGACCAGGC TGTCGCTGCC ATCATGGCGA TGCGTAACGC GGCTGGAAAC GGTGGCTGGG TGCCGCTGGC CCAGCGTGGT GGTGTCGTGG CAGGGACGCC ATTCCTGCCG 
TCCGAGATTA CCGATGTGAC GCAGGATACG ACGGCATTCT ATGGCCGTCT GGACTTCGAC AATGAACATC GGTCCGACGG CATTCGCATT TCAGGCAATG TCGGCCTTCG TTATGTCGAA ACCCAGACCA GTGCCGTTGG TGCGCTCTCA TTCCCGGACC GTGCCCTGGT GTTCAACGGT ATGACGCCGG CGGCCTATTG TGCCGGTATC GTCGGTACTA CGCCCGGTAT CTGCCTCGAG GGTGCGGCAA CGCAGACGGC CTTCTATGAC TGGGCGGATG GCAGCAACCA GTCCAACACC GACAGCCACA CCTATGAGAA TTGGCTTCCC AGCGCCAATG TGGTGTTCGG TATCACGGAC GAATTCCAGA TCCGCCTGGG TCTCTCCCAG GCCATCTTCC GTCCTGATTT CGGCCTGATG AAAAGCAATC TGGTCATTCA GCAAGGCGGC GATGACCCGG TGACCGGCGC GTGGCTGGGG CCGGACGCCT CAACCGGCAA CGTTCAACTT GACCCGATCA CGGCCGACCA GTTTGACCTC GCATTCGAAT GGTATTTCGA TGATGTTGGC TCGCTGACCT TCTCCTTGTT CCAGAAAAAC CTGAGCGACT ACATCGTTCC GGCCATCCTG CAACGGGATG TCACAAACAA TGGCGAGACA TTTGCCGTTG ATGTCGACGG TGTCGGCAAT GCGTCGGACA GTGGCGAGAT CAAGGGCTTC GAAATTGCCT ATCAGCAAAC CTTCGATGAC CTGCCCGGCA TATGGTCAGG ATTGGGCGTC CAGGCGAACT ACACCTATAT CGACGCTCAG GGTGTGCCGA ACATCGGTCC CAAGAATGAC GAGCCGAACG GGCGTGGCTC CGCGCCTAAC TTCGACGTGT CACAGTTGCG CCTGCCGCTC ATGTCCGAGC ACACGGTCAA CCTGGTCGGC TTCTACGAAA CCGACAGCTG GAGCGCCCGT CTCGCCTATA ATTGGCGTTC GGAATACACG CTGACTGTTC GTGACGTGAT CTATCCGTTC ACCCCTGTCG TTCATGAGAG TACGGGTCAG CTTGATGGCT CAATCTTCTA CGACATCACG GACAGTCTGA CTGTCGGTGT CCAGGCGGTG AACCTGCTGG ATGAGGTTTC TGAAACCAGC GCGGTCATCA ATTCGAACCT GGTGCAGGCA CCGCGGGCCT ATTTCCGCAA TGACCGGCGC TTCGCATTCG TCTTGCGCGG CCGCTTCTAA
|
Protein sequence | MFYKGMRERP MRGLRTGLWR GASLAAISIA MVSTSVYAQD VEDEDDSNTE VITVRGIRSS LQSAQEIRRN SDVLVDAITA EDIGALPDRS VAEALQRVPG VNITRFEGEN DPDHFSVEGS GAIIRGLNFA RSELNGRDVF SADNGQQIGF NDVSPEMLGS VEVFKNQSAD MIEGGLAGTV NLNTRLPFDQ GGRMMAFTLE ANYSDFAEEV TPTMSGIYSN RWELGNGSEF GLIFGASYSE LQSRADGTLV AEWLDRDPSD GQSLYVPSGA GIRTQLFDRT RDSISAAAQW RSADGELEAT AQFFRASYEN AWSERAVEPS IDSGPAITPA PGTSFTYDDN GLFESGVISE NVGWRSNDPT HPLNGVRQLA LARGKTDESL TTDIGFNLRW SPTDRFRASF DLQFVESEVN IADVTVHGAF FSDIDLDVTG DVPSVIYRRP ADGSDPYFSD ASNYYLRSAM DHLSDNEGES VAFRADAEYD FEGDSWLRSV RFGGRVSNRE QVVRSSVYNW GNISATWNSP FALDNPAIPP GTFELYNFDN YMRGMTTGLE GGIPMYTGAL GEDYDQAVAA IMAMRNAAGN GGWVPLAQRG GVVAGTPFLP SEITDVTQDT TAFYGRLDFD NEHRSDGIRI SGNVGLRYVE TQTSAVGALS FPDRALVFNG MTPAAYCAGI VGTTPGICLE GAATQTAFYD WADGSNQSNT DSHTYENWLP SANVVFGITD EFQIRLGLSQ AIFRPDFGLM KSNLVIQQGG DDPVTGAWLG PDASTGNVQL DPITADQFDL AFEWYFDDVG SLTFSLFQKN LSDYIVPAIL QRDVTNNGET FAVDVDGVGN ASDSGEIKGF EIAYQQTFDD LPGIWSGLGV QANYTYIDAQ GVPNIGPKND EPNGRGSAPN FDVSQLRLPL MSEHTVNLVG FYETDSWSAR LAYNWRSEYT LTVRDVIYPF TPVVHESTGQ LDGSIFYDIT DSLTVGVQAV NLLDEVSETS AVINSNLVQA PRAYFRNDRR FAFVLRGRF
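The sequences above are listed in space-separated blocks of ten characters. The following is a small sketch for stripping that formatting and computing length and GC fraction. The helper names are hypothetical, and the source database may have computed its GC content field differently (for example, with rounding).

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage on the first three blocks of the gene sequence only.
fragment = clean_sequence("ATGTTTTACA AAGGGATGAG GGAGCGTCCG")
print(len(fragment))          # 30
print(gc_fraction(fragment))  # 0.5
```

Applying `clean_sequence` to the full Gene sequence field should yield a string whose length matches the Gene Length field, which makes a quick consistency check after copying the record.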