Gene Information |
Locus tag | Mmwyl1_4301 |
Symbol | |
ID | 5368059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | + |
Start bp | 4869589 |
End bp | 4872645 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640806703 |
Product | TonB-dependent receptor |
Protein accession | YP_001343131 |
Protein GI | 152998296 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
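The coordinate and length fields above are consistent with each other, and the relationship can be checked with a few lines of Python. This is a minimal sketch: it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4869589
end_bp = 4872645

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 3057
print(protein_length_aa)  # 1018
```

Both results match the Gene Length and Protein Length fields reported for Mmwyl1_4301.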
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.15211 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACACC AAAAACAGCC CCTCTTCAAA CCATTAATAC TGGCCATACC ATTAAGCGTT TCCGTCGCAT TATATGGAAT TTCTCCCGTT TTAGCCGCAG ATAACATGGC CCAAGTAGAC AGTCGCTACT ATTCCATCCC GGCAGGCTCT TTAGATTCAG TACTGAATCA ATTTGCATTA ACCGCTGATG TGAGTTTATC CATTAACTCT GCCCTTACCA CAGGCAAACG CAGCTCTGGA CTTAACGGCG ACTACACACA AGACGCTGCA CTGGCAAAGA TTCTGGCAAA CACTAATCTG GTTGCCCAAC AAACTCAAAA TGGCAGTTAC ATCGTAAAAT CCGAGCAAGG AGATGACGGA GTCAATCTTC CTACCATTAC CATTGAAGAC AACAACGCAA TGTCTGATAT GAGTGCCCGT GATCGAAAGG GCTATGACGA CGTATACGAC AAAAACACCT CCACGACCTT TATTGGTAAA ACCGAGGTGG AACGTTATAA GGGCACCACC CCATCCGATT TACTACAAGG CGTTCCCGGC GTATTCAGTG GCGAAGCCCG CAACAGCGGC GCACTCGATC TAAACATTCG TGGCGTTCAA GGCCCTGGCC GTGTTCCGGT AACAATTGAT GGCACAGAAC AAGCGCTGAC AGTATGGCGT GGCTACAACG GCGCGACTAA CCGCAATTAC ATCGACCCAA ACCTTATTGG CAATGTTCAA ATTTACAAAG GTGCCACCAA TGAGCGGGAC GTACATAGCG GTGTTGGTGG CGCCATGGTG GTCAAAACCC TGTCACCCGA CGATTTAATC CGCGATGGTG AAACCTTTGG CGCAGAATTC AAAATAGAAG GCAGTAGCAA TGCTACAGGA GAACGAGTAC CCGCTCTTCA TACTGGCGAA TTAGCGACAG ATGTCAACGG TTACCCTGCC GGAAGCGCTT ACCCCTATGC CGACAAAACC TTACGAGTTA ATTTAAAAAG CAAATCGGAT AGCGATAACA ACCCATTAAA TGGCGGCGAC TATGCCTACC GAGTCGCAGC GGCCAAAAAG AGTGAACACT TTGACGTGCT CGCTGCTTAT GCTTACCGAG AACGAGGCAA TTATTATTCA GGGAAAAATA ATACGGGCTA CTATAACAAT CCCAGCGCAG CTGACACCCG AGACTACATC ACCTCACTTG CCCAATATTG GCAGCCTGGC GACGAAGTGA CCAATACCTC CAGCTTAATG GAGTCTTGGT TATTAAAAAC CACATGGCAC ATTGATGACG ACCAAAAAAT TGGTTTCAAC TTCCGCCAAT CTGATTCCAC TTATGGCGAA ATCATGCCGT CCAGAATCAA CAACCAAAGC GACCGCAGTG CCATTCAATG GCCACTTAGT GAAGTCACGG CAAAAGCGTA CAATATTGAA TACAGCTACA AACCCATAGA CAACCGCTGG ATCGATTTTA ATGCCAATCT TTGGCGAACT GATACCGTCA GTGATACCTA CACCTCCGGT GGCTTCCCTA ACCAAACCTT ACCCTCCGAT ACATCAGGCG TACTCTATGA TAACGCTGCA ACCAATGCCA ACAGCACCCG CGACGGTATC ACCCTTAGTA ACAAAATGAA CCTTACCGAT ACTCTAGACC TAACACTTGG CGGTCGTTTT CAGCACGAAA AGCTCACATC CGATGACGAA TATAACGAAG CAGCCAGTGC TGGCTGGCGA ATGCTGCCAC GCGCTGGTCG ACGGGAAGAG TGGGAAACCA ATTTTGATTT TGCTTGGCGC CCTACCGATA AGCTCAAACT CAATGCCGGC 
ATGACCTATT CCGCTTATTG GGCGTTTGAT GATTTCCTCG CGGCTCATCC TGGTGAGTTT AGCCAAAGCA CTACCGATTA TTACAACATA AGCTATAAAA CGGAGCACAC CTACACAGAA GCAGAAAGGT TGGCCGTAGC ACAAGATTCT CTGGAAGAAC TTCAGGCATT AGGGATCCCT ATTACCTTAG AGCAACTTCT TGCGATCACT CCCACCACTA AAACAACTAC AAACAATGCT GGCACTTGGA AGCCCGATGC TGACGGCAAT TACGACCGCG CCGACAACCC TTGCCTAAAT GGTGAACTAG CAGGAAAAAA CGTCGTTTCA TGTAACACGA ATCCTGTGAA TAACATATCC ATCGCAGAAG CCAAAAAACA CAAAGACCAT GGCTGGGTGC CTCATGCAGG TATCAGTTAT CAATTCACCG ATTACAGCCG AGCTTATCTA ACCTATACAG AAACCTTGCG CTACCCAAGC ATGTTTGAAA GCACCATGGC TTTCTCTGCA TCTCAAAACC CTTATGGCGT CAAACCAGAA CACGCCCATA ACTGGGAGCT GGCTTATGTA CATGATTTAA CCCAATGGTT TACCAGCGCA GAGTATGCCG ACATCAAAAT TGCCTACTAC GACAACCTCA CTGAAAACGT TATCGAACGA GACAGCAATT TTAAATTTAA CAATGTCGAC GAGCAAAAAA TTCGTGGTAT AGAACTCAGT GCCCGTTATG ACAATGGCCG ATTTTTCACT GGTTTAGGCG TGAATTACAC CTTACAAAAT GAAATCTGTG ATGAAGACAG CGCGGCAATG TTATCCACAA ATGACCTGAT GCGGGCCGTT GATAATCCGA TTCCTCGCTG CTTCAAATAC GGCTTTCCTA ATGGTTATCA ACTCGCCCAA GCTACTCCGG AGCTTTCCGC CAACTTATCA CTAGGAGGCC GTTTTATGGA CCGTCGTTTA GAAATAGGTG GACGAGCTAC CTATTACAAA GGCTACGAAA ACAGTGATCT AGATTGGTAC ATCGCAAACT CTTATAGGGC AGGTGAACTT GGTTATGTTT ATTTCTATAA CACGCCTTAT AGCTGGGGCG ACACGCTCAT CTTCGACGCC TACGTCCGCT ATAAAATTAA TGAAATCTTT GATGTGGAGT TCACCGGTTC AAACCTAAGC GACCAATATT ACGTGGATCC AGCTACTCGA TCTGCCGTGG CCGCGCCGGG ACGCACATTT AAACTGGGTT TAACAGGCCG CTTCTAG
|
Protein sequence | MKHQKQPLFK PLILAIPLSV SVALYGISPV LAADNMAQVD SRYYSIPAGS LDSVLNQFAL TADVSLSINS ALTTGKRSSG LNGDYTQDAA LAKILANTNL VAQQTQNGSY IVKSEQGDDG VNLPTITIED NNAMSDMSAR DRKGYDDVYD KNTSTTFIGK TEVERYKGTT PSDLLQGVPG VFSGEARNSG ALDLNIRGVQ GPGRVPVTID GTEQALTVWR GYNGATNRNY IDPNLIGNVQ IYKGATNERD VHSGVGGAMV VKTLSPDDLI RDGETFGAEF KIEGSSNATG ERVPALHTGE LATDVNGYPA GSAYPYADKT LRVNLKSKSD SDNNPLNGGD YAYRVAAAKK SEHFDVLAAY AYRERGNYYS GKNNTGYYNN PSAADTRDYI TSLAQYWQPG DEVTNTSSLM ESWLLKTTWH IDDDQKIGFN FRQSDSTYGE IMPSRINNQS DRSAIQWPLS EVTAKAYNIE YSYKPIDNRW IDFNANLWRT DTVSDTYTSG GFPNQTLPSD TSGVLYDNAA TNANSTRDGI TLSNKMNLTD TLDLTLGGRF QHEKLTSDDE YNEAASAGWR MLPRAGRREE WETNFDFAWR PTDKLKLNAG MTYSAYWAFD DFLAAHPGEF SQSTTDYYNI SYKTEHTYTE AERLAVAQDS LEELQALGIP ITLEQLLAIT PTTKTTTNNA GTWKPDADGN YDRADNPCLN GELAGKNVVS CNTNPVNNIS IAEAKKHKDH GWVPHAGISY QFTDYSRAYL TYTETLRYPS MFESTMAFSA SQNPYGVKPE HAHNWELAYV HDLTQWFTSA EYADIKIAYY DNLTENVIER DSNFKFNNVD EQKIRGIELS ARYDNGRFFT GLGVNYTLQN EICDEDSAAM LSTNDLMRAV DNPIPRCFKY GFPNGYQLAQ ATPELSANLS LGGRFMDRRL EIGGRATYYK GYENSDLDWY IANSYRAGEL GYVYFYNTPY SWGDTLIFDA YVRYKINEIF DVEFTGSNLS DQYYVDPATR SAVAAPGRTF KLGLTGRF
|
| |
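The sequence fields are printed in space-separated blocks of ten characters, so they need light cleanup before any computation. The sketch below strips the whitespace, computes a GC percentage, and translates DNA into protein. The helper names are hypothetical and do not come from the source database. The codon assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). The code assumes uppercase A/C/G/T input without ambiguity codes, and it is run here only on the first 30 bases of the gene sequence rather than on the full 3057 bp.

```python
from itertools import product

# Codon table in the usual TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def translate(seq: str) -> str:
    """Translate full codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base blocks of the gene sequence shown above.
first_30 = clean_sequence("ATGAAACACC AAAAACAGCC CCTCTTCAAA")

print(gc_percent(first_30))  # 40.0
print(translate(first_30))   # MKHQKQPLFK
```

The translated fragment agrees with the first ten residues of the protein sequence field. Applying `gc_percent` to the complete gene sequence would be the way to compare against the reported GC content of 47%; that full-length result is not reproduced here.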