Gene Mmwyl1_4301 details

Gene Information

Locus tag: Mmwyl1_4301
Symbol:
ID: 5368059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Marinomonas sp. MWYL1
Kingdom: Bacteria
Replicon accession: NC_009654
Strand:
Start bp: 4869589
End bp: 4872645
Gene Length: 3057 bp
Protein Length: 1018 aa
Translation table: 11
GC content: 47%
IMG OID: 640806703
Product: TonB-dependent receptor
Protein accession: YP_001343131
Protein GI: 152998296
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
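
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_len = span_length(4869589, 4872645)
print(gene_len)                  # 3057
print(protein_length(gene_len))  # 1018
```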


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.15211
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACACC AAAAACAGCC CCTCTTCAAA CCATTAATAC TGGCCATACC ATTAAGCGTT 
TCCGTCGCAT TATATGGAAT TTCTCCCGTT TTAGCCGCAG ATAACATGGC CCAAGTAGAC
AGTCGCTACT ATTCCATCCC GGCAGGCTCT TTAGATTCAG TACTGAATCA ATTTGCATTA
ACCGCTGATG TGAGTTTATC CATTAACTCT GCCCTTACCA CAGGCAAACG CAGCTCTGGA
CTTAACGGCG ACTACACACA AGACGCTGCA CTGGCAAAGA TTCTGGCAAA CACTAATCTG
GTTGCCCAAC AAACTCAAAA TGGCAGTTAC ATCGTAAAAT CCGAGCAAGG AGATGACGGA
GTCAATCTTC CTACCATTAC CATTGAAGAC AACAACGCAA TGTCTGATAT GAGTGCCCGT
GATCGAAAGG GCTATGACGA CGTATACGAC AAAAACACCT CCACGACCTT TATTGGTAAA
ACCGAGGTGG AACGTTATAA GGGCACCACC CCATCCGATT TACTACAAGG CGTTCCCGGC
GTATTCAGTG GCGAAGCCCG CAACAGCGGC GCACTCGATC TAAACATTCG TGGCGTTCAA
GGCCCTGGCC GTGTTCCGGT AACAATTGAT GGCACAGAAC AAGCGCTGAC AGTATGGCGT
GGCTACAACG GCGCGACTAA CCGCAATTAC ATCGACCCAA ACCTTATTGG CAATGTTCAA
ATTTACAAAG GTGCCACCAA TGAGCGGGAC GTACATAGCG GTGTTGGTGG CGCCATGGTG
GTCAAAACCC TGTCACCCGA CGATTTAATC CGCGATGGTG AAACCTTTGG CGCAGAATTC
AAAATAGAAG GCAGTAGCAA TGCTACAGGA GAACGAGTAC CCGCTCTTCA TACTGGCGAA
TTAGCGACAG ATGTCAACGG TTACCCTGCC GGAAGCGCTT ACCCCTATGC CGACAAAACC
TTACGAGTTA ATTTAAAAAG CAAATCGGAT AGCGATAACA ACCCATTAAA TGGCGGCGAC
TATGCCTACC GAGTCGCAGC GGCCAAAAAG AGTGAACACT TTGACGTGCT CGCTGCTTAT
GCTTACCGAG AACGAGGCAA TTATTATTCA GGGAAAAATA ATACGGGCTA CTATAACAAT
CCCAGCGCAG CTGACACCCG AGACTACATC ACCTCACTTG CCCAATATTG GCAGCCTGGC
GACGAAGTGA CCAATACCTC CAGCTTAATG GAGTCTTGGT TATTAAAAAC CACATGGCAC
ATTGATGACG ACCAAAAAAT TGGTTTCAAC TTCCGCCAAT CTGATTCCAC TTATGGCGAA
ATCATGCCGT CCAGAATCAA CAACCAAAGC GACCGCAGTG CCATTCAATG GCCACTTAGT
GAAGTCACGG CAAAAGCGTA CAATATTGAA TACAGCTACA AACCCATAGA CAACCGCTGG
ATCGATTTTA ATGCCAATCT TTGGCGAACT GATACCGTCA GTGATACCTA CACCTCCGGT
GGCTTCCCTA ACCAAACCTT ACCCTCCGAT ACATCAGGCG TACTCTATGA TAACGCTGCA
ACCAATGCCA ACAGCACCCG CGACGGTATC ACCCTTAGTA ACAAAATGAA CCTTACCGAT
ACTCTAGACC TAACACTTGG CGGTCGTTTT CAGCACGAAA AGCTCACATC CGATGACGAA
TATAACGAAG CAGCCAGTGC TGGCTGGCGA ATGCTGCCAC GCGCTGGTCG ACGGGAAGAG
TGGGAAACCA ATTTTGATTT TGCTTGGCGC CCTACCGATA AGCTCAAACT CAATGCCGGC
ATGACCTATT CCGCTTATTG GGCGTTTGAT GATTTCCTCG CGGCTCATCC TGGTGAGTTT
AGCCAAAGCA CTACCGATTA TTACAACATA AGCTATAAAA CGGAGCACAC CTACACAGAA
GCAGAAAGGT TGGCCGTAGC ACAAGATTCT CTGGAAGAAC TTCAGGCATT AGGGATCCCT
ATTACCTTAG AGCAACTTCT TGCGATCACT CCCACCACTA AAACAACTAC AAACAATGCT
GGCACTTGGA AGCCCGATGC TGACGGCAAT TACGACCGCG CCGACAACCC TTGCCTAAAT
GGTGAACTAG CAGGAAAAAA CGTCGTTTCA TGTAACACGA ATCCTGTGAA TAACATATCC
ATCGCAGAAG CCAAAAAACA CAAAGACCAT GGCTGGGTGC CTCATGCAGG TATCAGTTAT
CAATTCACCG ATTACAGCCG AGCTTATCTA ACCTATACAG AAACCTTGCG CTACCCAAGC
ATGTTTGAAA GCACCATGGC TTTCTCTGCA TCTCAAAACC CTTATGGCGT CAAACCAGAA
CACGCCCATA ACTGGGAGCT GGCTTATGTA CATGATTTAA CCCAATGGTT TACCAGCGCA
GAGTATGCCG ACATCAAAAT TGCCTACTAC GACAACCTCA CTGAAAACGT TATCGAACGA
GACAGCAATT TTAAATTTAA CAATGTCGAC GAGCAAAAAA TTCGTGGTAT AGAACTCAGT
GCCCGTTATG ACAATGGCCG ATTTTTCACT GGTTTAGGCG TGAATTACAC CTTACAAAAT
GAAATCTGTG ATGAAGACAG CGCGGCAATG TTATCCACAA ATGACCTGAT GCGGGCCGTT
GATAATCCGA TTCCTCGCTG CTTCAAATAC GGCTTTCCTA ATGGTTATCA ACTCGCCCAA
GCTACTCCGG AGCTTTCCGC CAACTTATCA CTAGGAGGCC GTTTTATGGA CCGTCGTTTA
GAAATAGGTG GACGAGCTAC CTATTACAAA GGCTACGAAA ACAGTGATCT AGATTGGTAC
ATCGCAAACT CTTATAGGGC AGGTGAACTT GGTTATGTTT ATTTCTATAA CACGCCTTAT
AGCTGGGGCG ACACGCTCAT CTTCGACGCC TACGTCCGCT ATAAAATTAA TGAAATCTTT
GATGTGGAGT TCACCGGTTC AAACCTAAGC GACCAATATT ACGTGGATCC AGCTACTCGA
TCTGCCGTGG CCGCGCCGGG ACGCACATTT AAACTGGGTT TAACAGGCCG CTTCTAG
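
The gene sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before computing anything on it. A minimal sketch of that cleanup plus a GC-content calculation follows. The helper names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAACACC AAAAACAGCC CCTCTTCAAA CCATTAATAC TGGCCATACC ATTAAGCGTT"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this line only, not the whole gene
```

Passing the full block of sequence lines to `clean_sequence` should give a string whose length matches the Gene Length field and whose `gc_percent` can be compared with the GC content field.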
 
Protein sequence
MKHQKQPLFK PLILAIPLSV SVALYGISPV LAADNMAQVD SRYYSIPAGS LDSVLNQFAL 
TADVSLSINS ALTTGKRSSG LNGDYTQDAA LAKILANTNL VAQQTQNGSY IVKSEQGDDG
VNLPTITIED NNAMSDMSAR DRKGYDDVYD KNTSTTFIGK TEVERYKGTT PSDLLQGVPG
VFSGEARNSG ALDLNIRGVQ GPGRVPVTID GTEQALTVWR GYNGATNRNY IDPNLIGNVQ
IYKGATNERD VHSGVGGAMV VKTLSPDDLI RDGETFGAEF KIEGSSNATG ERVPALHTGE
LATDVNGYPA GSAYPYADKT LRVNLKSKSD SDNNPLNGGD YAYRVAAAKK SEHFDVLAAY
AYRERGNYYS GKNNTGYYNN PSAADTRDYI TSLAQYWQPG DEVTNTSSLM ESWLLKTTWH
IDDDQKIGFN FRQSDSTYGE IMPSRINNQS DRSAIQWPLS EVTAKAYNIE YSYKPIDNRW
IDFNANLWRT DTVSDTYTSG GFPNQTLPSD TSGVLYDNAA TNANSTRDGI TLSNKMNLTD
TLDLTLGGRF QHEKLTSDDE YNEAASAGWR MLPRAGRREE WETNFDFAWR PTDKLKLNAG
MTYSAYWAFD DFLAAHPGEF SQSTTDYYNI SYKTEHTYTE AERLAVAQDS LEELQALGIP
ITLEQLLAIT PTTKTTTNNA GTWKPDADGN YDRADNPCLN GELAGKNVVS CNTNPVNNIS
IAEAKKHKDH GWVPHAGISY QFTDYSRAYL TYTETLRYPS MFESTMAFSA SQNPYGVKPE
HAHNWELAYV HDLTQWFTSA EYADIKIAYY DNLTENVIER DSNFKFNNVD EQKIRGIELS
ARYDNGRFFT GLGVNYTLQN EICDEDSAAM LSTNDLMRAV DNPIPRCFKY GFPNGYQLAQ
ATPELSANLS LGGRFMDRRL EIGGRATYYK GYENSDLDWY IANSYRAGEL GYVYFYNTPY
SWGDTLIFDA YVRYKINEIF DVEFTGSNLS DQYYVDPATR SAVAAPGRTF KLGLTGRF
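
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on a library. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Since this gene begins with ATG, the sketch does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so block-formatted sequences can be pasted as-is.
    Alternative start codons are NOT converted to M in this sketch.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence from the record.
print(translate("ATGAAACACC AAAAACAGCC CCTCTTCAAA"))  # MKHQKQPLFK
print(translate("GGCCGCTTCTAG"))                      # GRF
```

Both fragments match the corresponding ends of the listed protein sequence, which starts with MKHQKQPLFK and ends with GRF.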