Gene Shewmr4_2082 details


Gene Information

Locus tag: Shewmr4_2082
Symbol:
ID: 4252655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2478353
End bp: 2481181
Gene Length: 2829 bp
Protein Length: 942 aa
Translation table: 11
GC content: 51%
IMG OID: 638118706
Product: TonB-dependent receptor
Protein accession: YP_734212
Protein GI: 113970419
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
            [TIGR01782] TonB-dependent receptor
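
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Residue count implied by a CDS length, optionally excluding the stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length = gene_length(2478353, 2481181)
print(length)                           # 2829
print(expected_protein_length(length))  # 942
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed in the record.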


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATCTC CAATCGCTCG ACAATTAGGT TCTCTGATAC CACAACAAGC GCATCCTGTC 
GCTAGGGTCC ACAAACGTCT CACGTCGCGT CTCTCGGCCT TAAGCTTAGC CATGGTGCTC
GCTGGACTGA GTCACAATGC TCTCGCCATA GGCAAACTTG AGGGGCAAAT TCGCGATAAC
AGCAGCCAAC AGCCCCTTGC AGGTGCAACC GTCACCTTAA AAGAGCTCAA CCTTAGCCAA
CAAGCAGGCC GTGATGGCCG TTTCTTTTTT GTCGGTGTGC AGGATGGCGA TTACACCTTA
GTGGTCAACT ACCTAGGGGC GATGCCAAAC GAGCATAGGA TCAGCATTCG CGATAAACAA
ACCACACTGC AAGATATCAA CTTAAGCAGC CAAGATGATG TGGAGCATAT CCGCGTGGTC
GGCCAACAAG GCGCATTAAG TAAATCCATG AACCGCCAAC GCGGCGCCGA CAATGTGCTG
AGCGTGGTCA GTGCCGATGT GCTGGGGAAT TTCCCCGACA GCAATATCAG TGAATCACTG
CAACGGGTGC CAGGGCTGTC TATCGAGCGG GATCAGGGCG AAGGTCGCTT TGTGCGCGTG
CGCGGTATGG CGCCTGACTA TAACTCAGTG TCGATGAACG GCACGCGCTT ACCCTCACCC
GAAAGCGATC GCCGTGCGGT TGCCCTTGAT GTAGTGCCAT CGGATCTATT GCAATCGGTA
GAAGTGAGTA AAACCTTAAC ACCGGATATG GATGCCGATG CACTGGGCGG CGCCATTGAG
GTCAAAAGTC TGTCGGCCTT CGATCGCGAT GATACTTATC TCAACCTCAA CGCCGAGGCG
AGCCAAGATA CCTTAACCGA CAATACCAAT CCCAAACTTG CCGCCAGCTA CAGCGATATC
TTTGCCGACA AGCTAGGGGT AGCCATAGGT GCCAGTTGGT ATAACCGTGA CTTTGGCTCA
GACAACGTCG AAACCGGCGG TAAATGGGAA TTTGCCGGGG ATAACGGCTT TGAGGATGCG
GCGCTTGAAT CCGTCGATGC ACGGGATTAT GAAATCAATC GTGAACGTTT AGGCATAGGG
GTGAACTTCG ATTATCGCCC GAGCGATGAT ACCGATCTGT ATCTGCGCAC CCTTTACAGT
GAGTTTGATG ATACCGAAAC CCGCAACAGC GCTAAAACCA AGTGGAAATC GCCGCAGCAA
GAAAATGCCC TCAGCCAAGG CAAAACCACC CGCTCGCTAA AATCACGCAC CGAGAACCAA
AACATCACCT CCTTTGTGCT GGGCGGTCAA ACCCGCTTCG AACGCTGGAC CTTTGACTAT
CAAGCAAGCC ACAGCACCGC CAGCGCCGAC AAACCGCGGG ATATTGCCGG CGCCAACTTT
GTCGCCAAGA TTGATAACAC GGGTTTTAGC AATACCAATC AGCCACAGAT AATCGCCCCC
GAAGACTATT ACCAAAACGA CAATTTTGAA TTAGATGAAA TTGAAGTTGC CGCCTCCAAG
GCCGAAGACA CCATCAACAG TGGCCAACTG GATCTCACCC GCCAGTTAAC CATTGCGGAT
TACAGCGTCG AGCTAAAAAC TGGGGTCAAG CTGAGCCGCC GCGATAAATC CAATCGCGAA
GATATTTGGA TCTACAGCGA CTTAGGCGAT CAAGGCGTGA GCGATGAAGA CTTGTTATTG
AGTCAATACG CAGGCAATGA GCTTGACTAT GACCTCGGCC GCTTTGGCAG CGGCATCAAT
GCTGCGCCGC TATGGCAGCT TATCGACAGC CTCGATGCCG ACAGCAATCG CGATGATATC
GAGTCCACCA TTAACGATTT TGATATCAGC GAAGATATCA ACGCCGCCTA CCTGATGGGC
CACATCGATA TCGAGAAACT GCGTATCTTA ACTGGGCTGC GTTTTGAGCA AAATCAATGG
GATTCTAGCG GGTACGGTTA CGATGGTGCC AAGGGCGAGT TTATCGATAT CAAGCATTCC
CGCGATGAAG ATCATTGGCT GCCCGCACTG CACCTCACTT ACCGCTATAG CGACAATACC
GTGCTGCGCG CCGCTTGGAC CAACACCTTA GTGCGCCCAA CATTCGGCCA ATTAGCGCCG
GGATATTTGC TCGAAGAGGA TGATGGCGAT ATCGACTTAA CCTTTGGTAA TCCACAACTT
AAGTCGCTCG AATCGATGAA CTTTGACTTA AGCCTCGAGC ACTATTTTGG CAATATCGGC
TTAATTTCGG CGGGGCTGTT TTATAAAGAT ATCGACAACT TTATCTATCA GGCAGATTTA
GCTGGTCGTG GCGATTATAT CGATGCCCAC AGCGCCGTGA CCTTTGTCAA TGGCGACAGT
GCCGACATCT ACGGCGTGGA GCTCAGCTAT GTGCAAGAGT TTAACTTTTT GCCCGAGCCC
TTTAATGCGC TGGTGCTCAA CTCTAACCTC ACCTACACAG ATTCCAGTGC TCAGATCAGT
TGGCTGGAGG ATGGCCAATT ACTGAGCCGC GATATTCCCA TGCCAAGCCA ATCGGATCTC
ACCGCAAACC TGTCCCTTGG CTATGAAAAC AGCTACGCCA GTGTTTGGTT ATCGGCGGCC
TATAAATCCG AATATTTACA GGAAGTCACA GAGCTCAGTG ATGAGCGCTA CGATCTCTAT
CAAGACAATC ATTTGCAGTG GGACTTTGTC GCCAAGGCCC ATTTAACCAG CAATTTAACC
TTGTATTTCA AAGGGGTGAA CCTGACCGAC GAGCCCTACT ACAGCTACAC GGGTGACAGC
AGCTATAACG CCCAATACGA AGCCTATGGC CGCACTTTCC AGTTAGGCGT GCAGTACACC
AACTATTAA
 
Protein sequence
MQSPIARQLG SLIPQQAHPV ARVHKRLTSR LSALSLAMVL AGLSHNALAI GKLEGQIRDN 
SSQQPLAGAT VTLKELNLSQ QAGRDGRFFF VGVQDGDYTL VVNYLGAMPN EHRISIRDKQ
TTLQDINLSS QDDVEHIRVV GQQGALSKSM NRQRGADNVL SVVSADVLGN FPDSNISESL
QRVPGLSIER DQGEGRFVRV RGMAPDYNSV SMNGTRLPSP ESDRRAVALD VVPSDLLQSV
EVSKTLTPDM DADALGGAIE VKSLSAFDRD DTYLNLNAEA SQDTLTDNTN PKLAASYSDI
FADKLGVAIG ASWYNRDFGS DNVETGGKWE FAGDNGFEDA ALESVDARDY EINRERLGIG
VNFDYRPSDD TDLYLRTLYS EFDDTETRNS AKTKWKSPQQ ENALSQGKTT RSLKSRTENQ
NITSFVLGGQ TRFERWTFDY QASHSTASAD KPRDIAGANF VAKIDNTGFS NTNQPQIIAP
EDYYQNDNFE LDEIEVAASK AEDTINSGQL DLTRQLTIAD YSVELKTGVK LSRRDKSNRE
DIWIYSDLGD QGVSDEDLLL SQYAGNELDY DLGRFGSGIN AAPLWQLIDS LDADSNRDDI
ESTINDFDIS EDINAAYLMG HIDIEKLRIL TGLRFEQNQW DSSGYGYDGA KGEFIDIKHS
RDEDHWLPAL HLTYRYSDNT VLRAAWTNTL VRPTFGQLAP GYLLEEDDGD IDLTFGNPQL
KSLESMNFDL SLEHYFGNIG LISAGLFYKD IDNFIYQADL AGRGDYIDAH SAVTFVNGDS
ADIYGVELSY VQEFNFLPEP FNALVLNSNL TYTDSSAQIS WLEDGQLLSR DIPMPSQSDL
TANLSLGYEN SYASVWLSAA YKSEYLQEVT ELSDERYDLY QDNHLQWDFV AKAHLTSNLT
LYFKGVNLTD EPYYSYTGDS SYNAQYEAYG RTFQLGVQYT NY
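
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below is one possible way to strip the spacing, compute a GC fraction, and translate the gene sequence. It is an illustration, not the method used to produce this record. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares; table 11 differs in its permitted start codons, which this sketch does not handle.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final block of the gene sequence listed above.
print(translate(clean_sequence("ATGCAATCTC CAATCGCTCG ACAATTAGGT")))  # MQSPIARQLG
print(translate(clean_sequence("AACTATTAA")))                         # NY
```

The two printed fragments match the first ten and last two residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.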