Gene Sbal223_3075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3075 
Symbol 
ID7088985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp3645777 
End bp3648743 
Gene Length2967 bp 
Protein Length988 aa 
Translation table11 
GC content46% 
IMG OID643461959 
ProductTonB-dependent receptor 
Protein accessionYP_002358983 
Protein GI217974232 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000179572 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTCATT GTAAACTCAG CGTCTTGACC ATGGCTCTCG CGACAGCAGG CTTTAGTGTG 
GTGTATACAC CGCAGTTAGC CTTTGCAGCA GAGACTACGC CAACTCAACA GGAGGCCAAA
CAAGCTCAAG ACGACAATAT AAAAAACGAT ACCAACACGA ATACCAAGAG TGTCGAAACC
AACAAAAAAG CCAGCGATGA AGCGGCAAAT ATCGAACGTA TTGAGGTTAA AGGTTACAGC
AAAAGTCTGA TTGATTCCCT CGATGCTAAG CGTTACGGCG ACACAGTATC AGAGCAATTA
TCGGCAGATG ATCTTGGCGC CTTACCCGAT GTATCAATGG CGGATGCGCT GACCCGCTTA
CCCGGTATTT CAGCGGTACG TACAGGTGGA CAAGCGGCGG AAATTAACAT TCGTGGTATG
TCTGGCGGTT TTGTTTTTTC AACCTTAAAT GGCCGTGAAC AGGTATCGAC GAGTGGTACT
CGCAGTATCG AGTTCGATCA ATACCCGTCT GAACTGATTT CATCCGCCGC TGTGTACAAA
TCCCCTAAAG CCTCACTCAT TGAAGGGGGC GTTGCAGGCA CAGTTGAACT GCAAACAGTC
AGCCCACTGC AATCCGATAA AGACCATAGT TTTCTGGTAA ATGCCCGTGG TATGTACAAC
GACAGAGCTA GTGAAGTGTA TGGTGCTGAC GAGTATGGCC ACAGATTGAG TATGTCTTAT
CAGGGTAAGT TCTTCGACGA TACCTTAGGC GTGGCGTTAG GTTATGCCCG TCTAAAACAA
CCCAGTGTTG CGACCCAATT TATCGGTTTG GCTTATAACG ACAGCAAAGA AGTTGATGGA
CTGGCTAATG ATACCAACGG TCCTATTGAT GCCCCTGCGA ATGAGTACAT CAGCGAGGGT
TTTGAGCTAC AGCACCTTGG TGGCGTTGAA ATTCGTAACG GTTATATGAG CGCGATTGAG
TGGGCACCTA GGGATAATTT TAAGCTCAAG GCCGATGCAT TTTTATCGCG TTTCGATAGC
GAATCGTTTG CCCGAGGATT CAGGGTTAAG TTTGGTGGCT CAGGCGCTGC GATTGCCAAT
CCTGTACTCG ATGGTAATTC TGTTATTGGC GGAACTTTTA ATCGCACCTC AAAGAGCTAC
ACCCGAGTTG AGACTGTAAA CGACGATAAC CAAGATTTTG ATGAAGTGAA TAGCTTTGGT
TTAAATGCTG ACTGGCAAGT GACAGACAGA CTGAATGTCG CCGCCGATAT TTCCTATTCC
TCAGCCAAAA GTAACTTTAG GAATGGCCTG TTATGGGCGA ACGTTGCTGT CGATGCTAAT
GCTGACACCC CAGTTTTCGA TGATAATGTC TCGATTAGTT ATCAACTTAA CGGGCTTAAT
CTGCCCGATG TGGGCTTTAA TCAAGCCGAT GCTTTCACCG ATCTGGATCG GGTGATGGTG
AGTAAGTACG GCATCTATCC CTATGAGAAT GAAGATGCGG TAAAGGCTTA TCGCCTTGAT
TTTAAATATG ACCTTGAGAA TGATTACATC AGCTCAGTAG AGTTTGGTGT ACGTTACTCT
GATCGCAATT ACTCCAATCG CCGCTCGGTA TTTGAATATG GCAATGATGG CGCTTTCTCT
AGCGCTGAGC CACCACTCAA GCTGACGTCT GATATGGCCT CGGTCGTCGA TTGGCAGGGT
GAGTTCAGTT ATTTCCCCTC TTATCTCGCC ATTGATCTCG ACGCTGCGCT AGCGGCTTGG
TTCCCTGAAG GTATCCCACA ACCAGTACAA ACTTGGGGAA ATGCCGATGG TGTACTCGAT
GCTAAGGGCT ATACCACTAA CTATTCTTGG ACCGTATTGC AAAGTGGGGA GGTGTTCGAA
AAGGTGTTTG CGGCTTATGC CATGGTTAAT TTTGATACCG AGATTGGTGG TATTCCAGTC
ACAGGAAATC TCGGTTTACG CCGTGTTGAA ACAGACCAAT CCGCAACTGT CTTAGAGAAT
GTCGGTGCAC ATCCAGAGCT TGGCGCTCAG TACATTGTCG ATGATTTAGG CATAGTAAAT
AACTACTATG CACCTAAAAT CAAAGGGATA GATTATGTTG ATTATCTACC GTCCCTTAAC
CTTAGTTTTA AGTTCACTGA AGATTCTCAA ATCCGTTTAG CTGCCGCTAA AGTAATGTCG
CGTCCGCCCA TTAACCGCTT GGCGGGAGAT GCGAGCGCAA CGGCTAATAG CGATGGGGTC
ATCAATGGTT CCAGTACTAA TAACCCTTAT TTAAAACCTT TCTATGCGGA TCAATATGAT
ATTTCCTACG AGAAATACTT TGATGAAGGC GCCTTTGTCG CGGCGCTATT CTACAAAAAT
ATTGATTCTT TTATTGATAC TGTGGCTATC ACCAATTTCG ATTTTAAGGG GAATGGCTTC
AACGTGCCGG ATTACATCGT CGATCCTGTT ACTGGGGTGC AAACCTCAAC AAGTAATGGC
ACTTACACTA CTGCAATGAA TAATGCTGAG GGTGGATACA TTCGCGGTTT AGAGTTGGCT
TATACCCAAG TGTTTGCTTC CTTGCCCGAT TTATTTTCCG GTTTAGGTTT TAACGCCAGT
TATTCTTATA CCGAGAGTGA GGTGCAGTCC ATTACCAGCC TAGGCGGTGA TAGTGCTACT
CAATCTTTAC CTGGGTTATC GAATAATGTG TTCAGTGCCA CCTTGTTTTA TGGCTATGAG
GGCTTTGAAA CCCGTATCAG CGCTCGTTAC CGTGATGCTT TTGTCTCTGA ACAGGTGGCG
ATTAACGATC AAGTGGTGAA CTTCGATTCC GAAACCGTGA TGGACTATCA AGCCTCCTAC
CAAGTAACGG ACGGTTTAAA CGTACTGTTC CAAGTCAATA ACCTCACAGA TGAACCGACT
AAGAGTTACT TCGGCACCGA GCAGAAAACC GGCACCCTAC AGTACTTTGG CCGCGAATTT
TTCTTGGGGT TCACCTATGC CCTGTAA
 
Protein sequence
MRHCKLSVLT MALATAGFSV VYTPQLAFAA ETTPTQQEAK QAQDDNIKND TNTNTKSVET 
NKKASDEAAN IERIEVKGYS KSLIDSLDAK RYGDTVSEQL SADDLGALPD VSMADALTRL
PGISAVRTGG QAAEINIRGM SGGFVFSTLN GREQVSTSGT RSIEFDQYPS ELISSAAVYK
SPKASLIEGG VAGTVELQTV SPLQSDKDHS FLVNARGMYN DRASEVYGAD EYGHRLSMSY
QGKFFDDTLG VALGYARLKQ PSVATQFIGL AYNDSKEVDG LANDTNGPID APANEYISEG
FELQHLGGVE IRNGYMSAIE WAPRDNFKLK ADAFLSRFDS ESFARGFRVK FGGSGAAIAN
PVLDGNSVIG GTFNRTSKSY TRVETVNDDN QDFDEVNSFG LNADWQVTDR LNVAADISYS
SAKSNFRNGL LWANVAVDAN ADTPVFDDNV SISYQLNGLN LPDVGFNQAD AFTDLDRVMV
SKYGIYPYEN EDAVKAYRLD FKYDLENDYI SSVEFGVRYS DRNYSNRRSV FEYGNDGAFS
SAEPPLKLTS DMASVVDWQG EFSYFPSYLA IDLDAALAAW FPEGIPQPVQ TWGNADGVLD
AKGYTTNYSW TVLQSGEVFE KVFAAYAMVN FDTEIGGIPV TGNLGLRRVE TDQSATVLEN
VGAHPELGAQ YIVDDLGIVN NYYAPKIKGI DYVDYLPSLN LSFKFTEDSQ IRLAAAKVMS
RPPINRLAGD ASATANSDGV INGSSTNNPY LKPFYADQYD ISYEKYFDEG AFVAALFYKN
IDSFIDTVAI TNFDFKGNGF NVPDYIVDPV TGVQTSTSNG TYTTAMNNAE GGYIRGLELA
YTQVFASLPD LFSGLGFNAS YSYTESEVQS ITSLGGDSAT QSLPGLSNNV FSATLFYGYE
GFETRISARY RDAFVSEQVA INDQVVNFDS ETVMDYQASY QVTDGLNVLF QVNNLTDEPT
KSYFGTEQKT GTLQYFGREF FLGFTYAL