Gene Sbal223_3192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3192 
Symbol 
ID7085805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp3784712 
End bp3785812 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content45% 
IMG OID643462076 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002359100 
Protein GI217974349 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.683038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00622676 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCTAT TTAATAAGAT GACCACTCTA GCTCTGGTCA CTGCGAGCGT ATTAGCGAGC 
GCAGCGGCCC AAGCGGAAGA AGTGGTTCGC GTGTATAACT GGTCAGATTA TATCGCGGAA
GATACCTTAG ACAACTTCAA GAAAGAAACG GGCATTCGGG TAATTTACGA TGTGTTCGAT
AGTAACGAAG TACTTGAGGC TAAATTACTG TCTGGTCGTA GTGGCTACGA TATCGTTGTC
CCTTCTAACC ACTTTCTCGC TAAGCAAATC AAAGCGGGTG CTTTCAAACC TTTAGATCGC
GCTAAGCTAC CTAACTTCAA AAATTTAAAT CCTGCCCTGA TGAAGCTACT TGAGAAAGCC
GATCCGGGTA ACCAGTATGC CGTGCCTTAT TTATGGGGAA CCAATGGTAT TGGTTACAAC
ATAGATAAAG TGAAAGCGGC TGTGGGTGAA GATGCGCCAT TCAACTCAAT GGAACTGATC
TTCAATCCTA AATATGCTGA AAAAATCTCT AAGTGTGGCT TTGCTATGCT GGACTCTGCC
GACGATATGG TGCCTCAAGC ACTGATTTAT TTAGGTTTAG ATCCTAACAG CGCCAACCCA
AGCGATTATG AAAAAGCTGG TGAGTTACTG GAAAAAATCC GTCCTTACGT GACCTATTTC
CACTCATCTC GCTATATTTC CGACTTAGCA AACGGTGACA TTTGTGTGGC CTTTGGTTTC
TCTGGCGACG TATTCCAAGC TAAAGCACGT GCTGAAGAGG CGGGTAATGG CAATAAGATT
GGTTACTCGA TTCCAAAAGA AGGCGCCAAC CTGTGGTTTG ATATGTTAGC TATCCCAGCC
GATTCGACTA ACGCAGATAA TGCACTGACG CTGATTAACT ATTTTCTCCG TCCAGAAGTC
ATAGCGCCTA TCTCTAACTA TGTGGCCTAT GCTAACCCGA ACGATCCTGC CCAACCTCTG
GTTGATGAGG CTATCCGCAC CGATCCCGCG ATTTATCCAC CACAAGAAGT GTTAGATAAA
CTTTATATTG GTGAAATCCG TCCTTTGAAA ATCCAACGCG TATTAACCCG TGTTTGGACC
AAAGTGAAGT CAGGACAATA G
 
Protein sequence
MKLFNKMTTL ALVTASVLAS AAAQAEEVVR VYNWSDYIAE DTLDNFKKET GIRVIYDVFD 
SNEVLEAKLL SGRSGYDIVV PSNHFLAKQI KAGAFKPLDR AKLPNFKNLN PALMKLLEKA
DPGNQYAVPY LWGTNGIGYN IDKVKAAVGE DAPFNSMELI FNPKYAEKIS KCGFAMLDSA
DDMVPQALIY LGLDPNSANP SDYEKAGELL EKIRPYVTYF HSSRYISDLA NGDICVAFGF
SGDVFQAKAR AEEAGNGNKI GYSIPKEGAN LWFDMLAIPA DSTNADNALT LINYFLRPEV
IAPISNYVAY ANPNDPAQPL VDEAIRTDPA IYPPQEVLDK LYIGEIRPLK IQRVLTRVWT
KVKSGQ