Gene Sbal223_3017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3017 
Symbol 
ID7088926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp3566571 
End bp3568112 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content49% 
IMG OID643461901 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002358925 
Protein GI217974174 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.227562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.425104 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAGA ATAGTCCAAA TGCAGGCAAC GACCTGCGGG AAGTCAAACA GCTCGGTATG 
TGGGCCTCCA TTACTAGCTT AGGTTATGTG TTTTGGCTGG TCGGCGGGAT GGAGTTAGTC
GAGCGTATCG CTTACTACGG CGTCAAAGCC AGTGCGGGAC TGTACGCTAA AGCGCCTGAG
TCTGCGGGCG GCCTTGGGAT CAGCCTAAGC GACTACGGCA TTATTATTTC CCTCTGGGCG
ATCATGCAAA CCTTTGTGCC CGTGTTCACG GGTGGCATGT CTGACCGCGT CGGCTACAAA
GAAACCATCT TTGGCTCCAC CATCATTAAA ATATTTGGCT ATCTGGTGAT GGCATTCTTC
CCCAGTTTTT GGGGCTTTCT TGCAGGCGCA TTACTCCTCG CCATCGGTAC TGGGATATTT
AAACCGGGCA TTCAAGGCAC CTTAGTGCTG TCTACCAATC GCAATAATAC CTCGATGGCT
TGGGGCATTT TTTACCAAGT CGTCAACATT GGTGGTTTCC TCGGGCCGTT AGTGGCCGTA
CATATGCGCC AATTGTCGTG GGACAATGTG TTTTTCGCCT GCGCCGCGAT TATCTCACTC
AACTTCTTAT TTTTACTGAC CTATACAGAA CCAGGCAAAG CCGAGCGACT CGCACGTAAT
AAACAAATCA AGTCGGGTGA AGTCAAACAA GAAGCCCTGT GGCGTGATGC TTGGCGTGAG
CTGAAAAAGC CGATTGTGAT CTACTACATG TTGGTATTTG CAGGCTTTTG GTTCTTGTAC
AATGCCCTAT TCGATGTGTT GCCTATCCAT ATTTCCGAAT GGGTCGATAC CAGCGTAATC
GTCACGTCCC TTTTTGGCAG CGAAGGCACC AGTAACGGCA TTCTGCAATT TTGGCTTGGC
CTCAATAACG AAGGCACTAA GGTGATGCCC GAAGGCATGC TCAACCTTAA TGCCGGTATG
ATCATGACCA GCTGTTTTAT CGTCGCCGCA CTGACGGCTA AATACCGCAT CACTACCGCC
ATGTTTATTG GTTGTTTGCT GAGTATTTTG GCCTTTGTGT TTATCGGCGC CTTCCATGCG
GCTTGGTTTA TCATGCTCGC AATTGCCATG TTCTCCATTG GCGAAATGAT GATTAGCCCG
AAGAAAAATG AGTTTATGGG CAACATTGCC CCTGAAGGTA AAAAAGCCAT GTACTTGGGC
TTTGTGATGT TACCCCAAGG GATTGGCTGG GGATTAGAAG GCTACTTTGG CCCTAAACTC
TATGAGATTT ATGCATCGAA AGAATTGTTT TCGAGGGATT TATTGTTAGA GCGCGGCATG
AACAGCACTG AGGTTAGCGC CATTCCCCAA GGTGAAGCCT TTACTACCTT GGTGAGCTAC
ACAGGTGAAA GCGCCCAGGA TCTTACCCAA CTGCTGTACC ACAGCCATAA CATTGGCATG
GCGTGGTACA TCATCGCCGC CATAGGGACT ATCTCAGCAG TGGGGATTTT TATCTATGGT
AAGTGGTTAC TCACACTGCA AAGAGCCCAG CAAGCCGCCT AA
 
Protein sequence
MSQNSPNAGN DLREVKQLGM WASITSLGYV FWLVGGMELV ERIAYYGVKA SAGLYAKAPE 
SAGGLGISLS DYGIIISLWA IMQTFVPVFT GGMSDRVGYK ETIFGSTIIK IFGYLVMAFF
PSFWGFLAGA LLLAIGTGIF KPGIQGTLVL STNRNNTSMA WGIFYQVVNI GGFLGPLVAV
HMRQLSWDNV FFACAAIISL NFLFLLTYTE PGKAERLARN KQIKSGEVKQ EALWRDAWRE
LKKPIVIYYM LVFAGFWFLY NALFDVLPIH ISEWVDTSVI VTSLFGSEGT SNGILQFWLG
LNNEGTKVMP EGMLNLNAGM IMTSCFIVAA LTAKYRITTA MFIGCLLSIL AFVFIGAFHA
AWFIMLAIAM FSIGEMMISP KKNEFMGNIA PEGKKAMYLG FVMLPQGIGW GLEGYFGPKL
YEIYASKELF SRDLLLERGM NSTEVSAIPQ GEAFTTLVSY TGESAQDLTQ LLYHSHNIGM
AWYIIAAIGT ISAVGIFIYG KWLLTLQRAQ QAA