Gene Sbal223_2804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_2804 
Symbol 
ID7088553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp3296004 
End bp3297980 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content49% 
IMG OID643461690 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_002358714 
Protein GI217973963 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCA AGTCGAGTAT TAATCCGCCG GTATTTTATT CGTCGGTCTT TTTCATCATC 
TTGATGGTGA TGATTTGTGC AGTATGGCCC ACAGAGGCTA ACACCGTTTT TAGATCAATA
CAGTCATGGA TAGAAGTGAA AGCCGGTTGG CTTTACATTC TGAGCGTGGC GTTTTTCCTG
ATTTTTATCA TCTTTGTCAT GGTCAGTCGC TTTGGCGATA TCAAACTCGG GCCAGATCAT
TCTGTGCCGG ATTACAGCTA TAAAAGCTGG ATTGCCATGT TGTTTTCGGC TGGTATGGGT
ATTGGCCTGA TGTTTTTTGG GGTCGCCGAA CCTGTAATGC ACTATCTGGC GCCGCCAGAT
GCTACACCTG AGAGTCTCGC CGCCGCAAAA GAGGCGATGA AGATTACTTT CTTCCATTGG
GGACTGCATG CTTGGGCGAT TTATGCCGTA GTGGCATTGA GTCTGGCCTA CTTCTCCTAT
CGTCATAAGT TACCGCTACT GCCACGAAGC GCCCTTTATC CCTTAATTGG TGAACGCATT
TATGGTCCTA TTGGCCATAT TGTCGATACC TTTGCCGTGC TCGGCACTAT GTTTGGGGTG
GCGACATCAC TGGGCTTTGG GGTGCTACAG GTTAACTCGG GTTTGAGTTA TCTATTTGAA
GGTTTACCTA ATAACGCAGT AGTGCAAGTG TGCTTGATCA TCGGCATTAC CTTGCTCGCG
ACCCTGTCGG TGTTTTCTGG CCTCGACAAA GGCGTCAAAC GCTTAAGCGA ACTCAATCTT
GGCCTCGCGA TTATCTTATT GCTTATCGTC TTGATCTTGG GCCCTACAGT CATGCTGTTG
CAGGCCTTTG TGCAAAACAC CGGCAGTTAT TTAAGTGATT TGGTTGGCAA AACCTTTAAT
TTGTATGCCT ATCAACATAA GGAAGATTGG CTCGGCGGCT GGACCTTGCT CTATTGGGGC
TGGTGGATTT CTTGGTCACC TTTTGTCGGT ACTTTTATTG CGCGTGTGAG CCGCGGTCGT
ACCATCCGTG AGTTTTTGAT TGGTGTGTTA TTTGTGCCAT CGGCGCTGAC CTTCCTGTGG
ATGACAGTGT TTGGTAACTC GGCGATTGAT TCGATTATGA ATCAAGGCGC GACCTACCTT
GCTGAAGCGG TGAATACCGA TGTGTCAGTG GCCTTGTTTG TGTTTTTCGA GCACATGCCG
TTCCCGACCT TACTCTCAGG GATTGCGATT TGTTTAGTGG TGACCTTCTT TGTGACCTCG
TCGGATTCGG GGTCCTTAGT GATTGATAAT CTGACCTCGG GCGGCGATAA CAATGCGCCT
GTATGGCAGA GGATCTTCTG GGCACTGTTG CAAGGGGTTG TGGCCTCGGT ATTGTTGTTG
GCTGGTGGTT TACAGGCGCT GCAAACGGCG GCTATCGCCA GCGCCATGCC GTTCCTCGTG
GTGATGTTAT TCATGTGTCT GGGATTATTT AAAGCCCTGA AGAATGACTG GCTTAAGATC
AACAGCGTGC AGTTACACAA TACCAGTGTG CAGTACGCCA AAACCAACAT GAGCTGGGAA
GAACGCATTG GTGTGCTCGT GTCGCATCCT ACCCATGAAG AAGCGCAAGT GTTCTTGAAT
AATGTCGCGA CGCCTGCTTT GTCTAAGGTG TGTCAGCACT TTATGGCCAA AGGCATAGTG
GCCGATCTCG AATACTTAGA TGGCCGCGTA CGTTTGGTGA TCAGCAACGA GGTGAACTTG
CCCTTTGTTT ACGGAGTACG CACCCGTTGT TTTGACATCA CTAACCCGAT AGGGACTGAA
ATCGAGCAGG GCAATACCTT GTATTACCGC GCCGAAGTGT ACCTTGAACA AGGTGGTCAG
CATTACGATG TGATGGGGTA TACCGAAGAG CAAATCCTCG CGGATGTGGT GACTCAGTAT
GAAAAATACT TACACTACTT GCATCTGTCG AATGCCGATC ATGCCCACAT AAGCTAG
 
Protein sequence
MSIKSSINPP VFYSSVFFII LMVMICAVWP TEANTVFRSI QSWIEVKAGW LYILSVAFFL 
IFIIFVMVSR FGDIKLGPDH SVPDYSYKSW IAMLFSAGMG IGLMFFGVAE PVMHYLAPPD
ATPESLAAAK EAMKITFFHW GLHAWAIYAV VALSLAYFSY RHKLPLLPRS ALYPLIGERI
YGPIGHIVDT FAVLGTMFGV ATSLGFGVLQ VNSGLSYLFE GLPNNAVVQV CLIIGITLLA
TLSVFSGLDK GVKRLSELNL GLAIILLLIV LILGPTVMLL QAFVQNTGSY LSDLVGKTFN
LYAYQHKEDW LGGWTLLYWG WWISWSPFVG TFIARVSRGR TIREFLIGVL FVPSALTFLW
MTVFGNSAID SIMNQGATYL AEAVNTDVSV ALFVFFEHMP FPTLLSGIAI CLVVTFFVTS
SDSGSLVIDN LTSGGDNNAP VWQRIFWALL QGVVASVLLL AGGLQALQTA AIASAMPFLV
VMLFMCLGLF KALKNDWLKI NSVQLHNTSV QYAKTNMSWE ERIGVLVSHP THEEAQVFLN
NVATPALSKV CQHFMAKGIV ADLEYLDGRV RLVISNEVNL PFVYGVRTRC FDITNPIGTE
IEQGNTLYYR AEVYLEQGGQ HYDVMGYTEE QILADVVTQY EKYLHYLHLS NADHAHIS