Gene Spro_1512 details


Gene Information

Locus tag: Spro_1512
Symbol:
ID: 5602802
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 1646501
End bp: 1648540
Gene Length: 2040 bp
Protein Length: 679 aa
Translation table: 11
GC content: 56%
IMG OID: 640937044
Product: choline transport protein BetT
Protein accession: YP_001477744
Protein GI: 157369755
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
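
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the final codon of the CDS is a stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1646501
end_bp = 1648540
gene_length_bp = 2040
protein_length_aa = 679

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)
print(remainder == 0)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree: 2040 bp corresponds to 680 codons, that is 679 amino acids plus one stop codon.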


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0971262
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACAC GCGAAACCTC CTCAAAACCG CAAAAGGATG GCCTTAATCC TGTGGTTTTC 
TTTACCTCGG CAGGGCTGAT TTTGGCCTTC TCGCTGATGA CCATCTTTTT TACCGATTTC
TCCGGGCGAT GGATCACCCG CACCCTGAAT TGGGTTTCAA CCACCTTCGG CTGGTACTAT
CTGCTGGCCG CCACCCTGTA CATCGTATTT GTGGTGTTTA TCGCCGCTTC GCGCTTTGGT
TCTATCAAGC TCGGGCCTGA ACAATCAAAA CCCGAATTCA GCCTGATGAG CTGGGCGGCG
ATGCTGTTCG CCGCCGGTAT CGGCATCGAT CTGATGTTCT TCTCGGTCGC CGAACCGGTG
ACACAATATA TGATGCCGCC GGAAGGCCAG GGCCAGACGC TGGAAGCCGC ACGCCAGGCA
ATGGTCTGGA CGCTGTTCCA CTACGGCCTG ACCGGCTGGT CGATGTATGC GTTGATGGGT
ATCGCCCTCG GCTATTTCAG CTACCGCTAT AACCTGCCGC TGACCATCCG CTCTGCCCTG
TACCCGATTT TCGGTAAACG CATCAATGGC CCGATTGGCC ACAGCGTAGA TATCGCCGCG
GTGCTGGGTA CCATCTTCGG TATCGCCACC ACGCTCGGTA TTGGCGTAGT GCAGTTGAAC
TACGGGCTGA AAGTGCTGTT CCACATACCG GAAAACCTGA CGGTTCAGGC CGCGTTGATC
CTGCTTTCGG TGATCATGGC CACCATTTCG GTGACTTCCG GCGTCAATAA GGGCATCCGT
ATTCTGTCCG AGCTCAACGT GCTGTTGGCA CTGGGGCTGA TTTTATTCGT GCTGTTCTTC
GGCGATACCG AGTTCCTGCT CAATGCGCTG GTGCTTAACG TCGGTGATTA CGTCAACCGC
TTTATGGGCA TGACGCTCAA CAGCTTCGCC TTCGATCGCC CGGTGGAATG GATGAACAAC
TGGACCCTGT TCTTCTGGGC ATGGTGGGTG GCCTGGGCGC CGTTTGTCGG CCTGTTCCTG
GCGCGAATTT CACGTGGGCG CACCATTCGC CAGTTTGTGG TCGGCACGCT GATTATTCCG
TTCGTGTTTA CCCTGCTGTG GTTGTCAATC TTCGGTAACA GCGCGCTGTA CCAGATTATT
CACGGCAACG CCGCCTTCGC GCAGGAAGTG ATGCAGTACC CGGAACGGGG GTTCTACAGC
CTGCTGGCGC AGTATCCGGG CTTTACCTTC AGCGCTTCGG TGGCCACTAT CACTGGGCTG
CTGTTCTACG TCACCTCAGC GGATTCAGGC TCGCTGGTGC TGGGTAACTT CACCTCACGC
CTGAGTGATA TCAATAATGA CGCCCCCAAC TGGCTGCGTA TTTTCTGGTC GGTGGCGATT
GGTCTGCTGA CCATAGGTAT GTTGATGACC GACGGCGTGC CTGCCCTGCA GAAAACCACG
GTGATCATGG GCCTGCCGTT CAGTTTTGTG ATCTTCTTCG TGATGGCCGG GTTGTATAAA
TCGCTACGGG TGGAGGACTA CCGCAAGGCC AGCGCGCTGA ACACCAATGC GCCCATGCCG
GTGTCCAGCA ATGACGTGCT GAACTGGAAA CAGCGCCTGT CGCGGGTAAT GAATTATCCG
GGCAGCCAAT ATACACAGAA AATGATGGAT ACCCGCTGTC GGCCGGCGAT GCAGGAAGTG
GCGCGTGAGC TGGAATTACG CGGCGCCAAA GTGGAGTTCA GTGAAGTGCC GCCAACCGAA
GATGAGCGTT TAAACCATCT GGAGTTGCTG GTGCATTTAG GCGAGGAGCA GAACTTTATC
TATCAGATCT GGCCGCAGCG TTACTCGGTG CCGGGCTTTA CCTATCGCGC CCGCTCCGGC
AAGTCGCACT ACTACCGGCT GGAAACCTTC CTGATGGAAG GCACCCAGGG CAATGACCTG
ATGGACTACA GCAAGGAACA GGTGATCGGC GATATCCTCG ATCAGTACGA AAAACACCTG
AACTTTGTGC ACATCCACCG CGAGGCACCG GGTAACACCC TGACCTTCCC GGATATGTAA
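
The gene sequence is laid out in blocks of 10 bases, 60 per line, so it needs to be joined before its length or GC content can be computed. The following is a minimal sketch of that clean-up; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, kept in its blocked layout.
first_line = "ATGACAACAC GCGAAACCTC CTCAAAACCG CAAAAGGATG GCCTTAATCC TGTGGTTTTC"
seq = clean_sequence(first_line)
print(len(seq))
print(f"{gc_fraction(seq):.1%}")
```

Passing all 34 lines of the listing through `clean_sequence` should give a string of 2040 bases, and `gc_fraction` on that string can be compared with the GC content reported in the gene information.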
 
Protein sequence
MTTRETSSKP QKDGLNPVVF FTSAGLILAF SLMTIFFTDF SGRWITRTLN WVSTTFGWYY 
LLAATLYIVF VVFIAASRFG SIKLGPEQSK PEFSLMSWAA MLFAAGIGID LMFFSVAEPV
TQYMMPPEGQ GQTLEAARQA MVWTLFHYGL TGWSMYALMG IALGYFSYRY NLPLTIRSAL
YPIFGKRING PIGHSVDIAA VLGTIFGIAT TLGIGVVQLN YGLKVLFHIP ENLTVQAALI
LLSVIMATIS VTSGVNKGIR ILSELNVLLA LGLILFVLFF GDTEFLLNAL VLNVGDYVNR
FMGMTLNSFA FDRPVEWMNN WTLFFWAWWV AWAPFVGLFL ARISRGRTIR QFVVGTLIIP
FVFTLLWLSI FGNSALYQII HGNAAFAQEV MQYPERGFYS LLAQYPGFTF SASVATITGL
LFYVTSADSG SLVLGNFTSR LSDINNDAPN WLRIFWSVAI GLLTIGMLMT DGVPALQKTT
VIMGLPFSFV IFFVMAGLYK SLRVEDYRKA SALNTNAPMP VSSNDVLNWK QRLSRVMNYP
GSQYTQKMMD TRCRPAMQEV ARELELRGAK VEFSEVPPTE DERLNHLELL VHLGEEQNFI
YQIWPQRYSV PGFTYRARSG KSHYYRLETF LMEGTQGNDL MDYSKEQVIG DILDQYEKHL
NFVHIHREAP GNTLTFPDM
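
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an illustration rather than the database's own method: it builds the standard codon table, whose codon-to-amino-acid assignments are shared by translation table 11 (table 11 differs in which codons may act as start codons). The name `translate` is hypothetical, and alternative start codons are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate the blocked layout
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence, and its last three codons.
print(translate("ATGACAACAC GCGAAACC"))
print(translate("GATATGTAA"))
```

The first call covers the opening six codons of the gene and the second covers its final three, including the TAA stop codon; the results can be compared with the start and end of the protein sequence listed above.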