Gene EcSMS35_2801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2801 
SymbolproX 
ID6144425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2882619 
End bp2883611 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content52% 
IMG OID641617670 
Productglycine betaine transporter periplasmic subunit 
Protein accessionYP_001744830 
Protein GI170681945 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.56693 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0222237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACATA GCGTACTTTT TGCGACAGCG TTTGCCACGC TTATCTCTAC ACAAACTTTT 
GCTGCCGATC TGCCGGGCAA AGGCATTACT GTTAATCCAG TTCAGAGTAC CATCACTGAA
GAAACCTTCC AGACGCTGCT GGTCAGTCGT GCACTGGAGA AATTAGGTTA TACCGTCAAT
AAACCCAGCG AAGTGGATTA CAACGTTGGC TACACCTCAC TTGCTTCCGG CGATGCAACC
TTCACCGCCG TGAACTGGAC GCCGCTGCAT GACAACATGT ATGAAGCTGC CGGTGGCGAT
AAGAAATTTT ATCGTGAAGG AGTATTTGTT AACGGCGCGG CACAGGGTTA CCTGATCGAT
AAGAAAACCG CCGACCAGTA TAAAATTACC AACATCGCGC AACTTAAAGA TCCGAAGATC
GCCAAACTGT TCGATACCAA CGGCGACGGT AAAGCGGATT TAACCGGCTG TAACCCAGGC
TGGGGCTGCG AAGGTGCGAT CAACCACCAG CTTGCCGCGT ATGGACTGAC CAATACCGTG
ACGCATAATC AGGGGAACTA CGCAGCGATG ATGGCTGACA CCATCAGTCG TTACAAAGAG
GGTAAGCCGG TGTTTTACTA CACCTGGACG CCGTACTGGG TGAGTAACGA ACTGAAACCG
GGGAAAGATG TGGTCTGGTT GCAGGTGCCG TTCTCCGCAC TGCCGGGCGA TAAAAATGCC
GATACCAAAC TGCCGAATGG CGCGAATTAC GGCTTCCCGG TCAGTACCAT GCATATCGTT
GCCAACAAAG CCTGGGCTGA GAAAAACCCG GCAGCAGCGA AACTGTTTGC CATTATGCAG
TTACCAGTGG CAGATATTAA CGCCCAGAAC GCCATTATGC ATGACGGCAA AGCCTCAGAA
GGCGATATTC AGGGCCACGT TGATGGTTGG ATCAAAGCCC ACCAGCAGCA GTTCGATGGC
TGGGTGAATG AAGCGCTGGC AGCGCAGAAG TAA
 
Protein sequence
MRHSVLFATA FATLISTQTF AADLPGKGIT VNPVQSTITE ETFQTLLVSR ALEKLGYTVN 
KPSEVDYNVG YTSLASGDAT FTAVNWTPLH DNMYEAAGGD KKFYREGVFV NGAAQGYLID
KKTADQYKIT NIAQLKDPKI AKLFDTNGDG KADLTGCNPG WGCEGAINHQ LAAYGLTNTV
THNQGNYAAM MADTISRYKE GKPVFYYTWT PYWVSNELKP GKDVVWLQVP FSALPGDKNA
DTKLPNGANY GFPVSTMHIV ANKAWAEKNP AAAKLFAIMQ LPVADINAQN AIMHDGKASE
GDIQGHVDGW IKAHQQQFDG WVNEALAAQK