Gene EcSMS35_2800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2800 
SymbolproW 
ID6143762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2881497 
End bp2882561 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content58% 
IMG OID641617669 
Productglycine betaine transporter membrane protein 
Protein accessionYP_001744829 
Protein GI170680258 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4176] ABC-type proline/glycine betaine transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.591808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0101611 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATC AAAATAATCC GTGGGATACC ACGCCAGCGG CGGACAGTGC TGCACAATCC 
GCAGACGCCT GGGGTACACC GGCGACTGCA CCGACTGACG GCGGTGGCGC TGACTGGCTG
ACCAGTACGC CTGCGCCAAA CGTCGAGCAT TTTAATATTC TCGATCCGTT CCATAAAACG
CTAATCCCGC TCGACAGTTG GGTCACTGAA GGGATCGACT GGGTCGTTAC CCATTTTCGT
CCCGTCTTTC AGGGCGTGCG CCTTCCGGTT GATTACATCC TCAACGGTTT CCAGCAATTG
CTGCTGGGTA TGCCCGCGCC GGTGGCGATT ATCGTTTTCG CTCTCATCGC CTGGCAGATT
TCCGGGGTCG GAATGGGCGT GGCGACGCTG GTTTCGCTGA TTGCCATCGG CGCAATCGGT
GCCTGGTCGC AGGCCATGGT TACCCTGGCG CTGGTGTTAA CCGCCCTGCT GTTCTGTATC
GTCATAGGTT TGCCGTTGGG GATCTGGCTG GCGAGAAGTC CGCGAGCGGC GAAAATTATT
CGTCCACTGC TTGATGCCAT GCAGACCACG CCCGCGTTTG TTTATCTGGT GCCAATCGTC
ATGCTGTTTG GTATCGGTAA CGTGCCGGGC GTGGTGGTGA CAATCATCTT TGCGCTGCCG
CCGATTATCC GTCTGACGAT TCTGGGAATT AACCAGGTTC CGGCGGATCT GATTGAAGCC
TCGCGCTCAT TCGGTGCCAG CCCGCGCCAG ATGCTGTTCA AAGTTCAGTT ACCACTGGCG
ATGCCAACCA TTATGGCGGG CGTTAACCAG ACGCTGATGC TGGCCCTTTC TATGGTGGTC
ATCGCCTCGA TGATTGCCGT CGGCGGGCTG GGTCAGATGG TACTTCGCGG TATCGGTCGT
CTGGATATGG GGCTTGCCAC CGTTGGCGGC GTCGGGATTG TGATCCTCGC CATTATCCTC
GACCGCCTGA CGCAGGCCGT TGGGCGCGAC TCACGCAGTC GCGGCAACCG TCGCTGGTAC
ACCACTGGCC CTGTCGGTCT GCTGACCCGC CCATTCATTA AGTAA
 
Protein sequence
MADQNNPWDT TPAADSAAQS ADAWGTPATA PTDGGGADWL TSTPAPNVEH FNILDPFHKT 
LIPLDSWVTE GIDWVVTHFR PVFQGVRLPV DYILNGFQQL LLGMPAPVAI IVFALIAWQI
SGVGMGVATL VSLIAIGAIG AWSQAMVTLA LVLTALLFCI VIGLPLGIWL ARSPRAAKII
RPLLDAMQTT PAFVYLVPIV MLFGIGNVPG VVVTIIFALP PIIRLTILGI NQVPADLIEA
SRSFGASPRQ MLFKVQLPLA MPTIMAGVNQ TLMLALSMVV IASMIAVGGL GQMVLRGIGR
LDMGLATVGG VGIVILAIIL DRLTQAVGRD SRSRGNRRWY TTGPVGLLTR PFIK