Gene EcSMS35_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2003 
SymbolpotD 
ID6144835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2023681 
End bp2024727 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content51% 
IMG OID641616879 
Productspermidine/putrescine ABC transporter periplasmic substrate-binding protein 
Protein accessionYP_001744055 
Protein GI170680447 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.59191 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT GGTCACGCCA CCTGCTCGCG GCGGGTGCTC TGGCACTGGG CATGAGCGCC 
GCTCACGCCG ATGACAACAA CACGCTGTAT TTCTACAACT GGACCGAGTA CGTGCCGCCA
GGACTGCTTG AACAGTTCAC CAAAGAAACC GGTATTAAGG TTATCTATTC GACTTACGAG
TCGAACGAAA CCATGTACGC GAAGCTGAAA ACGTACAAAG ACGGTGCCTA TGACCTGGTG
GTTCCTTCAA CCTATTACGT CGATAAAATG CGTAAAGAAG GGATGATCCA GAAGATCGAC
AAGTCGAAGT TAAGCAATTT CAGCAACCTC GATCCAGACA TGCTCAACAA GCCGTTTGAC
CCGAATAACG ACTACTCCAT TCCGTATATC TGGGGTGCGA CGGCGATTGG CGTTAACGGT
GATGCGGTGG ATCCGAAATC TGTCACCAGC TGGGCCGATC TGTGGAAACC TGAGTACAAA
GGCAGCCTGC TGCTGACCGA CGATGCCCGT GAAGTGTTCC AGATGGCGCT GCGCAAACTG
GGCTACTCCG GTAACACTAC CGATCCGAAA GAGATTGAAG CTGCATATAA CGAGCTGAAA
AAACTGATGC CAAACGTCGC AGCGTTTAAC TCCGATAACC CGGCTAACCC GTACATGGAA
GGCGAAGTTA ACCTCGGCAT GATCTGGAAC GGTTCTGCTT TCGTTGCACG GCAGGCGGGT
ACGCCAATTG ACGTGGTGTG GCCGAAAGAA GGCGGCATTT TCTGGATGGA CAGCCTGGCG
ATCCCGGCAA ATGCCAAAAA CAAAGAAGGT GCGCTGAAAT TGATCAACTT CCTGCTGCGC
CCCGATGTGG CAAAACAGGT TGCTGAAACT ATCGGTTATC CAACGCCAAA CCTTGCGGCG
CGTAAGCTGT TAAGTCCTGA AGTGGCGAAC GACAAAACGC TCTACCCGGA TGCTGAAACC
ATTAAAAATG GTGAATGGCA GAATGACGTT GGCGCAGCCA GCAGCATTTA TGAAGAGTAT
TATCAGAAGC TGAAAGCAGG ACGTTAA
 
Protein sequence
MKKWSRHLLA AGALALGMSA AHADDNNTLY FYNWTEYVPP GLLEQFTKET GIKVIYSTYE 
SNETMYAKLK TYKDGAYDLV VPSTYYVDKM RKEGMIQKID KSKLSNFSNL DPDMLNKPFD
PNNDYSIPYI WGATAIGVNG DAVDPKSVTS WADLWKPEYK GSLLLTDDAR EVFQMALRKL
GYSGNTTDPK EIEAAYNELK KLMPNVAAFN SDNPANPYME GEVNLGMIWN GSAFVARQAG
TPIDVVWPKE GGIFWMDSLA IPANAKNKEG ALKLINFLLR PDVAKQVAET IGYPTPNLAA
RKLLSPEVAN DKTLYPDAET IKNGEWQNDV GAASSIYEEY YQKLKAGR