Gene EcSMS35_1831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1831 
SymbolsapD 
ID6143579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1852107 
End bp1853099 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content51% 
IMG OID641616707 
Productpeptide ABC transporter, ATP-binding protein SapD 
Protein accessionYP_001743885 
Protein GI170679617 
COG category[V] Defense mechanisms 
COG ID[COG4170] ABC-type antimicrobial peptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.000565495 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCATTAC TCGATATTCG TAACCTGACC ATTGAATTTA AAACCGGTGA TGAGTGGGTT 
AAAGCCGTCG ACCGCGTAAG TATGACCTTA ACCGAAGGTG AAATCCGCGG TCTTGTTGGC
GAATCAGGTT CCGGCAAAAG TTTGATTGCG AAAGCAATTT GTGGGGTGAA TAAAGATAAC
TGGCGTGTTA CTGCTGACCG TATGCGTTTT GATGATATCG ATTTGCTGCG TCTCTCCGCA
CGCGAACGGC GCAAGCTGGT TGGTCACAAC GTGTCGATGA TTTTCCAGGA ACCGCAGTCG
TGTCTTGACC CTTCAGAACG TGTGGGTCGC CAGTTGATGC AAAACATCCC AGCCTGGACC
TATAAAGGCC GTTGGTGGCA GCGTTTTGGC TGGCGCAAAC GCCGTGCCAT TGAACTGCTG
CACCGCGTGG GGATTAAAGA TCACAAAGAT GCGATGCGCA GTTTTCCCTA TGAGTTGACC
GAAGGTGAAT GTCAGAAAGT GATGATAGCC ATTGCACTGG CGAATCAACC GCGTCTGCTG
ATTGCTGACG AACCGACAAA CTCAATGGAG CCAACAACCC AGGCGCAAAT CTTTCGCCTG
CTGACGCGTC TCAACCAAAA CAGTAATACC ACTATTTTGC TTATCAGCCA TGACTTACAA
ATGCTTAGCC AATGGGCGGA TAAAATTAAC GTGCTTTACT GCGGTCAAAC GGTGGAAACC
GCGCCAAGTA AGGAGTTGGT GACAATGCCA CATCATCCTT ATACCCAGGC GCTGATCCGC
GCGATACCAG ACTTCGGCAG CGCGATGCCG CATAAAAGTC GCCTCAATAC GCTGCCCGGC
GCTATCCCGC TGCTGGAACA GTTACCGATT GGGTGTCGTC TGGGGCCGCG TTGCCCGTAT
GCACAACGAG AATGCATTGT GACGCCACGT TTGACGGGGG CGAAAAATCA TCTCTATGCC
TGTCATTTCC CGCTGAACAT GGAGAAAGAA TGA
 
Protein sequence
MPLLDIRNLT IEFKTGDEWV KAVDRVSMTL TEGEIRGLVG ESGSGKSLIA KAICGVNKDN 
WRVTADRMRF DDIDLLRLSA RERRKLVGHN VSMIFQEPQS CLDPSERVGR QLMQNIPAWT
YKGRWWQRFG WRKRRAIELL HRVGIKDHKD AMRSFPYELT EGECQKVMIA IALANQPRLL
IADEPTNSME PTTQAQIFRL LTRLNQNSNT TILLISHDLQ MLSQWADKIN VLYCGQTVET
APSKELVTMP HHPYTQALIR AIPDFGSAMP HKSRLNTLPG AIPLLEQLPI GCRLGPRCPY
AQRECIVTPR LTGAKNHLYA CHFPLNMEKE