Gene EcHS_A1572 details


Gene Information

Locus tag: EcHS_A1572
Symbol:
ID: 5592255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1577743
End bp: 1579293
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 50%
IMG OID: 640920725
Product: putative ABC transporter periplasmic-binding protein yddS precursor
Protein accession: YP_001458281
Protein GI: 157160963
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
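
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that does not appear in the protein; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1577743, 1579293)
print(length, protein_length_aa(length))  # 1551 516
```

Both results agree with the Gene Length and Protein Length fields of the record.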


Plasmid Coverage information

Num covering plasmid clones: 60
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAGAT CGATATCGTT TCGTCCCACA TTGCTCGCGC TCGTCCTTGC CACAAATTTC 
CCGGTTGCGC ACGCCGCCGT ACCAAAAGAT ATGCTGGTGA TTGGTAAGGC CGCCGATCCA
CAAACCCTCG ACCCGGCGGT AACAATAGAT AATAACGACT GGACAGTGAC CTACCCGTCT
TATCAGCGGC TGGTTCAGTA CAAAACGGAC GGTGATAAAG GCTCAACCGA CGTTGAAGGC
GATCTGGCAA GTAGCTGGAA AGCGTCTGAC GATCAAAAAG AGTGGACGTT CACCCTGAAA
GATAATGCTA AATTTGCCGA TGGCACACCT GTCACTGCCG AAGCAGTAAA ACTTTCTTTT
GAGCGGCTAC TAAAAATCGG CCAGGGGCCA GCAGAAGCAT TTCCCAAAGA TTTAAAGATT
GATGCTCCCG ACGAACATAC GGTGAAGTTT ACCCTTAGCC AGCCATTCGC ACCGTTCCTC
TACACGCTGG CGAATGACGG TGCATCCATT ATCAATCCGG CGGTGTTAAA GGAACATGCA
GCGGATGATG CCCGCGGCTT CCTCGCGCAA AATACCGCCG GTTCCGGACC ATTTATGCTG
AAAAGCTGGC AAAAAGGTCA GCAATTAGTT CTGGTGCCAA ATCCGCATTA CCCTGGCAAT
AAACCGAACT TTAAGCGAGT ATCGGTAAAA ATTATTGGTG AAAGTGCCTC CCGTCGCCTG
CAGCTCTCCC GTGGTGATAT TGACATTGCC GATGCGCTGC CGGTGGATCA ACTCAACGCC
CTGAAGCAGG AAAACAAAGT CAATGTGGCA GAGTATCCGT CACTGCGCGT CACCTACCTG
TATCTGAATA ACAGCAAAGC GCCACTTAAT CAGGCGGATC TGCGTCGGGC CATTTCCTGG
TCTACCGATT ACCAGGGAAT GGTTAACGGC ATTCTGAGTG GTAACGGAAA ACAGATGCGC
GGCCCGATTC CGGAAGGCAT GTGGGGCTAC GATGCGACGG CAATGCAATA CAACCATGAC
GAAACGAAAG CCAAAGCTGA ATGGGATAAA GTGACGAACA AACCCACCAG CCTGACGTTT
CTCTATTCTG ATAATGATCC GAACTGGGAG CCTATTGCTC TGGCGACACA ATCCAGTCTC
AACAAGCTGG GCATCAATGT GAAGCTGGAA AAGCTGGCGA ACGCCACCAT GCGCGACAGA
GTGGGTAAAG GTGATTACGA CATTGCGATT GGCAACTGGA GTCCGGATTT TGCCGACCCG
TATATGTTTA TGAATTACTG GTTTGAGTCA GACAAAAAAG GTCTGCCGGG TAACCGCTCG
TTCTATGAAA ACAGTGAGGT CGATAAGTTA CTGCGCAATG CGCTTGCGAC CACCGACCAG
ACGCAGCGTA CCCGGGACTA CCAGCAGGCA CAGAAAATCG TCATTGATGA CGCTGCTTAT
ATGTACCTGT TCCAGAAAAA CTACCAACTG GCGATGAACA AAGGGGTGAA AGGCTTTGTG
TTCAATCCCA TGCTGGAACA GGTCTTCAAT ATCAATACCA TGAGTAAATA A
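
A GC content figure such as the one in the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the value is the percentage of G and C among the unambiguous bases; how the database rounds the figure is not stated in the record, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among A/C/G/T characters; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First ten bases of the gene sequence above: 3 of 10 are G or C.
print(gc_percent("ATGAAGAGAT"))  # 30.0
```

Passing the full gene sequence, with its 10-base spacing left in place, gives the value for the whole gene.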
 
Protein sequence
MKRSISFRPT LLALVLATNF PVAHAAVPKD MLVIGKAADP QTLDPAVTID NNDWTVTYPS 
YQRLVQYKTD GDKGSTDVEG DLASSWKASD DQKEWTFTLK DNAKFADGTP VTAEAVKLSF
ERLLKIGQGP AEAFPKDLKI DAPDEHTVKF TLSQPFAPFL YTLANDGASI INPAVLKEHA
ADDARGFLAQ NTAGSGPFML KSWQKGQQLV LVPNPHYPGN KPNFKRVSVK IIGESASRRL
QLSRGDIDIA DALPVDQLNA LKQENKVNVA EYPSLRVTYL YLNNSKAPLN QADLRRAISW
STDYQGMVNG ILSGNGKQMR GPIPEGMWGY DATAMQYNHD ETKAKAEWDK VTNKPTSLTF
LYSDNDPNWE PIALATQSSL NKLGINVKLE KLANATMRDR VGKGDYDIAI GNWSPDFADP
YMFMNYWFES DKKGLPGNRS FYENSEVDKL LRNALATTDQ TQRTRDYQQA QKIVIDDAAY
MYLFQKNYQL AMNKGVKGFV FNPMLEQVFN INTMSK
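
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below builds the codon table from the standard NCBI ordering of bases (T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted. The names `CODON_TABLE` and `translate` are illustrative, and the function assumes the input contains only A, C, G and T (an ambiguous base would raise a `KeyError`).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    clean = "".join(dna.split()).upper()  # drop the 10-base spacing and newlines
    protein = []
    for i in range(0, len(clean) - 2, 3):
        aa = CODON_TABLE[clean[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAGAGAT CGATATCGTT TCGTCCCACA"))  # MKRSISFRPT
```

The output matches the first ten residues of the protein sequence listed in the record.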