Gene EcHS_A1524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1524 
SymbolydcS 
ID5594940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1532084 
End bp1533229 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content53% 
IMG OID640920679 
ProductABC transporter, periplasmic substrate-binding protein 
Protein accessionYP_001458235 
Protein GI157160917 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones62 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGA CATTTGCCCG CAGCAGCCTG TGTGCGCTCA GCATGACAAT AATGACCGCT 
CACGCCGCCG AACCGCCTAC CAATTTAGAT AAACCGGAAG GGCGACTGGA TATTATCGCC
TGGCCGGGAT ACATCGAACG CGGACAAACT GATAAACAAT ACGACTGGGT AACGCAGTTC
GAAAAAGAGA CAGGCTGCGC GGTGAATGTG AAAACCGCCG CGACTTCCGA TGAAATGGTC
AGTCTGATGA CCAAAGGGGG TTACGATCTG GTTACGGCAT CCGGCGATGC CTCGCTGCGT
TTGATTATGG GTAAACGCGT GCAGCCGATT AATACCGCAT TGATTCCCAA CTGGAAAACG
CTCGATCCGC GCGTGGTTAA AGGCGACTGG TTTAATGTTG GCGGCAAAGT TTACGGCACA
CCTTACCAAT GGGGGCCGAA CCTGCTGATG TACAACACTA AAACCTTCCC GACGCCGCCG
GATAGCTGGC AAGTGGTTTT TGTTGAGCAA AATCTGCCGG ACGGCAAGAG CAATAAAGGC
CGCGTTCAGG CTTATGATGG CCCTATCTAC ATTGCGGACG CTGCGTTGTT CGTTAAAGCC
ACTCAGCCGC AGTTGGGCAT CAGCGATCCG TATCAACTCA CCGAAGAACA GTACCAGGCG
GTGCTGAAAG TGCTGCGCGC TCAACACAGT TTGATCCATC GCTACTGGCA TGACAATACC
GTGCAAATGA GCGATTTCAA AAACGAGGGT GTGGTTGCTT CCAGTGCCTG GCCCTATCAG
GCCAACGCCC TGAAAGCCGA AGGCCAGCCT GTTGCTACCG TTTTCCCGAA GGAGGGTGTT
ACCGGTTGGG CTGATACCAC CATGCTGCAT AGCGAAGCGA AACATCCGGT TTGCGCCTAC
AAATGGATGA ACTGGTCATT AACGCCAAAA GTGCAGGGCG ATGTGGCGGC CTGGTTTGGC
TCGTTACCGG TAGTGCCGGA AGGGTGTAAA GCCAGTCCGT TATTAGGCGA AAAAGGTTGT
GAAACCAACG GTTTTAACTA TTTCGACAAA ATCGCCTTCT GGAAAACGCC TATAGCAGAA
GGGGGCAAGT TTGTTCCCTA CAGTCGCTGG ACGCAGGATT ACATTGCCAT TATGGGCGGT
CGCTAA
 
Protein sequence
MSKTFARSSL CALSMTIMTA HAAEPPTNLD KPEGRLDIIA WPGYIERGQT DKQYDWVTQF 
EKETGCAVNV KTAATSDEMV SLMTKGGYDL VTASGDASLR LIMGKRVQPI NTALIPNWKT
LDPRVVKGDW FNVGGKVYGT PYQWGPNLLM YNTKTFPTPP DSWQVVFVEQ NLPDGKSNKG
RVQAYDGPIY IADAALFVKA TQPQLGISDP YQLTEEQYQA VLKVLRAQHS LIHRYWHDNT
VQMSDFKNEG VVASSAWPYQ ANALKAEGQP VATVFPKEGV TGWADTTMLH SEAKHPVCAY
KWMNWSLTPK VQGDVAAWFG SLPVVPEGCK ASPLLGEKGC ETNGFNYFDK IAFWKTPIAE
GGKFVPYSRW TQDYIAIMGG R