Gene ECH74115_2045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2045 
SymbolydcS 
ID6970725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1942008 
End bp1943153 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content52% 
IMG OID643385957 
ProductABC transporter, periplasmic substrate-binding protein 
Protein accessionYP_002270446 
Protein GI209397208 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.600032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000000000000249454 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAAGA CATTTGCCCG CAGCAGCCTG TGTGCGCTCA GCATGACAAT AATGACCGCT 
CACGCCGCCG AACCGCCTAC CAATTTAGAT AAACCGGAAG GGCGACTGGA TATTATCGCC
TGGCCGGGAT ATATCGAACG CGGACAAACT GATAAACAAT ACGACTGGGT AACGCAATTC
GAAAAAGAGA CAGGCTGCGC GGTGAATGTT AAAACCGCCG CGACTTCCGA TGAGATGGTC
AGTCTGATGA CCAAAGGGGG TTACGATCTG GTTACGGCAT CCGGCGATGC CTCGCTGCGT
TTGATTATGG GTAAACGCGT GCAGCCGATT AATACCGCCT TGATTCCCAA CTGGAAAACG
CTTGATCCGC GCGTGGTTAA AGGCGACTGG TTTAACGTTG GCGGCAAAGT TTACGGCACA
CCTTACCAAT GGGGGCCGAA CCTGCTGATG TACAACACTA AAACCTTCCC GACGCCGCCG
GATAGCTGGC AAGTGGTTTT TGTTGAGCAA AATCTGCAGG ACGGCAAGAG CAATAAAGGC
CGCGTTCAGG CTTATGATGG CCCTATCTAC ATTGCGGACG CTGCGTTGTT CGTTAAAGCC
ACTCAGCCGC AGTTGGGCAT CAGCGATCCG TATCAACTCA CCGAAGAACA GTACCAGGCG
GTGCTGAAAG TGCTGCGTGA TCAACATAGT TTGATCCATC GCTACTGGCA TGACACTACC
GTGCAAATGA GCGATTTCAA AAACGAGGGT GTGGTCGCTT CCAGTGCCTG GCCCTATCAG
ACCAACGCCC TGAAAGCCGA AGGTCAGCCT GTCGCTACCG TTTTCCCGAA GGAGGGCGTT
ACCGGTTGGG CTGACACCAC CATGCTACAT AGCGAAGCGA AACATCCGGT TTGCGCCTAC
AAATGGATGA ACTGGTCATT AACGCCAAAA GTGCAGGGCG ATGTGGCGGC CTGGTTTGGC
TCGTTACCAG TCGTGCCGGA AGGGTGTAAA GCCAGTTCGT TATTAGGCGA GAAAGGTTGT
GAAACAAACG GTTTTAACTA TTTCGATAAA ATAGCCTTCT GGAAAACGCC TATAGCAGAA
GGGGGCAAGT TTGTTCCCTA CAGTCGCTGG ACGCAGGATT ACATTGCCAT TATGGGCGGT
CGCTAA
 
Protein sequence
MSKTFARSSL CALSMTIMTA HAAEPPTNLD KPEGRLDIIA WPGYIERGQT DKQYDWVTQF 
EKETGCAVNV KTAATSDEMV SLMTKGGYDL VTASGDASLR LIMGKRVQPI NTALIPNWKT
LDPRVVKGDW FNVGGKVYGT PYQWGPNLLM YNTKTFPTPP DSWQVVFVEQ NLQDGKSNKG
RVQAYDGPIY IADAALFVKA TQPQLGISDP YQLTEEQYQA VLKVLRDQHS LIHRYWHDTT
VQMSDFKNEG VVASSAWPYQ TNALKAEGQP VATVFPKEGV TGWADTTMLH SEAKHPVCAY
KWMNWSLTPK VQGDVAAWFG SLPVVPEGCK ASSLLGEKGC ETNGFNYFDK IAFWKTPIAE
GGKFVPYSRW TQDYIAIMGG R