Gene EcSMS35_1734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1734 
SymbolydcS 
ID6144278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1738437 
End bp1739582 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content53% 
IMG OID641616610 
ProductABC transporter, periplasmic substrate-binding protein 
Protein accessionYP_001743788 
Protein GI170681458 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0000000404923 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAAGA TTTTTGCCCG CAGCAGCCTG TGTGCGCTCA CCATGACAAT AATGACCGCT 
CACGCCGCCG AACCGCCTAC CAATTTAGAT AAACCGGAAG GGCGACTGGA TATTATCGCC
TGGCCGGGAT ACATCGAACG CGGACAAACT GATAATCAAT ACGACTGGGT AACGCAATTC
GAAAAAGAGA CAGGCTGCGC GGTGAATGTG AAAACCGCCG CGACTTCCGA TGAAATGGTC
AGTCTGATGA CCAAAGGGGG TTACGATCTG GTTACGGCAT CCGGCGATGC CTCGCTGCGC
CTGATTATGG GCAAACGCGT GCAGCCGATT AATACCGCAT TGATTCCCAA CTGGAAAGCG
CTCGATCCGC GCGTGGTTAA AGGCGACTGG TTTAACGTTG GCGGCAAAGT TTACGGCACA
CCTTACCAAT GGGGGCCGAA CCTGCTGATG TACAACACTA AAACCTTCCC GACGCCGCCG
AATAGCTGGC AAGTGGTTTT TGTTGAGCAG AATCTGCCGG ACGGCAAGAG CAATAAAGGC
CGCGTTCAGG CTTATGATGG CCCTATCTAC ATTGCGGACG CTGCGTTGTT CGTTAAAGCC
ACTCAGCCGC AGTTGGGCAT CAGCGATCCG TATCAACTCA CCGAAGAACA GTACCAGGCG
GTGCTGAAAG TGCTGCGCGA TCAACATAGT TTGATCCATC GCTACTGGCA TGACACTACC
GTGCAAATGA GCGATTTCAA AAACGAGGGT GTAGTTGCTT CCAGTGCATG GCCCTATCAG
GCCAACGCCC TGAAAGCCGA AGGCCAGCCT GTCGCTACCG TTTTCCCGAA AGAGGGCGTT
ACCGGTTGGG CTGACACCAC CATGCTACAT AGCGAAGCGA AACATCCGGT TTGCGCCTAC
AAATGGATGA ACTGGTCATT AACCCCAAAA GTGCAGGGCG ATGTGGCGGC CTGGTTTGGC
TCGCTACCAG TAGTGCCGCA AGGGTGTAAA GCCAGTCCGT TATTAGGCGA GAAAGGTTGT
GAAACAAACG GTTTTAACTA TTTCGATAAA ATCGCCTTCT GGAAAACGCC TATAGCAGAA
GGGGGCAAGT TTGTTCCCTA CAGTCGCTGG ACGCAGGATT ACATTGCCAT TATGGGTGGT
CGCTAA
 
Protein sequence
MSKIFARSSL CALTMTIMTA HAAEPPTNLD KPEGRLDIIA WPGYIERGQT DNQYDWVTQF 
EKETGCAVNV KTAATSDEMV SLMTKGGYDL VTASGDASLR LIMGKRVQPI NTALIPNWKA
LDPRVVKGDW FNVGGKVYGT PYQWGPNLLM YNTKTFPTPP NSWQVVFVEQ NLPDGKSNKG
RVQAYDGPIY IADAALFVKA TQPQLGISDP YQLTEEQYQA VLKVLRDQHS LIHRYWHDTT
VQMSDFKNEG VVASSAWPYQ ANALKAEGQP VATVFPKEGV TGWADTTMLH SEAKHPVCAY
KWMNWSLTPK VQGDVAAWFG SLPVVPQGCK ASPLLGEKGC ETNGFNYFDK IAFWKTPIAE
GGKFVPYSRW TQDYIAIMGG R