Gene EcSMS35_3187 details

Gene Information

Locus tag: EcSMS35_3187
Symbol:
ID: 6146755
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3270859
End bp: 3271968
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 49%
IMG OID: 641618024
Product: putative immunoglobulin-binding protein
Protein accession: YP_001745174
Protein GI: 170681895
COG category:
COG ID:
TIGRFAM ID:
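
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3270859
END_BP = 3271968
GENE_LENGTH_BP = 1110
PROTEIN_LENGTH_AA = 369


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```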


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.725317
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTAAAA GACGTGATGC TTTTTTAAAG AAAAGCGCGC TGGCAGTGAG TGTGGCCCTG 
CTGCTCTCAT CTCAGGCTTC GGCTAATAAA TCAATTACTG ATTCAACGGC GGGTATTATC
TGGATTGATG GTGGCGGTCA GTCACTTGAA AAAGTAGCTG TGATTGACCG TCAACTAAAT
GATACAGGAT ATAATTTTGC CGTAGGTAGT GGTGCAGCAA TTCTGGATGC GGATAAATCC
ATGGCAGTGG GAAATAATAC AGCTGTTTTT AATGCAGACA ACAGTGTCGC TCTGGGGTAT
GGCTCTCAGG TGAATGGAGA AAGCAATGTA CTTTCTGTAG GGGCCGGTCC TTCAGGATAT
GGATTTTCAG TTGATGGTGC ACCGGAAACC CGCCGGATTA TAAATGTTTC AGATGGTGTT
AAGGATAGTG ATGCGGCCAC AAAAGGACAG ATGGACAACG CCATTGCAGA TGCTGTACGG
GAGTCGGGGG ATGCCCTGCG CGGTGAGATA GGAGCTGTCT ACCGTGATGC TGTTGCTGAT
GCTAAGAGTC GAGTGGAATC GGCAGAAAAC AGACTTAACG GTAATATTAC AGCTGCCAGG
GCTTCTGCGC AGGAATACAC GGATGCGGTG AAGTCGGATG TTCTGGACGA GACGCGTACA
TATACAGACA GCAGTGTGCG TACTGTCCGT AACGAGGTGA AAAGCCAGGC AGAACATCTC
AGCGATGTGC TTGTGAAGAA CAGGGCGCAG ACGGATGCAG CAATAGCCTC AAATACAGCA
GCGATAAGGA ATAACAGTCA TCGTCTGGAT TTGACGGAAG CCTGGCAGAA GATGGCGACA
GAGAGAATGA ATAATATGCA GGAGCAGATT AAAGAGAACC GGAAGGAGTT AAGGGAGAGT
GCAGCCCAGA GCGCGGCACT GGCAGGTCTT TTCCAGCCAT ACAGTGTTGG AAAATTTAAC
GCGACAGCAG CCGTTGGTGG TTACCGTGAT GAGCAGGCCA TTGCGGTGGG CGTGGGCTAC
CGTTTCACAG AGAATGTGGC AGGAAAAGTT GCAGTTGCTG CAGGTGGGTC ATCCGCATCG
TGGAATGCTG GTGTGAATTT TGAGTTCTGA
 
Protein sequence
MLKRRDAFLK KSALAVSVAL LLSSQASANK SITDSTAGII WIDGGGQSLE KVAVIDRQLN 
DTGYNFAVGS GAAILDADKS MAVGNNTAVF NADNSVALGY GSQVNGESNV LSVGAGPSGY
GFSVDGAPET RRIINVSDGV KDSDAATKGQ MDNAIADAVR ESGDALRGEI GAVYRDAVAD
AKSRVESAEN RLNGNITAAR ASAQEYTDAV KSDVLDETRT YTDSSVRTVR NEVKSQAEHL
SDVLVKNRAQ TDAAIASNTA AIRNNSHRLD LTEAWQKMAT ERMNNMQEQI KENRKELRES
AAQSAALAGL FQPYSVGKFN ATAAVGGYRD EQAIAVGVGY RFTENVAGKV AVAAGGSSAS
WNAGVNFEF
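
The gene and protein sequences above are related by translation table 11. The sketch below shows one way to strip the 10-base grouping, translate a sequence, and compute its GC content. It assumes the standard codon-to-amino-acid assignments, which table 11 shares with the standard code. Table 11 also allows alternative start codons, which are not handled here because this gene starts with ATG. The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last 30 bases of the gene sequence listed above.
print(translate("ATGCTTAAAA GACGTGATGC TTTTTTAAAG"))  # MLKRRDAFLK
print(translate("TGGAATGCTG GTGTGAATTT TGAGTTCTGA"))  # WNAGVNFEF
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the protein sequence and the GC content field of this record.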