Gene EcSMS35_0991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0991 
Symbol 
ID6145469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1005923 
End bp1007275 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content56% 
IMG OID641615878 
Productputative chaperone 
Protein accessionYP_001743070 
Protein GI170681799 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.232925 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTATTG GTTTTGATTA CGGTACAGCA AACTGTTCAG TGGCGGTCAT GCGCGACGGT 
AAGCCGCACT TGCTAAAAAT GGAAAACGAC AGCACGCTGC TGCCTTCAAT GCTTTGCGCG
CCAACGCGTG AAGCGGTAAG CGAATGGCTG TACCGCCATC ACGACGTTCC GGCAGACGAC
GATGAAACGC AGGCGCTGTT ACGTCGGGCG ATTCGTTATA ACCGCGAAGA AGATATCGAT
GTTACGGCGA AAAGCGTGCA GTTCGGTCTT TCCTCACTGG CACAGTACAT TGATGATCCA
GAAGAGGTGT GGTTTGTGAA ATCACCAAAA TCGTTCCTCG GTGCCAGCGG CTTAAAACCG
CAGCAGGTAG CGCTGTTTGA GGATCTGGTC TGCGCAATGA TGCTGCACAT TCGCCAACAG
GCGCAGGCGC AATTGCCAGA AGCGATTACC CAGGCGGTGA TTGGTCGCCC GATCAACTTC
CAGGGGCTGG GCGGCGATGA AGCAAACGCC CAGGCACAAG GGATTCTGGA GCGCGCAGCG
AAGCGTGCCG GGTTTAAGGA CGTGGTATTC CAGTACGAGC CGGTGGCGGC TGGGCTGGAT
TACGAAGCCA CCTTGCAGGA AGAAAAACGG GTGCTGGTGG TGGATATTGG TGGTGGTACG
ACTGACTGTT CATTGCTGCT GATGGGGCCG CAGTGGCGTT CGCGTCTCGA TCGTGAAGCC
AGCCTGCTGG GTCACAGTGG TTGCCGTATT GGCGGTAACG ATCTGGATAT CGCACTGGCG
TTTAAAAACC TGATGCCGTT ACTGGGCATG GGTGGCGAAA CCGAAAAAGG CATCGCCCTG
CCGATTCTGC CGTGGTGGAA TGCGGTTGCC ATCAACGACG TCCCTGCGCA AAGTGATTTC
TACAGTAGTG CCAACGGTCG TCTGCTTAAC GATCTGGTAC GCGATGCCCG CGAACCGGAA
AAAGTGGCCC TGTTACAGAA AGTCTGGCGT CAGCGTTTAA GCTATCGCCT GGTACGTAGC
GCAGAAGAGA GCAAAATTGC GCTTTCAAGC GTAGCGGAAA CCCGCGCCTC ACTGCCGTTT
ATCAGCGATG AACTGGCAAC GCTGATTAGC CAGCAAGGGC TGGAAAACGC CCTCAGTCAG
CCGCTGGCGC GAATTCTGGA ACAGGTGCAA CTGGCGCTGG ATAACGCCCA GGAAAAACCG
GATGTCATCT ATCTGACTGG TGGTAGCGCC CGATCGCCGC TGATTAAAAA AGCGCTGGCA
GAACAGTTGC CGGGCATTCC GATTGCAGGC GGCGATGACT TTGGCTCCGT CACCGCCGGG
CTGGCACGCT GGGCGGAAGT GGTGTTTCGT TAA
 
Protein sequence
MFIGFDYGTA NCSVAVMRDG KPHLLKMEND STLLPSMLCA PTREAVSEWL YRHHDVPADD 
DETQALLRRA IRYNREEDID VTAKSVQFGL SSLAQYIDDP EEVWFVKSPK SFLGASGLKP
QQVALFEDLV CAMMLHIRQQ AQAQLPEAIT QAVIGRPINF QGLGGDEANA QAQGILERAA
KRAGFKDVVF QYEPVAAGLD YEATLQEEKR VLVVDIGGGT TDCSLLLMGP QWRSRLDREA
SLLGHSGCRI GGNDLDIALA FKNLMPLLGM GGETEKGIAL PILPWWNAVA INDVPAQSDF
YSSANGRLLN DLVRDAREPE KVALLQKVWR QRLSYRLVRS AEESKIALSS VAETRASLPF
ISDELATLIS QQGLENALSQ PLARILEQVQ LALDNAQEKP DVIYLTGGSA RSPLIKKALA
EQLPGIPIAG GDDFGSVTAG LARWAEVVFR