Gene EcSMS35_4686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4686 
Symbol 
ID6147003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4784536 
End bp4785693 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content47% 
IMG OID641619502 
Producthypothetical protein 
Protein accessionYP_001746610 
Protein GI170683978 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0147095 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAAGC GCAGTTGGTT ATTAATCGCC GCATTACCTC CCTCTATTTC ACCTTCCTGG 
GGCGCGGATT TTTATTACCG CCAGCAGGAG AAAGGCACGG TTTATGTTGT CGAACAGAAG
GGGGAAAAAG ATGAGATCCT CTCCGAATTA CCAGATATTA ATTTTTCCCG CCTTTGGCGT
ATTGCCAATT TAGCCAATAA ACAAGATTCC CGGTTACTGT CCGATTTTAA TCCCGATAAG
TTCGATTGCG ATGATGAGGG GGATTGCGAA CATGCCTGGC TCACCGATGG ACGCTCTGTT
CTTTGGTCTG GCAAAGTCCT GAAAAATCCC CCCGGTAAAC CTATAGTCGA CGCTGCCAGT
TTTCAGGCAT TCGGCGCTTT CGCTGCTGAT AAACGCAGTA TCTATTTTGA TGGTCAGCGT
ACCGATGATA ATAGCGGTGA TAAGCAGGTG GATATGTCAA CGCTTGAAGA GACGGACATC
TGGAATTTAC TGCGTGATAA AAATAGTCTT TGGCATAAGG GGCACTGGTT GGGAAGCGCT
GACGGATTTC AAATCCTGCG GCATGATTCC TCCCTGCAAT TTGTTGTGCA GACAAATTCG
CAGGTGATTG TTAATGGCAA GCCACTGCCC GCCGATCGCA AAACTTTTCA GATTAAACGT
TGGATGCCTG GCGAACGCTT AGTTTATCGC GATAAAAGCG GCGAGCGTGA CTATGAGCTG
GAGGATACCA GCTATCGCTG TGCACCTTTT AATATTGGTC TGAATAACGT GTCCTGGCTC
AAATATGAAG CCACTCCAGC GGGCAGTGAG TGTATTTATG AAACGCTGGC GGGAGTTGAT
CCGGAATATT TTTATCTGTT TGTTCGGAAT ACCGGTTTAT ATAAGAACCA AATATATAAA
GTCACAATTA ACGCCCTGGG CGAAGGTGAG TTGGTTAATC TCAAGCCAGA GGATCTCTCC
GACTCACTTG AAGCAGGGGG TAGTTGGGGA TTAACTAACA CGTTTATATC AACAGACGGG
CAGCTTTACA CTCAACAAGC GACTGGAATT GGGAAAGAAC ACGCCCAACA AGGTGAATGG
CTGCGTTATA ACTTAGGCAA GGGAGGTTGG TTATCGGTGA AGCAACCCCC GAGCGGGCTT
AAACCCTTAT TTAAATAA
 
Protein sequence
MVKRSWLLIA ALPPSISPSW GADFYYRQQE KGTVYVVEQK GEKDEILSEL PDINFSRLWR 
IANLANKQDS RLLSDFNPDK FDCDDEGDCE HAWLTDGRSV LWSGKVLKNP PGKPIVDAAS
FQAFGAFAAD KRSIYFDGQR TDDNSGDKQV DMSTLEETDI WNLLRDKNSL WHKGHWLGSA
DGFQILRHDS SLQFVVQTNS QVIVNGKPLP ADRKTFQIKR WMPGERLVYR DKSGERDYEL
EDTSYRCAPF NIGLNNVSWL KYEATPAGSE CIYETLAGVD PEYFYLFVRN TGLYKNQIYK
VTINALGEGE LVNLKPEDLS DSLEAGGSWG LTNTFISTDG QLYTQQATGI GKEHAQQGEW
LRYNLGKGGW LSVKQPPSGL KPLFK