Gene EcSMS35_1703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1703 
Symbol 
ID6143842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1707032 
End bp1708291 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content35% 
IMG OID641616579 
Producthypothetical protein 
Protein accessionYP_001743757 
Protein GI170680907 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0566348 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAATA TTAATACAGC TTGTGTAAAA AATAATGCCA GTTATCAATT AAATAACGCA 
TTACCTAACA AAGAAACCAT CTCCAGCAAT TTTTGTGAAC GACTGGCACA ATGGGGTAAT
AAGTCGCTTA ATAATGGTGA AGAAAGAGCA ATTGCCGTGG AGCGAATTAA AGAAGCTTAC
AATTCGAACA TGGCATCTCT TGATTTGTCA TATCTTGATT TGAGTGAGTT ACCCCCTATC
CCCTCGACAG TTAACACGTT AAATCTGGAA AACAATTGTC TCACTTGTCT TGACTTTACT
GACAATGCCA GCCTCGTCAA TATCAACCTC AGCTTTAATA AAATTAAAAC GATAACCTTT
CCAAATCAAT CAAAACTGGA AAATATTTAT ATTGATCACA ATAATCTGGA AAATTTGGAT
TTAAAAAATC AGCTTTCATT GGTTAACCTG GAAGCGCAAA ATAACAACCT GACAAAAATT
AATATTTCTG ATAGTTATAA ACTGAAATTT CTTAATCTTG ATTATAACAA ACTAGCATCG
CTGGATCTCT CCCGGCAAGA ATCTCTGATT GAGTTAAGTG CCCATCACAA CATGATCAAT
GACCTTATAT TACACAATCA CCCCATAGTG GAGAAAATCA CTTTAAACGA CAACCATATT
GCACATTTAA ACGCGAAAAC CACTACAAAA CTGGAATATT TAAACTTAAG CAATAACAAT
TTATTGCCAA CAGATGACAT TGATCAATTA ATATCATCAA AACATCTTTG GCATGTATTA
GTTAACGGCA TCAACAATGA TCCACTTGCC CAAATGCAGT ACTGGACTGC AGTAAGAAAT
ATAATTGATG ACACTAATGA AGTGACCATT GATTTATCTT ATAACCTGGC AATCACAAAT
ATCGATACCA GCGATGAACA TCTTGTAGAA GTAAGCGAGA ATTCCGAAGG AAATCATATA
AAAGAAAATG ACTCAATGTC TATTCGTTAT AGATCAAAAT ATTATTCCAG AGAGTACGCC
TTAATAGAAG AAGAAACAAT ATTTTCTGAC GCAGAACTAA AAGCTATTCT GCCTATGCGT
CGCATGTACG GGGTTGGTGA CTATAAGTCA AATTCCTCTT CTCTACCCTC ACACTCGGGG
CTAAAGGACC CAACGGGCAC ACCCGTCTGT TATTATATTC ATAATGAGGA TAAACCTTCC
TTAGGTTTTG GTCCAACATC CAATAATTGG TTAAGCCAAT CCTTTACAAC AGAGTTATAA
 
Protein sequence
MTNINTACVK NNASYQLNNA LPNKETISSN FCERLAQWGN KSLNNGEERA IAVERIKEAY 
NSNMASLDLS YLDLSELPPI PSTVNTLNLE NNCLTCLDFT DNASLVNINL SFNKIKTITF
PNQSKLENIY IDHNNLENLD LKNQLSLVNL EAQNNNLTKI NISDSYKLKF LNLDYNKLAS
LDLSRQESLI ELSAHHNMIN DLILHNHPIV EKITLNDNHI AHLNAKTTTK LEYLNLSNNN
LLPTDDIDQL ISSKHLWHVL VNGINNDPLA QMQYWTAVRN IIDDTNEVTI DLSYNLAITN
IDTSDEHLVE VSENSEGNHI KENDSMSIRY RSKYYSREYA LIEEETIFSD AELKAILPMR
RMYGVGDYKS NSSSLPSHSG LKDPTGTPVC YYIHNEDKPS LGFGPTSNNW LSQSFTTEL