Gene EcSMS35_4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4003 
Symbol 
ID6146293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4082773 
End bp4085493 
Gene Length2721 bp 
Protein Length906 aa 
Translation table11 
GC content30% 
IMG OID641618828 
Producthypothetical protein 
Protein accessionYP_001745966 
Protein GI170679594 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTGC TAAACAAACA TATAGCTAGT ATTTTAAGAG CAAAAGATGA GGAGTCTTTA 
GCTATTTTTG TAGGGGCTGG TGTTTCAAAA TCCTCTGAAA CAAAAACTAT CAAAATGCCT
TCATGGGGGG ACTTAATTGA CACTCTCATT TCAGATCTCA ATATCAAAGA TGAATCAGAT
TATTTAAAAA TAGCACAGCT TTATTACCTC ACATTTGGTG AGCATCTTTA CTACAAAAGA
ATCAAAGATT TCTTTCCTGA AAATGTTCCC CATTCAAAGA TTCACGATCT AATATTTAAA
CTGAATCCTC ACTCTGTTAT CACAACGAAT TGGGACACTT TGCTTGAAGC AGCTATAAAT
GCCAAATCTT ACTTTTATAA TATAATAAGC AGTGACAAAG ATTTGATGAA ATCTTACCTT
GGCAAAAAAT TGATAAAAAT GCATGGTGAT TTTAAAAATC ACAACATCGT CTTTAAAGAA
GATGATTACT TAAATTATAG CTTCAATTTT CCCTTAATTG AAAATTATGT TAAAAGTGTT
ATTTCCACCC ATACTGTGCT ATTTTTGGGG TATTCATATA ATGATATAGA TCTAAAGCAA
ATAATTAAAT GGACTCAGAA TCATTCATCT GTTAGACCCC CTATGTATTT AGTAGTATTC
GAAGATATTC CTGCTCAAAG AAAATATCTT GAAAGTCATG GTATAACCAC TATTATTTTA
ACCGATGAAA CAATCAAACC TTTCAATAAC AATTCATATT CAAACAAGTT ATATACTTTC
CTGCATAGCC TAAATAGCCT AGAACTATGT GCAGACTTAG ATGATATCGA AATAATTAAT
GCTATTTATT CAAGAGTGAG ATCTCTTCAA TCGTTAAATG CAATACTTGC TGAACAAGTT
ACCAGATGTT TCACTAATTG CGGTTTAATG TATATTGATG ACAATGGACC AAAAGCTTTA
TTGCAATTTT ATGACACCGA AGTAACCTCT AATGATAATA ATATAGAGCT AAGAGGATTT
TATAAGAAGT TTTTGAGTAT ACTCAGTAAT GATAAAAAAG TTAAAGACTA TAAGCCGCAT
CTCCAAAAAT TATTTTTTAT ATTAAAAAAA GCAAGTATAT ATGGTATTGT TTTAAACGAT
AAACATGATG AAGTGCTACT GATTACTGAA GTGCTACCAA ATGATTCATT AATTAAAATA
GATAAGGAAA TTAATTTCAA CTATAACGAA ATAGTTACCT ATACACGCCC CCATAAGGTT
GCAAATAGTA TAGACAAGAC TAAGAAATAT AATTGCTTCC AGTTAAATAA ATACGATGAG
GCTTACAGTA TTATAGAGGA GGAATTATCA GAAGAGATCA GACAAAAGGA CTATGCAAAT
ATTCTCATAT CGCTTTTCAA TCAAAATATT ATTTTGAATA GATTGAAGTA TGATTTTTCA
ATTAATAGAG AAAGATACTC AACTCTTGAG GAGAATAAAA TTCATGAATT ATATGATAAC
TTACCGAGAA ATATAAAAAA AACCGTCTCC GTAATCTATG ATTTAGTAAC CTTTAATTAT
TTATTAAATC TTCACTACAC CGTTAGCTCA TTATTAACAA AGTATAGCGA TATAAGAAAA
AGAAATACAA AGTTATTAGT TGACGGGGAT CTTCATAAAA CGGAATTTCT TTTCGAAAAC
CTTATAATTT TTGTTGTAAA AAATGGATGT TTAATTGATG TTTATAAAGA ATTCAAGGAT
GTGATTCGCA AATTCATTGA AATAAAAATA ATCAAGGATT CTGATAAAGA CGAGATTTCT
CTAACAAGAT TAGAATTATA TTCATGCATT AAATACATTG ATAATAAAAC CCTATCTCTT
ATACTAAGGA AAGAAGATAA AAAACTATTA TCGCTGTCTG TTCAACCCAA AGAATTAGAT
TGGTTAATAA ATACTGTACT GCAAAACCTA GCGAAGTCAT ATAGCAAGTT CGCTACGTTT
CTCAACCCTA TAGAAGGAAA GTTAATCAAC GCATTAAAAC TACTATCTTT AATGAAAATA
ACCACAGAGC AAGACGCGGT AGTATTAAAA ACATTAAACG ACACTTTAAA GTCCTCATAC
CACAACCTAG CATTCTATGA TGCTATTTCC GAATATGTCG TTATAAGGTA CAACACCCAC
AGTGAAACTT TATCCACTGA CAGCATAAAA ACTTTAATAT ACACGATTCT CGATAAGTTA
ATAAGTAGAA ACCTTGGTAG GTATGAGGTG ATAGCTATAG TCAATAGAGG CCTTGCTAAT
ATTTTTTCAG TAGCCAAAAA ACTGGGCGTT AATATAGAAG ACGACAGCAA GGTCGATAAA
TTACTTCATG AAATTAGTTC TTATCCGAAT ACAGATAAAG CAAGAGCTGC TGAAACAATA
CTATATGATC TGTACAGAAT ATCAACAGAA AAGAATAGAG ATAAAATAAA ATCATTTATC
AAAAATATTT CCACAACTGA TTTCAATGAA GAAAGAAAGA TTAAATTTGA ATTATTTTTA
TTAGCATCAG AGATTTCAGA CAGCTATGAC AATCTTCCTG AGAAAGTATC CAAGCTTGTT
GAAAACTATA AAGGATTTAG ATTTAACTCA GAAGCAGAGA CAATCAGAGG TTTATTACGC
TATATAGTCA ACACACGGAA ATTAAGTGAT TTCTCACAAG CACTTTTGAA GATTGAAGAG
ATAATAAATA ATTACAAATA A
 
Protein sequence
MDLLNKHIAS ILRAKDEESL AIFVGAGVSK SSETKTIKMP SWGDLIDTLI SDLNIKDESD 
YLKIAQLYYL TFGEHLYYKR IKDFFPENVP HSKIHDLIFK LNPHSVITTN WDTLLEAAIN
AKSYFYNIIS SDKDLMKSYL GKKLIKMHGD FKNHNIVFKE DDYLNYSFNF PLIENYVKSV
ISTHTVLFLG YSYNDIDLKQ IIKWTQNHSS VRPPMYLVVF EDIPAQRKYL ESHGITTIIL
TDETIKPFNN NSYSNKLYTF LHSLNSLELC ADLDDIEIIN AIYSRVRSLQ SLNAILAEQV
TRCFTNCGLM YIDDNGPKAL LQFYDTEVTS NDNNIELRGF YKKFLSILSN DKKVKDYKPH
LQKLFFILKK ASIYGIVLND KHDEVLLITE VLPNDSLIKI DKEINFNYNE IVTYTRPHKV
ANSIDKTKKY NCFQLNKYDE AYSIIEEELS EEIRQKDYAN ILISLFNQNI ILNRLKYDFS
INRERYSTLE ENKIHELYDN LPRNIKKTVS VIYDLVTFNY LLNLHYTVSS LLTKYSDIRK
RNTKLLVDGD LHKTEFLFEN LIIFVVKNGC LIDVYKEFKD VIRKFIEIKI IKDSDKDEIS
LTRLELYSCI KYIDNKTLSL ILRKEDKKLL SLSVQPKELD WLINTVLQNL AKSYSKFATF
LNPIEGKLIN ALKLLSLMKI TTEQDAVVLK TLNDTLKSSY HNLAFYDAIS EYVVIRYNTH
SETLSTDSIK TLIYTILDKL ISRNLGRYEV IAIVNRGLAN IFSVAKKLGV NIEDDSKVDK
LLHEISSYPN TDKARAAETI LYDLYRISTE KNRDKIKSFI KNISTTDFNE ERKIKFELFL
LASEISDSYD NLPEKVSKLV ENYKGFRFNS EAETIRGLLR YIVNTRKLSD FSQALLKIEE
IINNYK