Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4003 |
Symbol | |
ID | 6146293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4082773 |
End bp | 4085493 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 641618828 |
Product | hypothetical protein |
Protein accession | YP_001745966 |
Protein GI | 170679594 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTGC TAAACAAACA TATAGCTAGT ATTTTAAGAG CAAAAGATGA GGAGTCTTTA GCTATTTTTG TAGGGGCTGG TGTTTCAAAA TCCTCTGAAA CAAAAACTAT CAAAATGCCT TCATGGGGGG ACTTAATTGA CACTCTCATT TCAGATCTCA ATATCAAAGA TGAATCAGAT TATTTAAAAA TAGCACAGCT TTATTACCTC ACATTTGGTG AGCATCTTTA CTACAAAAGA ATCAAAGATT TCTTTCCTGA AAATGTTCCC CATTCAAAGA TTCACGATCT AATATTTAAA CTGAATCCTC ACTCTGTTAT CACAACGAAT TGGGACACTT TGCTTGAAGC AGCTATAAAT GCCAAATCTT ACTTTTATAA TATAATAAGC AGTGACAAAG ATTTGATGAA ATCTTACCTT GGCAAAAAAT TGATAAAAAT GCATGGTGAT TTTAAAAATC ACAACATCGT CTTTAAAGAA GATGATTACT TAAATTATAG CTTCAATTTT CCCTTAATTG AAAATTATGT TAAAAGTGTT ATTTCCACCC ATACTGTGCT ATTTTTGGGG TATTCATATA ATGATATAGA TCTAAAGCAA ATAATTAAAT GGACTCAGAA TCATTCATCT GTTAGACCCC CTATGTATTT AGTAGTATTC GAAGATATTC CTGCTCAAAG AAAATATCTT GAAAGTCATG GTATAACCAC TATTATTTTA ACCGATGAAA CAATCAAACC TTTCAATAAC AATTCATATT CAAACAAGTT ATATACTTTC CTGCATAGCC TAAATAGCCT AGAACTATGT GCAGACTTAG ATGATATCGA AATAATTAAT GCTATTTATT CAAGAGTGAG ATCTCTTCAA TCGTTAAATG CAATACTTGC TGAACAAGTT ACCAGATGTT TCACTAATTG CGGTTTAATG TATATTGATG ACAATGGACC AAAAGCTTTA TTGCAATTTT ATGACACCGA AGTAACCTCT AATGATAATA ATATAGAGCT AAGAGGATTT TATAAGAAGT TTTTGAGTAT ACTCAGTAAT GATAAAAAAG TTAAAGACTA TAAGCCGCAT CTCCAAAAAT TATTTTTTAT ATTAAAAAAA GCAAGTATAT ATGGTATTGT TTTAAACGAT AAACATGATG AAGTGCTACT GATTACTGAA GTGCTACCAA ATGATTCATT AATTAAAATA GATAAGGAAA TTAATTTCAA CTATAACGAA ATAGTTACCT ATACACGCCC CCATAAGGTT GCAAATAGTA TAGACAAGAC TAAGAAATAT AATTGCTTCC AGTTAAATAA ATACGATGAG GCTTACAGTA TTATAGAGGA GGAATTATCA GAAGAGATCA GACAAAAGGA CTATGCAAAT ATTCTCATAT CGCTTTTCAA TCAAAATATT ATTTTGAATA GATTGAAGTA TGATTTTTCA ATTAATAGAG AAAGATACTC AACTCTTGAG GAGAATAAAA TTCATGAATT ATATGATAAC TTACCGAGAA ATATAAAAAA AACCGTCTCC GTAATCTATG ATTTAGTAAC CTTTAATTAT TTATTAAATC TTCACTACAC CGTTAGCTCA TTATTAACAA AGTATAGCGA TATAAGAAAA AGAAATACAA AGTTATTAGT TGACGGGGAT CTTCATAAAA CGGAATTTCT TTTCGAAAAC CTTATAATTT TTGTTGTAAA AAATGGATGT TTAATTGATG TTTATAAAGA ATTCAAGGAT GTGATTCGCA AATTCATTGA AATAAAAATA ATCAAGGATT CTGATAAAGA CGAGATTTCT CTAACAAGAT TAGAATTATA TTCATGCATT AAATACATTG ATAATAAAAC CCTATCTCTT ATACTAAGGA AAGAAGATAA AAAACTATTA TCGCTGTCTG TTCAACCCAA AGAATTAGAT TGGTTAATAA ATACTGTACT GCAAAACCTA GCGAAGTCAT ATAGCAAGTT CGCTACGTTT CTCAACCCTA TAGAAGGAAA GTTAATCAAC GCATTAAAAC TACTATCTTT AATGAAAATA ACCACAGAGC AAGACGCGGT AGTATTAAAA ACATTAAACG ACACTTTAAA GTCCTCATAC CACAACCTAG CATTCTATGA TGCTATTTCC GAATATGTCG TTATAAGGTA CAACACCCAC AGTGAAACTT TATCCACTGA CAGCATAAAA ACTTTAATAT ACACGATTCT CGATAAGTTA ATAAGTAGAA ACCTTGGTAG GTATGAGGTG ATAGCTATAG TCAATAGAGG CCTTGCTAAT ATTTTTTCAG TAGCCAAAAA ACTGGGCGTT AATATAGAAG ACGACAGCAA GGTCGATAAA TTACTTCATG AAATTAGTTC TTATCCGAAT ACAGATAAAG CAAGAGCTGC TGAAACAATA CTATATGATC TGTACAGAAT ATCAACAGAA AAGAATAGAG ATAAAATAAA ATCATTTATC AAAAATATTT CCACAACTGA TTTCAATGAA GAAAGAAAGA TTAAATTTGA ATTATTTTTA TTAGCATCAG AGATTTCAGA CAGCTATGAC AATCTTCCTG AGAAAGTATC CAAGCTTGTT GAAAACTATA AAGGATTTAG ATTTAACTCA GAAGCAGAGA CAATCAGAGG TTTATTACGC TATATAGTCA ACACACGGAA ATTAAGTGAT TTCTCACAAG CACTTTTGAA GATTGAAGAG ATAATAAATA ATTACAAATA A
|
Protein sequence | MDLLNKHIAS ILRAKDEESL AIFVGAGVSK SSETKTIKMP SWGDLIDTLI SDLNIKDESD YLKIAQLYYL TFGEHLYYKR IKDFFPENVP HSKIHDLIFK LNPHSVITTN WDTLLEAAIN AKSYFYNIIS SDKDLMKSYL GKKLIKMHGD FKNHNIVFKE DDYLNYSFNF PLIENYVKSV ISTHTVLFLG YSYNDIDLKQ IIKWTQNHSS VRPPMYLVVF EDIPAQRKYL ESHGITTIIL TDETIKPFNN NSYSNKLYTF LHSLNSLELC ADLDDIEIIN AIYSRVRSLQ SLNAILAEQV TRCFTNCGLM YIDDNGPKAL LQFYDTEVTS NDNNIELRGF YKKFLSILSN DKKVKDYKPH LQKLFFILKK ASIYGIVLND KHDEVLLITE VLPNDSLIKI DKEINFNYNE IVTYTRPHKV ANSIDKTKKY NCFQLNKYDE AYSIIEEELS EEIRQKDYAN ILISLFNQNI ILNRLKYDFS INRERYSTLE ENKIHELYDN LPRNIKKTVS VIYDLVTFNY LLNLHYTVSS LLTKYSDIRK RNTKLLVDGD LHKTEFLFEN LIIFVVKNGC LIDVYKEFKD VIRKFIEIKI IKDSDKDEIS LTRLELYSCI KYIDNKTLSL ILRKEDKKLL SLSVQPKELD WLINTVLQNL AKSYSKFATF LNPIEGKLIN ALKLLSLMKI TTEQDAVVLK TLNDTLKSSY HNLAFYDAIS EYVVIRYNTH SETLSTDSIK TLIYTILDKL ISRNLGRYEV IAIVNRGLAN IFSVAKKLGV NIEDDSKVDK LLHEISSYPN TDKARAAETI LYDLYRISTE KNRDKIKSFI KNISTTDFNE ERKIKFELFL LASEISDSYD NLPEKVSKLV ENYKGFRFNS EAETIRGLLR YIVNTRKLSD FSQALLKIEE IINNYK
|
| |