Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0924 |
Symbol | |
ID | 6142862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 932748 |
End bp | 935027 |
Gene Length | 2280 bp |
Protein Length | 759 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615812 |
Product | hypothetical protein |
Protein accession | YP_001743004 |
Protein GI | 170684260 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.504013 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGC CGTTAATTGT CGGCATCCGG CATCATAGTC CGGCCTGCGC CCGGCTGGTG AAATCGTTAA TCGAAAGCCA GCGGCCACGA TACGTGTTGA TTGAAGGCCC GGCTGATTTT AATGACCGGG TAGACGAACT GTTGTTAGCC CACCAGCTTC CGGTAGCTAT TTACAGTTAT TGCCAGTATC AGGACGGTGC AGCCCCCGGG CGTGGTGCCT GGACGCCATT TGCTGAATTT TCGCCGGAGT GGCAGGCGCT ACAAGCCGCA CGTCGTATTC AGGCACAAAC TTACTTCATC GATTTGCCTT GCTGGGCACA AAGTGAAGAA GAGGACGATT CGCCTGATAT GCAAGAGGAA AGCCAGGCCT TGCTGCTGCG TGCCACCCGC ATGGATAACA GCGACACCCT GTGGGATCAC TTGTTCGAAG ATGAAAGCCA GCAAACTATA TTACCCTCTG CACTGGCGCA CTATTTTGCA CAACTGCGGG GCGACGCCTC CGGCGATGCA CTCAATCGTC AGCGCGAAGC CTTTATGGCC CGCTGGATTG CATGGGCGAT GCAGCAAAAT AATGGCGACG TATTAGTCGT CTGCGGAGGC TGGCACGCTC CGGCACTGGC AAAAATGTGG CGCGAATGCC CGCAAGAAAT TAACACGCCA GAATTGCTCT TACTGGATGA TGCCGTTACA GGTTGCTATC TCACGCCCTA CAGTGAAAAG CGCCTTGATG TGCTGGCAGG ATACCTTTCA GGAATGCCTG CCCCGGTCTG GCAAAACTGG TGCTGGCGGT GGGGCTTGCA GCAGGCAGGT GAACAACTGC TGAAAACGGT TCTTACCCGT TTGCGCCAGC ACCACTTGCC TGCTTCAACA GCGGATATGG CTGCCGCTCA TCTGCATGCG ATGGCACTGG CACAGTTGCG CGGTCATACA CTACCGTTAC GCACTGACTG GCTGGATGCC ATAGCAGGCT CGCTGATTAA AGAAGCCCTG AATGCGCCGT TGCCGTGGAG CTATCGCGGC GTTATTCATC CCGATACCGA TCCGATTCTG CTAACGTTGA TAGACACATT AGCGGGTGAC GGATTCGGTA AACTTGCCCC TTCTACGCCA CAACCGCCTC TGCCAAAAGA TGTCACCTGC GAACTGGAAC GTACCGCAAT CTCTCTTCCG GCGGAGCTTA CCTTAAATCG CTTTAACCCC AATGGACTAG CGCAAAGTCA GGTGTTACAT CGGCTGGCAA TACTGGAGAT CCCAGGGATT GTACGCCAGC AGGGAAGCAC ACTGACACTT GCAGGTAACG GTGAAGAACG CTGGAAATTA ACCCGCCCGC TTAGCCAACA TGCGGCATTA ATTGAGGCCG CCTGCTTTGG TGCCACACTC CAGGAAGCCG CACGCCATAA ATTAGAAGCC GATATGCTGG ACGCGGGTGG AATCGGCAGT ATCACCACAT GTCTTAGCCA GGCGGCGTTA GCGGGTCTGG CGTCCTTCAG TCAACAATTA CTGGAGCAAC TTACATTATT AATCGCCCAG GAAAATCAGT TTGCCGAAAT GGGCCAGGCG CTGGAAGTGC TATATGCCTT ATGGCGGCTG GATGAAATTA GCGGTATGCA AGGCGCGCAG ATATTACAAA CGACGTTATG CGCGGCTATC GATCGCACGC TGTGGCTGTG TGAATCTAAC GGCAGACCGG ATGAAAAGGA GTTTCACGCT CACCTGCATA GCTGGCAAGC GCTTTGCCAT ATCCTGCGCG ATCTACATAG CGGCGTTAAT TTACCCGGCG TTTCTCTTTC TGCGGCGGTA GCCTTACTGG AGCGACGCAG TCAGGCGATT CATGCCCCGG CGCTGGATCG CGGCGCGGCT CTTGGCGCAC TAATGCGTCT GGAACATCCC AACGCCAGTG CCGAAGCGGC GCTGACGATG CTGGCGCAGT TATCCCCGGC ACAATCCGGC GAGGCGCTGC ACGGTTTGCT GGCGCTGGCC CGCCATCAAC TGGCCTGTCA GCCGGCATTT ATCGCCGGTT TCAGCAGTCA TTTAAATCAA CTGAGTGATG CCGATTTTAT CAATGCCCTG CCCGATTTAC GCGCGGCGAT GGCCTGGCTA CCACCACGAG AACGCGGGAC ACTGGCGCAT CAGGTGCTTG AGCATTACCA GCTCGCACAG CTTCCCATTT CAGCGCTGCA AATGCCGTTG CATTGTCCAC CGCAAGCCAT TGCACATCAT CAACAACTCG AACAGCAGGC ACTGGCATCG CTGCAACACT GGGGAGTTTT CCATGTCTGA
|
Protein sequence | MSEPLIVGIR HHSPACARLV KSLIESQRPR YVLIEGPADF NDRVDELLLA HQLPVAIYSY CQYQDGAAPG RGAWTPFAEF SPEWQALQAA RRIQAQTYFI DLPCWAQSEE EDDSPDMQEE SQALLLRATR MDNSDTLWDH LFEDESQQTI LPSALAHYFA QLRGDASGDA LNRQREAFMA RWIAWAMQQN NGDVLVVCGG WHAPALAKMW RECPQEINTP ELLLLDDAVT GCYLTPYSEK RLDVLAGYLS GMPAPVWQNW CWRWGLQQAG EQLLKTVLTR LRQHHLPAST ADMAAAHLHA MALAQLRGHT LPLRTDWLDA IAGSLIKEAL NAPLPWSYRG VIHPDTDPIL LTLIDTLAGD GFGKLAPSTP QPPLPKDVTC ELERTAISLP AELTLNRFNP NGLAQSQVLH RLAILEIPGI VRQQGSTLTL AGNGEERWKL TRPLSQHAAL IEAACFGATL QEAARHKLEA DMLDAGGIGS ITTCLSQAAL AGLASFSQQL LEQLTLLIAQ ENQFAEMGQA LEVLYALWRL DEISGMQGAQ ILQTTLCAAI DRTLWLCESN GRPDEKEFHA HLHSWQALCH ILRDLHSGVN LPGVSLSAAV ALLERRSQAI HAPALDRGAA LGALMRLEHP NASAEAALTM LAQLSPAQSG EALHGLLALA RHQLACQPAF IAGFSSHLNQ LSDADFINAL PDLRAAMAWL PPRERGTLAH QVLEHYQLAQ LPISALQMPL HCPPQAIAHH QQLEQQALAS LQHWGVFHV
|
| |