Gene Information |
Locus tag | EcSMS35_1070 |
Symbol | |
ID | 6143973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1081213 |
End bp | 1083609 |
Gene Length | 2397 bp |
Protein Length | 798 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615957 |
Product | hypothetical protein |
Protein accession | YP_001743149 |
Protein GI | 170682670 |
COG category | |
COG ID | |
TIGRFAM ID | |
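The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1081213, 1083609)
print(length_bp)                  # 2397
print(protein_length(length_bp))  # 798
```

Both results agree with the Gene Length and Protein Length fields. The strand does not affect the length calculation; it only determines which strand of the replicon the gene is read from.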
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0611355 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCACGAAA AGAACATCGC CCTGCTTTGT GATGAAGCCG ACCGACTTTT GCAACTGAAC ATTAATCTGC TCCGGCAAAT GGTTGAGGAG CCAGATGTGT TATCTGACAG TAAGAACGAA AACAGACTGC TTTTTGATAA ACAGAAAGCA CTGAAAAGAA TTGAGGAGCT GGAGGGCGAA CAAATCAAAA CCGCCCGCAG GGAGATGGTG CTGGCTGTTG TCGGCACGAT GAAAGCAGGC AAATCAACCA CCATCAACGC CATTGTGGGG CAGGAAATTC TGCCTAACCG TAACCGCCCC ATGACCTCTG TACCGACGCT CATCCGCCAC GTTCCCGGAA AAACTGAGCC GGTTCTCCAT CTGGAACATA TTCAGCCTGT CCGCAATTTA TTAATCACAC TGCAGGAAAA ACTCGCCACC CCGGCAGGAC AGCAGGTCGC ACAGACCCTG CAGCAAACCG GGGATACCCG CGAACTGCTG GATATTCTGA CGGATGATGG CTGGCTCAAA AATGAATACC ACGGGGAGGA GGAAATCTTT ACCGGACTGG CATCGTTAAA CGATCTGGTT CGTCTTGCTG CGGCAATGGG GACTGAATTT CCTTTTGATG AATACGCAGA AGTGCAGAAA CTGCCGGTGA TCGACGTGGA ATTCAGCCAT CTGGTGGGGA TGGATGCATG CCAGGGAACA CTCACACTGC TGGATACCCC CGGCCCTAAT GAGGCCGGAC AACCGCAGAT GGAAGTGATG ATGCGGGATC AACTGCAGAA AGCCTCTGCG GTTCTGGCTG TGATGGATTA CACCCAGATG AACTCAAAAG CGGATGAAGA CGTCCGTAAA GAGCTTAATG CCATTGCTGA CGTATCAGCC GGCCGCCTGT TTGTACTGGT CAATAAATTT GATGAGAAAG ACCGCAATGG CGATGGGGCA GATGCCGTAC GCCAGAAAGT TCCGGCAATG CTGAACAGCG ATGTGCTGCC CGCCTCCCGC GTTTATCCCG GATCCTCACG CCAGGCATAC CTGGCTAACC GTGCGCTTCA TGAGTTACGG AAAAACGGAA CCCTTCCTGT TGATGAAGCC TGGGTCGATG ATTTTATCAG AGGTGCCTTC GGTTGCATGA AAAAAGAATA CGTCTGTAAA GACAGTGAAC TGGCAACTGA AGGGGCGACA GACCTGTGGG AAGGCTCACT TATCGATCAA CTGATAACGG AAGTCATACA GAGCTCACAT TCCAGAGCTG CGGCACTGGC GGTTGACTCT GCCGCCGCAA AACTGATGCA GAATGCAGAA AATATCAGTG AATACCTGTT GTTACGCCAT CAGGGGCTAC AGCAAAGCAT TCAGTCACTG CAGTCGCATA TCACCAGCCT GCTTGCGGAT ATCCGGGAAA TCGCGGACTG TCAGGAGCAG ATGACCACTG ATGTCAGAAT GGCCATGGAG GAGATCGATA CTAAAACCCG GGAATTACTG ACGGGGGTCT GCACCTCACT GGAAGAGGAG CTGAATGACT ATTTCAGAAG CGGTAAACGC AAAGAACAGC AAATGCTGGA GGAAGAAGAC GCAGAACAAC GAAGGTCTCA GTCCGGATTA TGGGGGAAAA TTTCTCAATG GTCCGGTATC AACAACCAGG GCCGGGAGGA TTACAGGAAA CGAGATTTTG CCCCGGACAG CCCCGAAATA AAATTCAGTG ATCGCAGGGA AGCCCTTGAA CTGATGACGC AAATCGAATC GACCGTGACC AGCCTGCACC GTGAGGCTGA AGCACAGTTC CGGCCTGAGC TGGAGAAAAT CGTCAGCGGG 
ATTGAAACAG GTTTTCGTGG CACGGCCCTG TACGCCACAG AAAACATTGC CGGTCGCATC AATGCCCGCC TGGAGGATGA GGGCTTTACC GTAAAAATCA GTTTTCCGGC AGTCAGCCAG TTACAGACCC GGCTCGCGGT AAAAATAAAT CTGAGTGCGC TTATGGAGGA AAGAACGGAG ACCGTCACCC GTCGCCGTCG GCAGAGTGGC GTATGGGGAA CCGTTTGTCG ATGGTTTGGC ACCAGTGACC TGGGCTGGGA AAACTATGAC GAGGATGTGA GTCGCAGCGT GATCAATATC AACAAGGTCA GAGAGGAAGT TATGTCACTG ACCCGGGCAT ATTTCGGGGA GCTGCAGGCA TCCATTGAGC AGGATATTAA CCAGCCCGTC CGCCAGGAAA TCGATGCCTT TTTCTGCGCA TTCAGGGAGA AAGTTGAACA ACTGCGTAAC ACGCTGATTC AGAGCTCTGA AGATCATAAA CGCGATCAGC AGGCGCAGGA ACGGCTTACC GGGCGACTTC AGGCATTAAA TGAAAGGGTT CCTGAGCTCA TTACTGACAG TAAGGCGCTG AGGGAAGAGC TGGAGACAAT GCTGTGA
|
Protein sequence | MHEKNIALLC DEADRLLQLN INLLRQMVEE PDVLSDSKNE NRLLFDKQKA LKRIEELEGE QIKTARREMV LAVVGTMKAG KSTTINAIVG QEILPNRNRP MTSVPTLIRH VPGKTEPVLH LEHIQPVRNL LITLQEKLAT PAGQQVAQTL QQTGDTRELL DILTDDGWLK NEYHGEEEIF TGLASLNDLV RLAAAMGTEF PFDEYAEVQK LPVIDVEFSH LVGMDACQGT LTLLDTPGPN EAGQPQMEVM MRDQLQKASA VLAVMDYTQM NSKADEDVRK ELNAIADVSA GRLFVLVNKF DEKDRNGDGA DAVRQKVPAM LNSDVLPASR VYPGSSRQAY LANRALHELR KNGTLPVDEA WVDDFIRGAF GCMKKEYVCK DSELATEGAT DLWEGSLIDQ LITEVIQSSH SRAAALAVDS AAAKLMQNAE NISEYLLLRH QGLQQSIQSL QSHITSLLAD IREIADCQEQ MTTDVRMAME EIDTKTRELL TGVCTSLEEE LNDYFRSGKR KEQQMLEEED AEQRRSQSGL WGKISQWSGI NNQGREDYRK RDFAPDSPEI KFSDRREALE LMTQIESTVT SLHREAEAQF RPELEKIVSG IETGFRGTAL YATENIAGRI NARLEDEGFT VKISFPAVSQ LQTRLAVKIN LSALMEERTE TVTRRRRQSG VWGTVCRWFG TSDLGWENYD EDVSRSVINI NKVREEVMSL TRAYFGELQA SIEQDINQPV RQEIDAFFCA FREKVEQLRN TLIQSSEDHK RDQQAQERLT GRLQALNERV PELITDSKAL REELETML
|
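The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of stripping that grouping and computing a GC fraction. The helper names are hypothetical, and the exact method the database uses to derive its GC content field is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence in this record.
print(clean_sequence("ATGCACGAAA AGAACATCGC"))  # ATGCACGAAAAGAACATCGC
print(gc_fraction("ATGCACGAAA AGAACATCGC"))     # 0.45
```

Applying `gc_fraction` to the full gene sequence should give a value comparable to the GC content field, although that result is not asserted here.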