Gene Information |
Locus tag | EcSMS35_0794 |
Symbol | |
ID | 6143163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 795514 |
End bp | 797775 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641615682 |
Product | hypothetical protein |
Protein accession | YP_001742874 |
Protein GI | 170683967 |
COG category | [C] Energy production and conversion |
COG ID | [COG1048] Aconitase A |
TIGRFAM ID | |
|
|
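The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 795514
end_bp = 797775
gene_length_bp = 2262
protein_length_aa = 753


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```

Under these assumptions the reported 2262 bp and 753 aa agree with the coordinates: 754 codons, one of which is the stop codon.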
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAAGT TATCTGAAAA AGGCGTGTTT CTCGCCAGTA ATAACGAAAT AATTGCCGAA GAACATTTCA CCGGTGAAAT TAAAAAAGAA GAAGCCAAAA AAGGCACTAT TGCCTGGTCT ATTCTCTCTT CTCATAATAC GTCTGGTAAT ATGGATAAAC TTAAAATTAA GTTTGATTCA TTAGCCTCTC ACGATATTAC CTTTGTTGGT ATTGTACAGA CCGCTAAAGC GTCCGGTATG GAACGTTTCC CGCTGCCGTA TGTGCTGACC AACTGCCATA ACTCACTCTG CGCCGTCGGC GGTACTATTA ATGGCGATGA CCATATTTTT GGCTTATCGG CAGCGCAACG TTATGGCGGT ATTTTTGTGC CACCGCATAT TGCGGTCATC CATCAATATA TGCGTGAAAT GATGGCGGGC GGCGGCAAAA TGATTCTCGG GTCAGACAGC CATACCCGTT ACGGTGCATT AGGGACAATG GCAGTCGGTG AGGGTGGCGG TGAGTTGGTA AAACAGCTGC TTAATGACAC CTGGGATATC GACTATCCGG GAGTTGTTGC GGTGCATTTG ACCGGAAAAC CTGCGCCGTA TGTGGGGCCA CAGGATGTTG CGCTGGCTAT CATCGGTGCC GTGTTCAAAA ACGGCTACGT CAAAAACAAA GTGATGGAAT TCGTAGGTCC CGGTGTTGCT GCGCTCTCTA CCGATTTCCG TAACAGCGTT GACGTGATGA CCACTGAAAC GACCTGTTTA AGTTCTGTCT GGCAAACCGA TGAAGAAGTC CATAACTGGC TGGCGCTGCA CGGTCGCGGC CAGGATTACT GCCAGCTTAA CCCTCAACCG ATGGCGTACT ACGATGGCTG CATCAGCGTT GATTTAAGCG CCATCAAACC AATGATTGCG CTGCCGTTTC ACCCGAGCAA CGTGTATGAA ATCGACACGC TGAACCAGAA CCTGACCGAC ATTCTGCGTG AGATTGAAAT TGAGTCCGAG CGCGTGGCGC ACGGTAAAGC CAAACTCTCG CTGCTGGATA AAGTCGAAAA CGGTCGCCTG AAAGTGCAGC AGGGGATTAT CGCGGGCTGT TCTGGCGGTA ACTACGAAAA CGTCATCGCG GCGGCGAATG CACTGCGCGG TCAATCCTGT GGCAATGACA CCTTCTCGTT AGCGGTTTAC CCGTCATCAC AGCCCGTGTT TATGGATCTC GCCAAAAAAG GTGTGGTAGC AGATTTGATT GGCGCAGGCG CAATCATCAG AACCGCGTTC TGCGGCCCAT GCTTTGGCGC GGGCGATACA CCAATCAATA ACGGTTTGAG TATTCGCCAC ACCACGCGTA ACTTCCCGAA CCGCGAAGGC TCTAAGCCAG CTAATGGGCA GATGTCAGCG GTGGCGTTGA TGGACGCTCG TTCTATCGCT GCGACTGCGG CAAACGGTGG CTATTTAACC TCTGCCAGCG AGCTGGATTG CTGGGACAAC GTGCCGGAGT ACGCCTTCGA TGTAACGCCG TATAAAAACC GTGTTTATCA GGGCTTTGTG AAAGGGGCGA CCCAGCAACC GCTGATTTAC GGACCGAACA TTAAAGACTG GCCAGAATTG GGTGCGCTGA CTGACAATAT CGTCCTCAAA GTGTGCTCGA AGATCCTCGA CGAAGTGACC ACCACCGACG AACTGATTCC TTCCGGTGAA ACCTCTTCAT ATCGTTCAAA TCCGATTGGT CTGGCGGAGT TTACTCTGTC ACGCCGCGAT CCAGGTTATG TTGGCAGAAG TAAAGCGACT GCCGAGCTGG AAAATCAGCG TCTGGCGGGA 
AATGTCAGCG AGCTGACAGA GGTGTTTGCG CGCATTAAGC AGATTGCTGG TCAGGAGCAT ATTGATCCGC TGCAAACTGA AATTGGCAGT ATGGTCTATG CGGTGAAACC AGGCGATGGT TCTGCGCGTG AACAGGCGGC GAGTTGTCAG CGTGTGATTG GCGGTCTGGC GAATATTGCC GAGGAGTACG CGACTAAACG CTATCGTTCT AACGTCATCA ACTGGGGGAT GTTACCGCTG CAGATGGCGG AAGCGCCAAC CTTTGAAGTG GGGGATTACA TTTACATCCC TGGCATTAAA GCGGCACTGG ATAATCCGGG TACGACGTTT AAAGGTTATG TGATCCATGA AGATGCGCCG GTAACGGAAA TTACGCTTTA CATGGAAAGC CTGACGGCCG AAGAGCGCGA GATTATCAAG GCGGGTAGTT TGATTAACTT CAATAAAAAC CGTCAGATGT AA
|
Protein sequence | MIKLSEKGVF LASNNEIIAE EHFTGEIKKE EAKKGTIAWS ILSSHNTSGN MDKLKIKFDS LASHDITFVG IVQTAKASGM ERFPLPYVLT NCHNSLCAVG GTINGDDHIF GLSAAQRYGG IFVPPHIAVI HQYMREMMAG GGKMILGSDS HTRYGALGTM AVGEGGGELV KQLLNDTWDI DYPGVVAVHL TGKPAPYVGP QDVALAIIGA VFKNGYVKNK VMEFVGPGVA ALSTDFRNSV DVMTTETTCL SSVWQTDEEV HNWLALHGRG QDYCQLNPQP MAYYDGCISV DLSAIKPMIA LPFHPSNVYE IDTLNQNLTD ILREIEIESE RVAHGKAKLS LLDKVENGRL KVQQGIIAGC SGGNYENVIA AANALRGQSC GNDTFSLAVY PSSQPVFMDL AKKGVVADLI GAGAIIRTAF CGPCFGAGDT PINNGLSIRH TTRNFPNREG SKPANGQMSA VALMDARSIA ATAANGGYLT SASELDCWDN VPEYAFDVTP YKNRVYQGFV KGATQQPLIY GPNIKDWPEL GALTDNIVLK VCSKILDEVT TTDELIPSGE TSSYRSNPIG LAEFTLSRRD PGYVGRSKAT AELENQRLAG NVSELTEVFA RIKQIAGQEH IDPLQTEIGS MVYAVKPGDG SAREQAASCQ RVIGGLANIA EEYATKRYRS NVINWGMLPL QMAEAPTFEV GDYIYIPGIK AALDNPGTTF KGYVIHEDAP VTEITLYMES LTAEEREIIK AGSLINFNKN RQM
|
| |
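As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below translates the first 30 bases of the gene sequence above and computes the GC percentage of that fragment. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons). The names `CODON_TABLE`, `translate`, and `gc_content` are illustrative and do not belong to any particular library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence in this record.
prefix = "ATGATCAAGT TATCTGAAAA AGGCGTGTTT"
print(translate(prefix))  # MIKLSEKGVF
print(round(gc_content(prefix), 1))
```

The translated fragment matches the first ten residues of the protein sequence above. The GC value printed here describes only the 30-base fragment; applying `gc_content` to the complete gene sequence would be the way to compare against the reported 51%.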