Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2653 |
Symbol | |
ID | 5591508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2664820 |
End bp | 2666160 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640921768 |
Product | hypothetical protein |
Protein accession | YP_001459295 |
Protein GI | 157161977 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.00498534 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATATA TCGAAAGTGA AAAGGATGTA AGTGATCGCT TAGGTATGTA CTTTATCTCT GTCTGGGAAG ATTGCGATAC CGTTGTTGCC GCTGGCATAC GGGAATTTCG AGAGCGATGG GCAAACTACA CCGTACATTC AAAAAGAAAG GACCCAAGAA GCCATTTGAA GGGTATTTCA TCCTACGACA AACGAGAAGC GGATAAGCGT CCTATTGGAG AACTTATCTT AGAGGTGATT GATCCTAAAG TTTCAGCATT CCTTAACTCT CTTCCTGAGA GTCATTTTCA ATTCTTTCCT ATCCCCTATA CGAATATTGC TCACTCTCCC TTACCACGTT CAAATGATAA ATTTACCAGT ACTGAAAGCG TTCCTAGTTT TTCGCCTATT AAATATGGTT CTAAATTTAA AGAAGATACT CCGATAACCA ATCTAGTGAT GGTTATGGGT ATCAAAAATG CTCATGATGT TATTAAACAG CAACTAAGGC GTTTTGAAAA AACGGAACAT CAATATCGAC CTATTGATCT TAAGTTTCAA GCAACCGTGG ATAATCTGCT TGAGCTATTA TGGCAGCTTC ACCTTACACC GACGCGCTTC AAGCAACACA GTGCAGACAC AAAACTCAAT GCGCGACGAA AGCAAACTTT CTGCGAGCTG TGTGGCCAAA GAAATGAACT TGCAGAATAT TTCTATAAGC TAGATAACAA TATGCTAGAA CTGGAAGATG AGATAGAAAG TCACAACGAA CAGAATCCTG ATAATCAGAA AAAACTACAA CTCAGCCACA GGTATTGTTC TTACCATAAA CCGAAACACA AAAATGGCTG TACGTGGAAC TCCGCTTACA AGAGTGCTCT GCACTCAAAA GACCAATTCG AGAATGAATT GCAGAGATTG CAACTTCACA TTGTCAAAGT CGAAGAGCTT AAAGTCATTT CTAGAGATGA ACTAGTTGAC CTTTATTTCT ATCATTTCCT CCAAGATAAA TGCGTCACTC AGAAACAAAG TGACGCATTT TTCCATTACG TTAGGGATAA TTTTAATTAC CCAATCGTAA TTAAGGAAGA AACAGAAAGA CTCATTCATG AAGCAGCCGT GCGCTTGACT GGAGCTGGTA CGACTCTTGG AGCTGATGAT GTCGGAAAAC TGCGAGATAT CGCTCGCCAC ATGGTTGACT CACGATTAAC AGATAGTAAG AAACGAATGC TCGTTCTTAA GAAACAAGGA TTCAATCAGA GACACATTGC AGATAAGTTA ACGGAAATTG AGGGAAGAAC TATTTCACCC CAGGCGGTTT CTAAAGCTTT GAAAAGTGTA GATAGTAACT TTAATATTTA A
|
Protein sequence | MAYIESEKDV SDRLGMYFIS VWEDCDTVVA AGIREFRERW ANYTVHSKRK DPRSHLKGIS SYDKREADKR PIGELILEVI DPKVSAFLNS LPESHFQFFP IPYTNIAHSP LPRSNDKFTS TESVPSFSPI KYGSKFKEDT PITNLVMVMG IKNAHDVIKQ QLRRFEKTEH QYRPIDLKFQ ATVDNLLELL WQLHLTPTRF KQHSADTKLN ARRKQTFCEL CGQRNELAEY FYKLDNNMLE LEDEIESHNE QNPDNQKKLQ LSHRYCSYHK PKHKNGCTWN SAYKSALHSK DQFENELQRL QLHIVKVEEL KVISRDELVD LYFYHFLQDK CVTQKQSDAF FHYVRDNFNY PIVIKEETER LIHEAAVRLT GAGTTLGADD VGKLRDIARH MVDSRLTDSK KRMLVLKKQG FNQRHIADKL TEIEGRTISP QAVSKALKSV DSNFNI
|
| |