Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1846 |
Symbol | |
ID | 5591030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1861913 |
End bp | 1862953 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640920990 |
Product | hypothetical protein |
Protein accession | YP_001458542 |
Protein GI | 157161224 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.000163018 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG TTTTATTACA AAACCATCCT GGGAGCGAGA AGTATTCTTT TAATGGCTGG GAAATATTTA ATAGTAATTT TGAACGGATG ATTAAAGAAA ATAAGGCCAT GCTGCTTTGT AAGTGGGGGT TTTATTTAAC ATGTGTTGTC GCTGTAATGT TTGTGTTCGC AGCGATAACA TCCAACGGTT TGAATGAAAG AGGCCTGATT ACCGCGGGAT GCTCTTTTCT TTATCTATTA ATTATGATGG GGCTTATTGT TCGGGCCGGT TTTAAAGCAA AAAAAGAACA ACTGCATTAT TATCAGGCTA AAGGTATTGA GCCGCTCAGT ATCGAGAAGT TACAGGCGCT ACAATTGATC GCACCTTATC GATTCTATCA TAAGCAATGG TCTGAAACGC TGGAGTTCTG GCCGCGAAAG CCTGAACCTG GCAAAGATAC CTTCCAATAT CATGTGCTTC CTTTTGACTC GATCGATATC ATAAGTAAAA GACGCGAGTC TTTAGAGGAT CAATGGGGTA TCGAAGATAG CGAAAGTTAT TGTGCCTTAA TGGAGCATTT TCTTTCTGGC GACCATGGAG CCAATACCTT TAAAGCAAAC ATGGAGGAAG CCCCAGAGCA GGTTATCGCC TTGTTGAATA AATTTGCTGT TTTTCCCTCA GACTATATCT CTGATTGCGC TAATCATAGC TCCGGTAAAT CCTCGGCGAA GCTAATATGG GCGGCGGAAT TATCATGGAT GATCTCGATA TCAAGCACAG CTTTTCAAAA CGGGACAATT GAAGAAGAAC TGGCCTGGCA TTATATAATG CTTGCTTCTC GAAAGGCGCA CGAGTTGTTC GAAAGCGAAG AAGATTATCA AAAAAATAGT CAAATGGGAT TTCTTTACTG GCATATCTGC TGCTATCGCA GAAAGTTAAC GGATGCTGAA CTTGAAGCGT GTTATCGTTA CGACAAGCAG TTTTGGGAGC ACTACAGTAA GAAATGCCGT TGGCCCATAA GAAATGTTCC GTGGGGAGCA TCATCCGTTA AATACTCATA A
|
Protein sequence | MKKVLLQNHP GSEKYSFNGW EIFNSNFERM IKENKAMLLC KWGFYLTCVV AVMFVFAAIT SNGLNERGLI TAGCSFLYLL IMMGLIVRAG FKAKKEQLHY YQAKGIEPLS IEKLQALQLI APYRFYHKQW SETLEFWPRK PEPGKDTFQY HVLPFDSIDI ISKRRESLED QWGIEDSESY CALMEHFLSG DHGANTFKAN MEEAPEQVIA LLNKFAVFPS DYISDCANHS SGKSSAKLIW AAELSWMISI SSTAFQNGTI EEELAWHYIM LASRKAHELF ESEEDYQKNS QMGFLYWHIC CYRRKLTDAE LEACYRYDKQ FWEHYSKKCR WPIRNVPWGA SSVKYS
|
| |