Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1148 |
Symbol | |
ID | 5593232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1158919 |
End bp | 1159905 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640920307 |
Product | hypothetical protein |
Protein accession | YP_001457871 |
Protein GI | 157160553 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.00962043 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCCTG ATTATTTAAC TTTTATTCGC TTTCAGGATA AACGAAATCT GATATACATT TATGCTATTG GACTTATTCT GATAGGCTTT TATTGGAAGA ATGCAGGGTT TATTTTTCCA TCAGAGGATA TTGGTGTAGT TAGTGGGATT CTGGCTCTGG TGCTGTATAA TTTTATTTTT GATCTCAAGG CGTACTGGGC TTATAAATGC GTCACGAAGA ATATCGATTT TTCGTGGTTT AAGAAAAAGC AGAACCACAA AATAGAATTA TTTCTTACAC AACCTCTGGT GGCAGGATTT CTGTCGTTAA TCATGTTGAG TGCAATGAGT TGGGGGCTAT ACCAGCTTCT ACCCTCGTTA TATGCGCTGT TCCTGATTTC GTTACTTGGG CCGTTGGTCA TCTTTCTGCT GTTTCGGATG ATCCGCACCA GTTATGTCAA GCAGGTCGCT ATTTCAGTAG CGAAAAAAGT AAAATATAAA AGTCTGACTC GCTATGTGCT GCTTTCGGTG TGCATCTCAA CGGTTGTTAA CCTGCTTACT ATCAGCCCGT TGCGTAACAG TGATTCTTTT GTGACAGAGG GGCAGTGGTT AACGTTTAAA TCGATAATTG CATTGCTCAT TCTTTGTGGC GTAGTGTTGG CGATTAATCT GTTTTTTCTG CGCTTCTCCA AGCGGTACGC TTTTCTGGGC AGGCTTTTTT TGCAGGAAAT CGATCTGTTT TTCTCCAGTG AAAATGCGTT GTCGACCTTT TTTGCCAAGC CGCTTTGGCT TCGGTTATTC ATATTGCTGG TTATTGAAGT GATGTGGATT ACGCTGGTGT CGGTATTGGC AACGCTTGTG GAATGGCGGA TTTGGTTTGA AGCCTATTTT TTACTCTGCT ATGTACCGTG CTTAATTTAC TATTTTTTCT ATTGTCGATT CCTCTGGCAT AACGATTTTA TGATGGCATG TGACATGTAT TTCCGTTGGG GGCATTTTAA TAAGTGA
|
Protein sequence | MIPDYLTFIR FQDKRNLIYI YAIGLILIGF YWKNAGFIFP SEDIGVVSGI LALVLYNFIF DLKAYWAYKC VTKNIDFSWF KKKQNHKIEL FLTQPLVAGF LSLIMLSAMS WGLYQLLPSL YALFLISLLG PLVIFLLFRM IRTSYVKQVA ISVAKKVKYK SLTRYVLLSV CISTVVNLLT ISPLRNSDSF VTEGQWLTFK SIIALLILCG VVLAINLFFL RFSKRYAFLG RLFLQEIDLF FSSENALSTF FAKPLWLRLF ILLVIEVMWI TLVSVLATLV EWRIWFEAYF LLCYVPCLIY YFFYCRFLWH NDFMMACDMY FRWGHFNK
|
| |