Gene Information |
Locus tag | EcHS_A0500 |
Symbol | |
ID | 5592967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 510806 |
End bp | 511801 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640919683 |
Product | hypothetical protein |
Protein accession | YP_001457268 |
Protein GI | 157159950 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
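The coordinate and length fields above are related by simple arithmetic, and a minimal sketch can make that relationship explicit. It assumes the Start bp and End bp values are 1-based and include both endpoints, and that the CDS is complete and ends in a stop codon; the function names are illustrative and are not part of the database.

```python
# Values copied from the record above; function names are illustrative only.
START_BP = 510806
END_BP = 511801


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based coordinates that include both endpoints."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming a complete unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 996
print(protein_length(length_bp))  # 331
```

Under these assumptions the results agree with the listed Gene Length (996 bp) and Protein Length (331 aa).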
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 0.113871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATTT GGTGGTTAAT TATTATGGAA ATATATGACA GTGAGGACAT AAAGGTATAC ATGTTAACTA AAAGAATATT ATTTTCTGTA ATGCTGATAG TTTCACCAAG CGTGGTGGCC AGCGAGAAAG CGCATGAATT GTATGACAGC ATATACGGTG GTAAACCTGC TCCCGATGTT ATAAATACAC TGCATAAAAT GGCTGAATCG GGAGATATCG ACGCGCAGAG CCTGTTAGGC TGGGAATATT ATCAGCCTCG TTATGATACC AAACCTGATG TTCAGGAAGC GATTAAATGG TTCGAGTTAG CGGCTAAACA AGGTGACAGA GAAGCTCCGT TAGCGCTGGG GGGTATCTAC TACGACGGTG AGCAGGTGCG GGTGGATTAT GCCAAAGCGT ATGCACTATT TAATCAGGCC GCGCAGCACG GTGTGAATTT AGCATGGTCC AGGTTGGGCA TTATGTACGC CAATGGTCAG TATGTTGAGG TAGATTGTAA GAAAGCGAAG GAATATCTTG ATAAAGGTGT CCACATTTAT GGTGGCCCAG AAGACTTTCT GGCTACTTGT CGAAAAGACA TGATTGACAG AAAAACGGTT GACGATACGT TACCCGTGAT TACGGTTACC CGTTCTGGGA TGCGAGATAA TTTTTTAGAT AAAGGTTTTT CCTGCATGGA TAGCCTCTTC GCCACCACCA ATAAATTAGG CGAAGTGGCC AATCTTCGCG TGACATTCAG CATTCGTCGC CCATCCGGTA AGGAGATTAA CCAAACGGTT GGCTTCGCGC CTTTCGGTTT AAACCGGCTG AATATTAGCT TTACTGATTA CCTATTTGGT TCATTTACCA GTAATTCGTC TCTTATTCTT TATAAACCGG AATTTGAGCG AAAATCTTGT GCAACTGTAA GGACCACAAT CGTTGCCGCA ACGGCAACCA TTAACGGCAA GGATGTGGAG TTGCTGAAAA CGGGGGCAAT TGAACAAAAG TGGTAA
|
Protein sequence | MDIWWLIIME IYDSEDIKVY MLTKRILFSV MLIVSPSVVA SEKAHELYDS IYGGKPAPDV INTLHKMAES GDIDAQSLLG WEYYQPRYDT KPDVQEAIKW FELAAKQGDR EAPLALGGIY YDGEQVRVDY AKAYALFNQA AQHGVNLAWS RLGIMYANGQ YVEVDCKKAK EYLDKGVHIY GGPEDFLATC RKDMIDRKTV DDTLPVITVT RSGMRDNFLD KGFSCMDSLF ATTNKLGEVA NLRVTFSIRR PSGKEINQTV GFAPFGLNRL NISFTDYLFG SFTSNSSLIL YKPEFERKSC ATVRTTIVAA TATINGKDVE LLKTGAIEQK W
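The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a rough sketch, not a tool supplied by the database: it strips the block spacing, computes a GC percentage that can be compared against the listed GC content, and checks that a DNA string has the shape of a complete CDS under translation table 11 (stop codons TAA, TAG, TGA). All function names are hypothetical.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(dna) >= 3 and len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# Usage with the first two display blocks of the gene sequence above.
fragment = clean_sequence("ATGGATATTT GGTGGTTAAT")
print(fragment)       # ATGGATATTTGGTGGTTAAT
print(len(fragment))  # 20
```

To check the full record, paste the whole Gene sequence value into `clean_sequence` and pass the result to `gc_percent` and `looks_like_complete_cds`.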