Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4189 |
Symbol | |
ID | 5591201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4178444 |
End bp | 4180177 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640923291 |
Product | hypothetical protein |
Protein accession | YP_001460750 |
Protein GI | 157163432 |
COG category | [R] General function prediction only |
COG ID | [COG2194] Predicted membrane-associated, metal-dependent hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATTCCA CAGAAGTCCA GGCTAAACCT CTTTTTAGCT GGAAAGCCCT GGGTTGGGCA CTGCTCTACT TTTGGTTTTT CTCTACTCTG CTACAGGCCA TTATTTACAT CAGTGGTTAT AGTGGCACTA ACGGCATTCG CGACTCGCTG TTATTCAGTT CGCTGTGGTT GATCCCGGTA TTCCTCTTTC CGAAGCGGAT TAAAATTATT GCCGCAGTAA TCGGCGTGGT GCTATGGGCG GCCTCTCTGG CGGCGCTGTG CTACTACGTC ATCTACGGTC AGGAGTTCTC GCAGAGCGTT CTGTTTGTGA TGTTCGAAAC CAACACCAAC GAAGCCAGCG AGTATTTAAG CCAGTATTTC AGCCTGAAAA TTGTGCTTAT CGCGCTGGCC TATACGGCGG TGGCAGTTCT GCTGTGGACA CGCCTGCGCC CGGTCTATAT TCCAAAGCCG TGGCGTTATG TTGTCTCTTT TGCCCTGCTT TATGGCTTGA TTCTGCATCC GATCGCCATG AATACGTTTA TCAAAAACAA GCCGTTTGAG AAAACGTTGG ATAACCTGGC CTCGCGTATG GAGCCTGCCG CACCGTGGCA ATTCCTGACC GGCTATTATC AGTATCGTCA GCAACTAAAC TCGCTAACAA AGCTACTGAA TGAAAATAAT GCCTTGCCGC CGCTGGCTAA TTTCAAAGAT GAATCGGGTA ACGAACCGCG CACCTTAGTG CTGGTGATTG GCGAGTCAAC CCAGCGTGGA CGCATGAGTC TGTACGGTTA TCCGCGTGAA ACCACGCCGG AGCTGGATGC GCTGCATAAA ACCGATCCGA ATCTGACCGT GTTTAATAAC GTGGTTACGT CTCGTCCGTA CACCATTGAA ATCCTGCAAC AGGCGCTGAC CTTTGCCAAT GAAAAGAACC CGGACCTGTA TCTGACGCAG CCGTCGCTGA TGAACATGAT GAAACAGGCG GGTTATAAAA CCTTCTGGAT CACCAACCAG CAGACGATGA CCGCCCGCAA TACCATGCTG ACGGTATTTT CGCGCCAGAC CGACAAGCAG TACTACATGA ACCAGCAACG TACGCAGAGT GCGCGTGAAT ACGACACCAA CGTGCTGAAG CCGTTCCAGG ATGTGCTGAA TGACCCTGCG CCGAAGAAAC TGATCATCGT TCATCTGCTG GGTACGCATA TCAAATACAA ATACCGCTAC CCGGAAAATC AGGGCAAGTT TGATGGCAAT ACCGATCATG TTCCGCCAGG ATTAAGCGCA GAAGAGCTGG AATCATATAA CGATTATGAC AACGCTAACC TGTATAACGA TCATGTGGTT GCCAGCCTGA TTAAAGACTT TAAAGCGGCA GACCCGAACG GATTCCTGGT TTACTTCTCT GACCACGGTG AAGAGGTTTA CGACACGCCG CCTCATAAAA CCCAGGGGCG TAATGAGGAC AACCCGACGC GTCATATGTA CACCATTCCG TTCCTGCTGT GGACGTCGGA AAAATGGCAA GCGACTCATC CCCGTGATTT CTCACAGGAT GTCGATCGTA AATACAGCCT GGCGGAACTG ATCCACACCT GGTCAGATTT GGCGGGCTTA TCTTACGACG GTTACGATCC AACCCGTTCA GTGGTGAATC CGCAGTTCAA AGAAACTACC CGCTGGATTG GTAACCCGTA CAAGAAAAAC GCGCTGATCG ATTACGACAC ACTGCCGTAT GGCGATCAGG TGGGTAATCA GTAA
|
Protein sequence | MHSTEVQAKP LFSWKALGWA LLYFWFFSTL LQAIIYISGY SGTNGIRDSL LFSSLWLIPV FLFPKRIKII AAVIGVVLWA ASLAALCYYV IYGQEFSQSV LFVMFETNTN EASEYLSQYF SLKIVLIALA YTAVAVLLWT RLRPVYIPKP WRYVVSFALL YGLILHPIAM NTFIKNKPFE KTLDNLASRM EPAAPWQFLT GYYQYRQQLN SLTKLLNENN ALPPLANFKD ESGNEPRTLV LVIGESTQRG RMSLYGYPRE TTPELDALHK TDPNLTVFNN VVTSRPYTIE ILQQALTFAN EKNPDLYLTQ PSLMNMMKQA GYKTFWITNQ QTMTARNTML TVFSRQTDKQ YYMNQQRTQS AREYDTNVLK PFQDVLNDPA PKKLIIVHLL GTHIKYKYRY PENQGKFDGN TDHVPPGLSA EELESYNDYD NANLYNDHVV ASLIKDFKAA DPNGFLVYFS DHGEEVYDTP PHKTQGRNED NPTRHMYTIP FLLWTSEKWQ ATHPRDFSQD VDRKYSLAEL IHTWSDLAGL SYDGYDPTRS VVNPQFKETT RWIGNPYKKN ALIDYDTLPY GDQVGNQ
|
| |