Gene Information |
Locus tag | EcHS_A4475 |
Symbol | |
ID | 5594926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4479307 |
End bp | 4483086 |
Gene Length | 3780 bp |
Protein Length | 1259 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640923573 |
Product | hypothetical protein |
Protein accession | YP_001461014 |
Protein GI | 157163696 |
COG category | [S] Function unknown |
COG ID | [COG2911] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
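The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is only a consistency check on the listed values. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4479307
end_bp = 4483086
gene_length_bp = 3780
protein_length_aa = 1259

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and does not add a residue.
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 3780 bp is 1260 codons, and 1260 codons minus one stop codon gives 1259 aa.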
|
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTAT GGAAAAAAAT CAGCCTCGGC GTGGTTATCG TTATCTTACT GTTGCTGGGA TCGGTGGCGT TTCTGGTGGG CACCACCAGC GGCCTGCATC TGGTATTTAA AGCGGCGGAT CGCTGGGTGC CAGGACTGGA TATTGGCAAG GTCTCCGGCG GCTGGCGCGA TCTCACCTTG TCTGACGTTC GTTATGAGCA GCCAGGCGTG GCGGTAAAAG CGGGTAATCT GCATCTGGCG GTCGGTCTTG AGTGCCTGTG GAACAGTAGC GTTTGTATTA ATGACCTGGC GCTGAAAGAC ATTCAGGTCA ACATCGACAG TAAAAAAATG CCTCCTTCTG AACAGGTTGA AGAAGAGGAA GATAGCGGTC CGCTGGATCT CTCCACGCCG TATCCCATCA CCCTGACACG GGTGGCGCTG GACAACGTCA ACATCAAGAT TGATGACACC ACGGTGTCGG TGATGGACTT CACCTCCGGC CTGAACTGGC AGGAGAAAAC CCTGACCCTG AAACCGACGT CGCTGAAAGG CCTGCTGATT GCTCTGCCGA AAGTGGCGGA AGTGGCGCAG GAAGAAGTGG TCGAACCGAA AATTGAAAAT CCGCAGCCGG AGGAAAAGCC GCTCGGCGAA ACGCTGAAAG ATCTCTTTTC TCGCCCGGTA TTGCCGGAAA TGACCGACGT GCATTTGCCG CTTAACCTGA ACATTGAAGA GTTTAAAGGC GAGCAGTTAC GCGTGACGGG CGACACGGAC ATCACCGTGC GCACCATGCT GCTGAAAGTG AGCAGCATTG ACGGCAATAC TAAACTGGAC GCCCTGGATA TCGATTCCAG TCAAGGGATC GTCAACGCCA GCGGCACGGC GCAGCTGTCA GACAACTGGC CGGTGGATAT CACCCTCAAC AGCACACTAA ACGTGGAGCC GTTGAAAGGA GAAAAACTGA AGCTGAAAGT GGGCGGTGCG CTGCGCGAAC AGCTGGAGAT TGGCGTTAAC CTTTCCGGTC CGGTGGATAT GGATTTACGC GCCCAGACGC GACTGGCGGA AGCCGGATTG CCACTCAACG TGGAAGTGAA CAGCAAACAG CTTTACTGGC CGTTCACCGG TGAGAAGCAG TATCAGGCGG ATGATCTGAA ACTGAAACTT ACCGGCAAAA TGACCGATTA CACGCTCTCT ATGCGTACGG CAGTGAAGGG ACTGGAGATC CCGCCAGCCA CCATTACCCT TGATGCCAAA GGTAATGAAC AGCAGGTCAA TCTCGACAAA CTCACCGTCG CGGCGCTGGA AGGGAAAACT GAACTCAAGG CGTTGCTCGA CTGGCAGCAG GCCATTAGTT GGCGCGGTGA GCTAACGCTT AACGGCATTA ACACCGCCAA AGAGATCCCA GAGTGGCCGT CGAAACTCAA TGGCTTGATT AAAACCCGCG GTAGCCTGTA CGGCGGCACC TGGCAGATGG AGGTGCCAGA ACTGAAGCTG ACCGGTAACG TCAAACAGAA CAAAGTGAAC GTTGACGGCA CGCTGAAAGG CAACAGTTAT ATGCAGTGGA TGATCCCAGG GCTTCATCTG GAACTGGGGC CAAACAGTGC CGAAGTGAAA GGCGAGCTGG GGGTAAAAGA TCTCAATCTT GATGCCACCA TCAACGCGCC GGGGCTGGAT AACGCGCTGC CGGGGCTTGG CGGTACAGCG AAAGGGCTGG TGAAAGTACG CGGCACGGTG GAAGCGCCAC AACTACTGGC AGATATCACC GCGCGCGGCC TGCGCTGGCA GGAACTTTCC GTGACGCAGG TTCGCGTGGA AGGCGACATT 
AAATCCACCG ATCAGATCGC CGGGAAACTC GACGTACGCG TTGAGCAAAT TTCGCAGCCG GATGTAAATA TCAACCTCGT CACCCTGAAT GCCAAAGGCA GCGAAAAGCA GCACGAGCTA CAGTTGCGGA TTCAGGGCGA GCCGGTTTCC GGGCAGCTTA ATCTGGCAGG AAGTTTTGAT CGCAAAGAAG AACGCTGGAA GGGAACTCTT AGCAATACCC GCTTCCAGAC GCCGGTCGGC CCGTGGTCGC TGACCCGCGA TATTGCGCTG GATTACCGCA ATAAGGAGCA AAAAATCAGC ATCGGGCCAC ACTGCTGGCT TAACCCGAAT GCGGAACTGT GCGTGCCGCA AACTATCGAT GCGGGTGCCG AAGGGCGTGC GGTGGTGAAT CTCAACCGCT TCGACCTCGC CATGCTGAAA CCGTTTATGC CAGAAACCAC TCAGGCCAGC GGTATCTTCA CGGGTAAAGC GGATGTTGCC TGGGACACCA CGAAAGAGGG GCTGCCGCAG GGCAGTATCA CCCTTTCGGG GCGTAACGTG CAGGTAACGC AAACCGTCAA CGATGCGGCG CTGCCGGTGG CGTTTCAGAC ACTGAATCTG ACGGCGGAAT TGCGTAACAA CCGTGCCGAA TTGGGCTGGA CCATCCGCCT GACCAATAAC GGCCAGTTTG ATGGACAGGT GCAGGTGACC GATCCGCAAG GCCGCCGTAA TCTTGGTGGC AACGTCAATA TCCGTAACTT CAACCTTGCG ATGATAAACC CCATCTTTAC CCGTGGGGAA AAAGCAGCGG GGATGGTGAG TGCCAACTTG CGTCTGGGTG GTGATGTGCA AAGCCCGCAG TTGTTTGGTC AGCTTCAGGT TACGGGTGTG GATATCGACG GCAACTTTAT GCCGTTTGAT ATGCAGCCGA GCCAGCTTGC GGTCAACTTT AACGGTATGC GCTCGACGCT TGCCGGTACA GTACGGACCC AGCAGGGTGA AATCTACCTG AACGGTGATG CCGACTGGAG CCAAATTGAA AACTGGCGGG CACGAGTAAC GGCGAAAGGC AGTAAAGTGC GGATCACCGT GCCGCCGATG GTACGAATGG ATGTATCGCC AGATGTTGTA TTCGAGGCTA CACCAAACCT GTTTACCCTC GATGGTCGCG TGGATGTCCC GTGGGCGCGC ATCGTGGTGC ACGATCTGCC GGAAAGCGCA GTAGGCGTCT CCAGCGATGT GGTGATGCTT AACGATAACC TGCAACCGGA AGAGCCGAAA ACGGCGTCGA TTCCGATTAA CAGTAACCTG ATTGTCCACG TTGGCAACAA TGTGCGCATT GACGCCTTTG GCCTGAAAGC GCGGCTGACG GGCGATCTTA ACGTCGTACA GGACAAACAA GGGCTGGGCC TGAACGGGCA GATCAACATC CCTGAAGGGC GCTTCCATGC CTATGGTCAG GATCTGATTG TGCGTAAGGG TGAGTTACTG TTCTCTGGTC CGCCGGATCA ACCGTATCTT AATATTGAAG CTATTCGTAA CCCGGATGCT ACAGAAGACG ACGTAATCGC CGGAGTTCGC GTCACTGGTC TGGCGGACGA ACCGAAAGCG GAGATCTTCT CTGACCCGGC GATGTCGCAA CAAGCTGCAT TGTCTTATTT GCTACGTGGA CAAGGGCTGG AGAGCGATCA GAGCGACAGT GCGGCAATGA CCTCGATGCT GATTGGTCTG GGGGTTGCGC AAAGTGGCCA GATTGTGGGT AAAATCGGCG AGACGTTTGG CGTAAGCAAT TTAGCGCTCG ACACCCAGGG AGTAGGCGAC TCCTCCCAGG 
TAGTGGTCAG CGGCTATGTA TTGCCAGGTC TGCAAGTGAA ATATGGCGTG GGTATATTTG ACTCTATAGC AACACTCACG TTACGTTATC GCCTGATGCC TAAGCTATAT CTGGAAGCCG TGTCTGGTGT AGACCAGGCA CTGGATTTGC TCTATCAGTT CGAGTTTTAG
|
Protein sequence | MSLWKKISLG VVIVILLLLG SVAFLVGTTS GLHLVFKAAD RWVPGLDIGK VSGGWRDLTL SDVRYEQPGV AVKAGNLHLA VGLECLWNSS VCINDLALKD IQVNIDSKKM PPSEQVEEEE DSGPLDLSTP YPITLTRVAL DNVNIKIDDT TVSVMDFTSG LNWQEKTLTL KPTSLKGLLI ALPKVAEVAQ EEVVEPKIEN PQPEEKPLGE TLKDLFSRPV LPEMTDVHLP LNLNIEEFKG EQLRVTGDTD ITVRTMLLKV SSIDGNTKLD ALDIDSSQGI VNASGTAQLS DNWPVDITLN STLNVEPLKG EKLKLKVGGA LREQLEIGVN LSGPVDMDLR AQTRLAEAGL PLNVEVNSKQ LYWPFTGEKQ YQADDLKLKL TGKMTDYTLS MRTAVKGLEI PPATITLDAK GNEQQVNLDK LTVAALEGKT ELKALLDWQQ AISWRGELTL NGINTAKEIP EWPSKLNGLI KTRGSLYGGT WQMEVPELKL TGNVKQNKVN VDGTLKGNSY MQWMIPGLHL ELGPNSAEVK GELGVKDLNL DATINAPGLD NALPGLGGTA KGLVKVRGTV EAPQLLADIT ARGLRWQELS VTQVRVEGDI KSTDQIAGKL DVRVEQISQP DVNINLVTLN AKGSEKQHEL QLRIQGEPVS GQLNLAGSFD RKEERWKGTL SNTRFQTPVG PWSLTRDIAL DYRNKEQKIS IGPHCWLNPN AELCVPQTID AGAEGRAVVN LNRFDLAMLK PFMPETTQAS GIFTGKADVA WDTTKEGLPQ GSITLSGRNV QVTQTVNDAA LPVAFQTLNL TAELRNNRAE LGWTIRLTNN GQFDGQVQVT DPQGRRNLGG NVNIRNFNLA MINPIFTRGE KAAGMVSANL RLGGDVQSPQ LFGQLQVTGV DIDGNFMPFD MQPSQLAVNF NGMRSTLAGT VRTQQGEIYL NGDADWSQIE NWRARVTAKG SKVRITVPPM VRMDVSPDVV FEATPNLFTL DGRVDVPWAR IVVHDLPESA VGVSSDVVML NDNLQPEEPK TASIPINSNL IVHVGNNVRI DAFGLKARLT GDLNVVQDKQ GLGLNGQINI PEGRFHAYGQ DLIVRKGELL FSGPPDQPYL NIEAIRNPDA TEDDVIAGVR VTGLADEPKA EIFSDPAMSQ QAALSYLLRG QGLESDQSDS AAMTSMLIGL GVAQSGQIVG KIGETFGVSN LALDTQGVGD SSQVVVSGYV LPGLQVKYGV GIFDSIATLT LRYRLMPKLY LEAVSGVDQA LDLLYQFEF
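As a spot check that the protein sequence corresponds to the gene sequence, the first and last 30 bases of the gene can be translated with translation table 11 and compared against the ends of the protein. This is a minimal sketch, not the database's own method. The `translate` helper and the variable names are hypothetical, the amino acid string is the NCBI table 11 assignment in TCAG codon order, and only the two short fragments copied from the listing are translated rather than the full 3780 bp.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace (the 10-base grouping) is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last 30 bases of the gene sequence, copied from the listing above.
gene_start = "ATGAGTTTAT GGAAAAAAAT CAGCCTCGGC"
gene_end = "CTGGATTTGC TCTATCAGTT CGAGTTTTAG"

print(translate(gene_start))  # MSLWKKISLG
print(translate(gene_end))    # LDLLYQFEF*
```

The first fragment reproduces the opening residues of the protein sequence, and the second reproduces its final residues followed by the stop codon (`*`), which is consistent with the gene being unspliced and on a single reading frame.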