Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0341 |
Symbol | |
ID | 5595010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 352200 |
End bp | 354284 |
Gene Length | 2085 bp |
Protein Length | 694 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640919526 |
Product | hypothetical protein |
Protein accession | YP_001457112 |
Protein GI | 157159794 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4930] Predicted ATP-dependent Lon-type protease |
TIGRFAM ID | [TIGR02653] conserved hypothetical protein [TIGR02688] conserved hypothetical protein TIGR02688 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.168371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACCC ATCATGATTT ACCTGTTTCA GGCGTATCCG CAGGGGAAAT TGCCTCCGAG GGTTACGATC TGGACGCCCT GCTGAACCAG CATTTTGCTG GTCGTGTGGT GCGTAAAGAT CTCACCAAGC AACTCAAGGA AGGGGCAAAC GTCCCGGTGT ATGTGCTGGA GTATCTGCTC GGCATGTACT GCGCCTCTGA CGATGACGAC GTGGTCGAGC AAGGGTTGCA AAACGTTAAG CGTATTCTGG CTGATAACTA TGTGCGCCCG GATGAAGCAG AGAAAGTGAA GTCGCTGATC CGCGAGCGTG GTTCGTACAA AATCATCGAT AAAGTGTCGG TGAAGCTAAA CCAGAAAAAA GACGTTTACG AAGCCCAGCT TTCTAACCTC GGCATCAAAG ACGCGCTGGT GCCATCGCAG ATGGTTAAAG ACAACGAGAA GCTACTAACG GGCGGTATCT GGTGCATGAT TACCGTCAAC TATTTCTTTG AAGAAGGGCA GAAGACTTCG CCCTTCTCAT TGATGACGCT TAAGCCTATC CAGATGCCGA ATATGGATAT GGAAGAAGTG TTCGATGCGC GTAAACACTT TAACCGTGAT CAGTGGATCG ATGTGCTGCT GCGCTCAGTG GGTATGGAGC CCGCCAATAT TGAGCAACGC ACCAAATGGC ACCTTATCAC CCGTATGATC CCGTTCGTGG AGAACAACTA TAACGTTTGC GAGCTGGGGC CGCGTGGCAC CGGTAAAAGC CATGTGTATA AAGAGTGTTC TCCTAACTCT CTGTTAGTTT CTGGCGGGCA AACGACCGTT GCCAACTTGT TCTACAACAT GGCCAGTCGC CAGATCGGCC TGGTTGGCAT GTGGGATGTG GTAGCGTTCG ACGAAGTCGC GGGGATCACT TTCAAAGATA AAGACGGCGT GCAAATCATG AAAGATTACA TGGCGTCAGG ATCTTTCTCT CGCGGCAGAG ATTCGATTGA AGGTAAAGCG TCGATGGTTT TCGTCGGCAA CATCAATCAA AGCGTAGAGA CTCTCGTTAA AACCAGCCAT TTGCTGGCGC CATTTCCGGC TGCGATGATT GATACTGCAT TTTTCGACCG CTTTCATGCC TATATTCCCG GTTGGGAAAT CCCCAAAATG CGCCCGGAAT TTTTTACCAA CCGTTACGGG CTGATTACGG ATTATCTCGC TGAATATATG CGCGAAATGC GCAAACGCAG TTTCTCTGAT GCGATTGATA AATTCTTTAA GCTGGGTAAC AACCTCAACC AGCGTGACGT TATTGCCGTT CGACGTACCG TGTCGGGGTT GTTAAAACTC ATGCATCCCG ATGGCGCGTA CAGCAAAGAA GATGTGCGAG TCTGCCTGAC CTATGCGATG GAAGTTCGTC GCCGCGTGAA AGAGCAACTT AAAAAACTGG GCGGTCTGGA GTTCTTCGAT GTGAACTTTA GCTACATCGA CAACGAAACG CTGGAAGAGT TTTTTGTGAG CGTACCGGAA CAGGGCGGCA GCGAACTTAT TCCTGCCGGA ATGCCAAAGC CGGGTGTTGT GCATCTGGTC ACTCAGGCAG AAAGCGGCAT GACCGGGCTG TATCGTTTTG AAACACAGAT GACTGCCGGT AATGGTAAGC ATAGTGTATC GGGTCTGGGT TCAAATACCT CCGCGAAAGA AGCTATCCGC GTCGGTTTCG ATTACTTCAA AGGCAATTTG AATCGGGTAA GCGCGGCCGC GAAATTCTCC GATCATGAAT ATCACCTTCA TGTCGTTGAA CTGCATAATA CTGGCCCAAG CACCGCAACC AGTCTTGCTG CGCTTATCGC TTTATGTTCG ATATTGCTGG CAAAACCGGT GCAGGAACAG ATGGTGGTGT TGGGCAGTAT GACGCTTGGT GGGGTAATTA ACCCGGTGCA GGATCTTGCC GCCAGTTTAC AGCTCGCCTT CGACAGCGGT GCAAAACGGG TTCTGTTGCC GATGTCCTCG GCTATGGATA TTCCAACGGT TCCGGCAGAG TTATTTACCA AGTTTCAGGT GAGTTTTTAC TCAGACCCGG TTGATGCTGT TTATAAGGCG CTGGGTGTGA ATTAA
|
Protein sequence | MQTHHDLPVS GVSAGEIASE GYDLDALLNQ HFAGRVVRKD LTKQLKEGAN VPVYVLEYLL GMYCASDDDD VVEQGLQNVK RILADNYVRP DEAEKVKSLI RERGSYKIID KVSVKLNQKK DVYEAQLSNL GIKDALVPSQ MVKDNEKLLT GGIWCMITVN YFFEEGQKTS PFSLMTLKPI QMPNMDMEEV FDARKHFNRD QWIDVLLRSV GMEPANIEQR TKWHLITRMI PFVENNYNVC ELGPRGTGKS HVYKECSPNS LLVSGGQTTV ANLFYNMASR QIGLVGMWDV VAFDEVAGIT FKDKDGVQIM KDYMASGSFS RGRDSIEGKA SMVFVGNINQ SVETLVKTSH LLAPFPAAMI DTAFFDRFHA YIPGWEIPKM RPEFFTNRYG LITDYLAEYM REMRKRSFSD AIDKFFKLGN NLNQRDVIAV RRTVSGLLKL MHPDGAYSKE DVRVCLTYAM EVRRRVKEQL KKLGGLEFFD VNFSYIDNET LEEFFVSVPE QGGSELIPAG MPKPGVVHLV TQAESGMTGL YRFETQMTAG NGKHSVSGLG SNTSAKEAIR VGFDYFKGNL NRVSAAAKFS DHEYHLHVVE LHNTGPSTAT SLAALIALCS ILLAKPVQEQ MVVLGSMTLG GVINPVQDLA ASLQLAFDSG AKRVLLPMSS AMDIPTVPAE LFTKFQVSFY SDPVDAVYKA LGVN
|
| |