Gene EcHS_A0341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0341 
Symbol 
ID5595010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp352200 
End bp354284 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content50% 
IMG OID640919526 
Producthypothetical protein 
Protein accessionYP_001457112 
Protein GI157159794 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4930] Predicted ATP-dependent Lon-type protease 
TIGRFAM ID[TIGR02653] conserved hypothetical protein
[TIGR02688] conserved hypothetical protein TIGR02688 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.168371 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACCC ATCATGATTT ACCTGTTTCA GGCGTATCCG CAGGGGAAAT TGCCTCCGAG 
GGTTACGATC TGGACGCCCT GCTGAACCAG CATTTTGCTG GTCGTGTGGT GCGTAAAGAT
CTCACCAAGC AACTCAAGGA AGGGGCAAAC GTCCCGGTGT ATGTGCTGGA GTATCTGCTC
GGCATGTACT GCGCCTCTGA CGATGACGAC GTGGTCGAGC AAGGGTTGCA AAACGTTAAG
CGTATTCTGG CTGATAACTA TGTGCGCCCG GATGAAGCAG AGAAAGTGAA GTCGCTGATC
CGCGAGCGTG GTTCGTACAA AATCATCGAT AAAGTGTCGG TGAAGCTAAA CCAGAAAAAA
GACGTTTACG AAGCCCAGCT TTCTAACCTC GGCATCAAAG ACGCGCTGGT GCCATCGCAG
ATGGTTAAAG ACAACGAGAA GCTACTAACG GGCGGTATCT GGTGCATGAT TACCGTCAAC
TATTTCTTTG AAGAAGGGCA GAAGACTTCG CCCTTCTCAT TGATGACGCT TAAGCCTATC
CAGATGCCGA ATATGGATAT GGAAGAAGTG TTCGATGCGC GTAAACACTT TAACCGTGAT
CAGTGGATCG ATGTGCTGCT GCGCTCAGTG GGTATGGAGC CCGCCAATAT TGAGCAACGC
ACCAAATGGC ACCTTATCAC CCGTATGATC CCGTTCGTGG AGAACAACTA TAACGTTTGC
GAGCTGGGGC CGCGTGGCAC CGGTAAAAGC CATGTGTATA AAGAGTGTTC TCCTAACTCT
CTGTTAGTTT CTGGCGGGCA AACGACCGTT GCCAACTTGT TCTACAACAT GGCCAGTCGC
CAGATCGGCC TGGTTGGCAT GTGGGATGTG GTAGCGTTCG ACGAAGTCGC GGGGATCACT
TTCAAAGATA AAGACGGCGT GCAAATCATG AAAGATTACA TGGCGTCAGG ATCTTTCTCT
CGCGGCAGAG ATTCGATTGA AGGTAAAGCG TCGATGGTTT TCGTCGGCAA CATCAATCAA
AGCGTAGAGA CTCTCGTTAA AACCAGCCAT TTGCTGGCGC CATTTCCGGC TGCGATGATT
GATACTGCAT TTTTCGACCG CTTTCATGCC TATATTCCCG GTTGGGAAAT CCCCAAAATG
CGCCCGGAAT TTTTTACCAA CCGTTACGGG CTGATTACGG ATTATCTCGC TGAATATATG
CGCGAAATGC GCAAACGCAG TTTCTCTGAT GCGATTGATA AATTCTTTAA GCTGGGTAAC
AACCTCAACC AGCGTGACGT TATTGCCGTT CGACGTACCG TGTCGGGGTT GTTAAAACTC
ATGCATCCCG ATGGCGCGTA CAGCAAAGAA GATGTGCGAG TCTGCCTGAC CTATGCGATG
GAAGTTCGTC GCCGCGTGAA AGAGCAACTT AAAAAACTGG GCGGTCTGGA GTTCTTCGAT
GTGAACTTTA GCTACATCGA CAACGAAACG CTGGAAGAGT TTTTTGTGAG CGTACCGGAA
CAGGGCGGCA GCGAACTTAT TCCTGCCGGA ATGCCAAAGC CGGGTGTTGT GCATCTGGTC
ACTCAGGCAG AAAGCGGCAT GACCGGGCTG TATCGTTTTG AAACACAGAT GACTGCCGGT
AATGGTAAGC ATAGTGTATC GGGTCTGGGT TCAAATACCT CCGCGAAAGA AGCTATCCGC
GTCGGTTTCG ATTACTTCAA AGGCAATTTG AATCGGGTAA GCGCGGCCGC GAAATTCTCC
GATCATGAAT ATCACCTTCA TGTCGTTGAA CTGCATAATA CTGGCCCAAG CACCGCAACC
AGTCTTGCTG CGCTTATCGC TTTATGTTCG ATATTGCTGG CAAAACCGGT GCAGGAACAG
ATGGTGGTGT TGGGCAGTAT GACGCTTGGT GGGGTAATTA ACCCGGTGCA GGATCTTGCC
GCCAGTTTAC AGCTCGCCTT CGACAGCGGT GCAAAACGGG TTCTGTTGCC GATGTCCTCG
GCTATGGATA TTCCAACGGT TCCGGCAGAG TTATTTACCA AGTTTCAGGT GAGTTTTTAC
TCAGACCCGG TTGATGCTGT TTATAAGGCG CTGGGTGTGA ATTAA
 
Protein sequence
MQTHHDLPVS GVSAGEIASE GYDLDALLNQ HFAGRVVRKD LTKQLKEGAN VPVYVLEYLL 
GMYCASDDDD VVEQGLQNVK RILADNYVRP DEAEKVKSLI RERGSYKIID KVSVKLNQKK
DVYEAQLSNL GIKDALVPSQ MVKDNEKLLT GGIWCMITVN YFFEEGQKTS PFSLMTLKPI
QMPNMDMEEV FDARKHFNRD QWIDVLLRSV GMEPANIEQR TKWHLITRMI PFVENNYNVC
ELGPRGTGKS HVYKECSPNS LLVSGGQTTV ANLFYNMASR QIGLVGMWDV VAFDEVAGIT
FKDKDGVQIM KDYMASGSFS RGRDSIEGKA SMVFVGNINQ SVETLVKTSH LLAPFPAAMI
DTAFFDRFHA YIPGWEIPKM RPEFFTNRYG LITDYLAEYM REMRKRSFSD AIDKFFKLGN
NLNQRDVIAV RRTVSGLLKL MHPDGAYSKE DVRVCLTYAM EVRRRVKEQL KKLGGLEFFD
VNFSYIDNET LEEFFVSVPE QGGSELIPAG MPKPGVVHLV TQAESGMTGL YRFETQMTAG
NGKHSVSGLG SNTSAKEAIR VGFDYFKGNL NRVSAAAKFS DHEYHLHVVE LHNTGPSTAT
SLAALIALCS ILLAKPVQEQ MVVLGSMTLG GVINPVQDLA ASLQLAFDSG AKRVLLPMSS
AMDIPTVPAE LFTKFQVSFY SDPVDAVYKA LGVN