Gene Information |
Locus tag | ECH74115_0237 |
Symbol | clpV |
ID | 6972231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 251110 |
End bp | 253881 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643384308 |
Product | type VI secretion ATPase, ClpV1 family |
Protein accession | YP_002268824 |
Protein GI | 209397886 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | [TIGR03345] type VI secretion ATPase, ClpV1 family |
|
|
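The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes a single terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(251110, 253881)
print(length, protein_length_aa(length))  # prints: 2772 923
```

Under these assumptions the result matches the listed Gene Length (2772 bp) and Protein Length (923 aa).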
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCAGA TTGATCTTCC CACGCTGGTA AAACGGCTGA ACCTGTTCTC CCGCCAGGCG CTGGAGATGG CCGCCTCAGA ATGTATGAGC CAGCAGGCAG CGGAAATTAC CGTCAGCCAT GTGCTTATTC AGATGCTCGC CATGCCACGC AGTGACCTGC GGGTTATTAC CCGCCAGGGC GATATTGGCA TGGAAGAGTT GCGCCAGGCG TTGACGGTAG AGAACTACAC AACCGCCCGT TCTGCGGACA GCTACCCGGC GTTTTCCCCG ATGCTGGTTG AGTGGCTTAA AGAGGGCTGG CTGCTGGCGT CGGCTGAGAT GCAGCACAGC GAACTGCGCG GCGGCGTGTT GCTGCTGGCC TTGCTGCATT CGCCGCTGCG TTATATACCG CCTGCTGCCG CCCGGCTGTT GACCGGCATT AACCGTGACC GTCTGCAACA GGACTTTGTG CAGTGGACAC AGGAGTCGGC GGAATCAGTC GTGCCGGATG CAGACGGTAA AGGCGCAGGC ACACTGACGG ACGCCTCTGA CACCCTGCTT GCCCGCTACG CCAAAAACAT GACAGAAGAC GCCCGTAACG GCAGGCTAGC CCCAGTACTG TGCCGCGACC ACGAAATCGA CCTGATGATC GACATTCTCT GCCGCCGCCG TAAAAACAAC CCGGTGGTGG TGGGCGAAGC AGGCGTGGGT AAAAGCGCGC TGATTGAAGG GCTGGCGCTG CGCATCGTGG CAGGCCAGGT GCCGGACAAA CTGAAAAACA CCGATATCAT GACCCTTGAC CTGGGCGCAT TGCAGGCCGG GGCATCGGTG AAGGGTGAGT TTGAAAAACG CTTCAAGGGG CTGATGGCGG AGGTCATTTC CTCTCCGGTG CCGGTCATTC TGTTTATCGA CGAAGCGCAT ACCCTGATTG GCGCGGGCAA CCAGCAGGGC GGGCTGGATA TCTCCAACCT GCTGAAACCG GCGCTGGCGC GTGGCGAACT GAAAACCATC GCCGCCACCA CCTGGAGCGA GTACAAAAAA TACTTCGAGA AAGACGCCGC CCTGTCGCGC CGCTTCCAGT TAGTGAAGGT CAGCGAACCC AACGCTGCCG AAGCCACCAT TATTCTGCGT GGTCTGTCGG CGGTCTATGA ACGGTCTCAC GGCGTGCTGA TTGACGATGA TGCCTTGCAG GCCGCAGCAA CGCTGAGCGA ACGTTATCTC TCCGGGCGTC AGTTACCGGA CAAAGCGATT GATGTGCTGG ATACCGCCTG CGCCCGTGTG GCCATCAACC TGTCGTCGCC GCCGAAGCAA ATCTCGGCGC TGACCACTCT GAGCCACCAG CAGGAGGCGG AAATTCGCCA GCTTGAGCGC GAGCTTCGCA TCGGACTGCG TACCGACACA TCACGGATGA CCGAGGTGCT GGTGCAGTAT GATGAAACGC TGACGGCGCT GGATGAACTG GAAGCGGCTT GGCACCAGCA GCAGACGCTG GTCCGGGAGA TTATCGCGCT GCGCCAGCAG TTACTGGGCG TGGCAGAGGA CGATGCGGCG CCGTTGCCGG ACGCAGATAC TGTGGAGGAT ACGCAGCCAG AGTCAGAGTC AGAGTCAGAA CAGGATAATA CCGGTGCCGT ACCGGCTGAT GAGACCGACA GAGAACAGCC GGAAGAGACC GCTGAAACAG TTTCCCCGGT ACAGCGGCTG GCCCAGCTCA CTGCCGAACT GGACGCCCTG CATAACGACC GGTTGCTGGT CTCCCCGCAC GTCGATAAAA AACAGATTGC GGCGGTGATT GCCGAATGGA CCGGCGTGCC GCTCAACCGC 
CTGTCGCAGA ATGAAATGTC GGTCATCACC GACCTGCCGA AATGGCTGGG CGACACCATC AAAGGCCAGG ACCTGGCGAT TGCCAGCCTG CATAAGCATC TACTGACCGC ACGCGCCGAC CTGCGTCGTC CGGGACGCCC GCTCGGTGCG TTCCTGCTCG CTGGCCCCAG CGGCGTGGGT AAAACCGAAA CCGTCCTGCA ACTGGCAGAG CTGCTCTACG GCGGTCGCCA GTACCTGACC ACCATCAATA TGTCCGAGTT CCAGGAGAAA CACACCGTTT CGCGGCTGAT TGGCTCCCCT CCGGGCTATG TCGGCTATGG CGAAGGCGGC GTGCTGACTG AAGCCATCCG CCAGAAACCC TACTCGGTGG TGCTGCTCGA TGAAGTGGAA AAAGCGCACC CGGATGTCCT CAACCTGTTC TACCAGGCGT TTGACAAGGG CGAGATGGCC GACGGCGAAG GCCGCCTGAT TGACTGTAAA AACATCGTCT TCTTCCTCAC CTCCAACCTC GGTTACCAGG TGATTGTCGA ACACGCGGAT GACCCGGAAA CCATGCAGGA AGCCCTCTAT CCGGTGCTGG CGGACTTCTT TAAACCTGCC CTGCTGGCGC GAATGGAGGT GGTGCCGTAC CTGCCGCTGT CGAAAGAGAC GCTCGCCACC ATTATCGCCG GGAAACTGGC CCGTCTGGAT AACGTGCTGC GCAGTCGCTT TGGTGCAGAA GTGGTCATTG AACCGGAAGT GACGGACGAA ATCATGAGCC GCGTCACCCG CGCGGAAAAC GGCGCGAGGA TGCTGGAATC TGTCATCGAT GGCGACATGC TACCGCCGCT CTCGCTGCTG CTGTTGCAGA AAATGGCGGC TAACACGGCG ATTGCCCGGA TTCGGTTGTC GGCAGTGGAC GGCGCATTTA CGGCAGACGT GGAAGATGCT CAGAACGACG AGTCCGTCAC AAAGGATGAA ACGGTTTTAT GA
|
Protein sequence | MIQIDLPTLV KRLNLFSRQA LEMAASECMS QQAAEITVSH VLIQMLAMPR SDLRVITRQG DIGMEELRQA LTVENYTTAR SADSYPAFSP MLVEWLKEGW LLASAEMQHS ELRGGVLLLA LLHSPLRYIP PAAARLLTGI NRDRLQQDFV QWTQESAESV VPDADGKGAG TLTDASDTLL ARYAKNMTED ARNGRLAPVL CRDHEIDLMI DILCRRRKNN PVVVGEAGVG KSALIEGLAL RIVAGQVPDK LKNTDIMTLD LGALQAGASV KGEFEKRFKG LMAEVISSPV PVILFIDEAH TLIGAGNQQG GLDISNLLKP ALARGELKTI AATTWSEYKK YFEKDAALSR RFQLVKVSEP NAAEATIILR GLSAVYERSH GVLIDDDALQ AAATLSERYL SGRQLPDKAI DVLDTACARV AINLSSPPKQ ISALTTLSHQ QEAEIRQLER ELRIGLRTDT SRMTEVLVQY DETLTALDEL EAAWHQQQTL VREIIALRQQ LLGVAEDDAA PLPDADTVED TQPESESESE QDNTGAVPAD ETDREQPEET AETVSPVQRL AQLTAELDAL HNDRLLVSPH VDKKQIAAVI AEWTGVPLNR LSQNEMSVIT DLPKWLGDTI KGQDLAIASL HKHLLTARAD LRRPGRPLGA FLLAGPSGVG KTETVLQLAE LLYGGRQYLT TINMSEFQEK HTVSRLIGSP PGYVGYGEGG VLTEAIRQKP YSVVLLDEVE KAHPDVLNLF YQAFDKGEMA DGEGRLIDCK NIVFFLTSNL GYQVIVEHAD DPETMQEALY PVLADFFKPA LLARMEVVPY LPLSKETLAT IIAGKLARLD NVLRSRFGAE VVIEPEVTDE IMSRVTRAEN GARMLESVID GDMLPPLSLL LLQKMAANTA IARIRLSAVD GAFTADVEDA QNDESVTKDE TVL
|
| |
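The sequences above are shown in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, estimate GC content, and translate the gene sequence for comparison with the protein sequence. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand). It also relies on translation table 11 assigning the same amino acids to codons as the standard code. The function names are hypothetical, and alternative start codons are not handled.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (standard code assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon and stop at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean_sequence(raw)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# The first three blocks of the gene sequence give the first ten residues.
print(translate("ATGATCCAGA TTGATCTTCC CACGCTGGTA"))  # prints: MIQIDLPTLV
```

Applying `gc_fraction` to the full gene sequence gives a value to compare against the listed GC content, and applying `translate` gives a string to compare against the listed protein sequence once `clean_sequence` has been applied to the latter.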