Gene ECH74115_0237 details


Gene Information

Locus tag: ECH74115_0237
Symbol: clpV
ID: 6972231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 251110
End bp: 253881
Gene Length: 2772 bp
Protein Length: 923 aa
Translation table: 11
GC content: 60%
IMG OID: 643384308
Product: type VI secretion ATPase, ClpV1 family
Protein accession: YP_002268824
Protein GI: 209397886
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID: [TIGR03345] type VI secretion ATPase, ClpV1 family
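
The reported lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Number of residues, assuming the last codon is an untranslated stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Coordinates copied from the record above.
length_bp = gene_length(251110, 253881)
print(length_bp, protein_length(length_bp))  # 2772 923
```

Both values agree with the Gene Length and Protein Length fields of the record.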


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCAGA TTGATCTTCC CACGCTGGTA AAACGGCTGA ACCTGTTCTC CCGCCAGGCG 
CTGGAGATGG CCGCCTCAGA ATGTATGAGC CAGCAGGCAG CGGAAATTAC CGTCAGCCAT
GTGCTTATTC AGATGCTCGC CATGCCACGC AGTGACCTGC GGGTTATTAC CCGCCAGGGC
GATATTGGCA TGGAAGAGTT GCGCCAGGCG TTGACGGTAG AGAACTACAC AACCGCCCGT
TCTGCGGACA GCTACCCGGC GTTTTCCCCG ATGCTGGTTG AGTGGCTTAA AGAGGGCTGG
CTGCTGGCGT CGGCTGAGAT GCAGCACAGC GAACTGCGCG GCGGCGTGTT GCTGCTGGCC
TTGCTGCATT CGCCGCTGCG TTATATACCG CCTGCTGCCG CCCGGCTGTT GACCGGCATT
AACCGTGACC GTCTGCAACA GGACTTTGTG CAGTGGACAC AGGAGTCGGC GGAATCAGTC
GTGCCGGATG CAGACGGTAA AGGCGCAGGC ACACTGACGG ACGCCTCTGA CACCCTGCTT
GCCCGCTACG CCAAAAACAT GACAGAAGAC GCCCGTAACG GCAGGCTAGC CCCAGTACTG
TGCCGCGACC ACGAAATCGA CCTGATGATC GACATTCTCT GCCGCCGCCG TAAAAACAAC
CCGGTGGTGG TGGGCGAAGC AGGCGTGGGT AAAAGCGCGC TGATTGAAGG GCTGGCGCTG
CGCATCGTGG CAGGCCAGGT GCCGGACAAA CTGAAAAACA CCGATATCAT GACCCTTGAC
CTGGGCGCAT TGCAGGCCGG GGCATCGGTG AAGGGTGAGT TTGAAAAACG CTTCAAGGGG
CTGATGGCGG AGGTCATTTC CTCTCCGGTG CCGGTCATTC TGTTTATCGA CGAAGCGCAT
ACCCTGATTG GCGCGGGCAA CCAGCAGGGC GGGCTGGATA TCTCCAACCT GCTGAAACCG
GCGCTGGCGC GTGGCGAACT GAAAACCATC GCCGCCACCA CCTGGAGCGA GTACAAAAAA
TACTTCGAGA AAGACGCCGC CCTGTCGCGC CGCTTCCAGT TAGTGAAGGT CAGCGAACCC
AACGCTGCCG AAGCCACCAT TATTCTGCGT GGTCTGTCGG CGGTCTATGA ACGGTCTCAC
GGCGTGCTGA TTGACGATGA TGCCTTGCAG GCCGCAGCAA CGCTGAGCGA ACGTTATCTC
TCCGGGCGTC AGTTACCGGA CAAAGCGATT GATGTGCTGG ATACCGCCTG CGCCCGTGTG
GCCATCAACC TGTCGTCGCC GCCGAAGCAA ATCTCGGCGC TGACCACTCT GAGCCACCAG
CAGGAGGCGG AAATTCGCCA GCTTGAGCGC GAGCTTCGCA TCGGACTGCG TACCGACACA
TCACGGATGA CCGAGGTGCT GGTGCAGTAT GATGAAACGC TGACGGCGCT GGATGAACTG
GAAGCGGCTT GGCACCAGCA GCAGACGCTG GTCCGGGAGA TTATCGCGCT GCGCCAGCAG
TTACTGGGCG TGGCAGAGGA CGATGCGGCG CCGTTGCCGG ACGCAGATAC TGTGGAGGAT
ACGCAGCCAG AGTCAGAGTC AGAGTCAGAA CAGGATAATA CCGGTGCCGT ACCGGCTGAT
GAGACCGACA GAGAACAGCC GGAAGAGACC GCTGAAACAG TTTCCCCGGT ACAGCGGCTG
GCCCAGCTCA CTGCCGAACT GGACGCCCTG CATAACGACC GGTTGCTGGT CTCCCCGCAC
GTCGATAAAA AACAGATTGC GGCGGTGATT GCCGAATGGA CCGGCGTGCC GCTCAACCGC
CTGTCGCAGA ATGAAATGTC GGTCATCACC GACCTGCCGA AATGGCTGGG CGACACCATC
AAAGGCCAGG ACCTGGCGAT TGCCAGCCTG CATAAGCATC TACTGACCGC ACGCGCCGAC
CTGCGTCGTC CGGGACGCCC GCTCGGTGCG TTCCTGCTCG CTGGCCCCAG CGGCGTGGGT
AAAACCGAAA CCGTCCTGCA ACTGGCAGAG CTGCTCTACG GCGGTCGCCA GTACCTGACC
ACCATCAATA TGTCCGAGTT CCAGGAGAAA CACACCGTTT CGCGGCTGAT TGGCTCCCCT
CCGGGCTATG TCGGCTATGG CGAAGGCGGC GTGCTGACTG AAGCCATCCG CCAGAAACCC
TACTCGGTGG TGCTGCTCGA TGAAGTGGAA AAAGCGCACC CGGATGTCCT CAACCTGTTC
TACCAGGCGT TTGACAAGGG CGAGATGGCC GACGGCGAAG GCCGCCTGAT TGACTGTAAA
AACATCGTCT TCTTCCTCAC CTCCAACCTC GGTTACCAGG TGATTGTCGA ACACGCGGAT
GACCCGGAAA CCATGCAGGA AGCCCTCTAT CCGGTGCTGG CGGACTTCTT TAAACCTGCC
CTGCTGGCGC GAATGGAGGT GGTGCCGTAC CTGCCGCTGT CGAAAGAGAC GCTCGCCACC
ATTATCGCCG GGAAACTGGC CCGTCTGGAT AACGTGCTGC GCAGTCGCTT TGGTGCAGAA
GTGGTCATTG AACCGGAAGT GACGGACGAA ATCATGAGCC GCGTCACCCG CGCGGAAAAC
GGCGCGAGGA TGCTGGAATC TGTCATCGAT GGCGACATGC TACCGCCGCT CTCGCTGCTG
CTGTTGCAGA AAATGGCGGC TAACACGGCG ATTGCCCGGA TTCGGTTGTC GGCAGTGGAC
GGCGCATTTA CGGCAGACGT GGAAGATGCT CAGAACGACG AGTCCGTCAC AAAGGATGAA
ACGGTTTTAT GA
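
The gene sequence is listed in space-separated blocks of ten bases, so it has to be joined before any computation. A minimal sketch, with hypothetical helper names, that strips the whitespace and computes the GC fraction reported in the GC content field:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short example.
first_line = "ATGATCCAGA TTGATCTTCC CACGCTGGTA AAACGGCTGA ACCTGTTCTC CCGCCAGGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two functions to the full listing would give the value to compare with the GC content field.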
 
Protein sequence
MIQIDLPTLV KRLNLFSRQA LEMAASECMS QQAAEITVSH VLIQMLAMPR SDLRVITRQG 
DIGMEELRQA LTVENYTTAR SADSYPAFSP MLVEWLKEGW LLASAEMQHS ELRGGVLLLA
LLHSPLRYIP PAAARLLTGI NRDRLQQDFV QWTQESAESV VPDADGKGAG TLTDASDTLL
ARYAKNMTED ARNGRLAPVL CRDHEIDLMI DILCRRRKNN PVVVGEAGVG KSALIEGLAL
RIVAGQVPDK LKNTDIMTLD LGALQAGASV KGEFEKRFKG LMAEVISSPV PVILFIDEAH
TLIGAGNQQG GLDISNLLKP ALARGELKTI AATTWSEYKK YFEKDAALSR RFQLVKVSEP
NAAEATIILR GLSAVYERSH GVLIDDDALQ AAATLSERYL SGRQLPDKAI DVLDTACARV
AINLSSPPKQ ISALTTLSHQ QEAEIRQLER ELRIGLRTDT SRMTEVLVQY DETLTALDEL
EAAWHQQQTL VREIIALRQQ LLGVAEDDAA PLPDADTVED TQPESESESE QDNTGAVPAD
ETDREQPEET AETVSPVQRL AQLTAELDAL HNDRLLVSPH VDKKQIAAVI AEWTGVPLNR
LSQNEMSVIT DLPKWLGDTI KGQDLAIASL HKHLLTARAD LRRPGRPLGA FLLAGPSGVG
KTETVLQLAE LLYGGRQYLT TINMSEFQEK HTVSRLIGSP PGYVGYGEGG VLTEAIRQKP
YSVVLLDEVE KAHPDVLNLF YQAFDKGEMA DGEGRLIDCK NIVFFLTSNL GYQVIVEHAD
DPETMQEALY PVLADFFKPA LLARMEVVPY LPLSKETLAT IIAGKLARLD NVLRSRFGAE
VVIEPEVTDE IMSRVTRAEN GARMLESVID GDMLPPLSLL LLQKMAANTA IARIRLSAVD
GAFTADVEDA QNDESVTKDE TVL
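
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below uses the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs in which codons may act as start codons, and that does not matter here because the gene starts with ATG. The names are illustrative, and only the first and last lines of the gene sequence are used as examples.

```python
from itertools import product

# Standard codon assignments in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA listing codon by codon, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGATCCAGA TTGATCTTCC CACGCTGGTA AAACGGCTGA ACCTGTTCTC CCGCCAGGCG"
print(translate(first_line))       # MIQIDLPTLVKRLNLFSRQA
print(translate("ACGGTTTTAT GA"))  # TVL
```

The two outputs match the first twenty and the last three residues of the protein sequence listed above.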