Gene EcHS_A0229 details


Gene Information

Locus tag: EcHS_A0229
Symbol: clpV
ID: 5592204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 246871
End bp: 249642
Gene Length: 2772 bp
Protein Length: 923 aa
Translation table: 11
GC content: 59%
IMG OID: 640919416
Product: type VI secretion ATPase
Protein accession: YP_001457003
Protein GI: 157159685
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID: [TIGR03345] type VI secretion ATPase, ClpV1 family
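
The start and end coordinates, gene length, and protein length above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the reported 2772 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(246871, 249642)
print(length)                  # 2772
print(protein_length(length))  # 923
```

Here 2772 bp corresponds to 924 codons, of which the last is the stop codon, leaving 923 amino acids.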


Plasmid Coverage information

Num covering plasmid clones: 66
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCCAGA TTGATCTTCC CACGCTGGTA AAACGGCTTA ACCTGTTCTC CCGCCAGGCG 
CTGGAGATGG CCGCCTCTGA ATGTATGAGC CAGCAGGCAG CGGAAATTAC CGTCAGCCAT
GTGCTTATTC AGATGCTCGC CATGCCACGC AGTGACCTGC GGGTTATTAC CCGCCAGGGC
GATATTGGCA TGGAAGAGTT GCGCCAGGCG CTGACGGTGG AGAACTACAC AACTGCCCGT
TCTGCGGACA GCTACCCGGC ATTTTCCCCG ATGCTGGTTG AGTGGCTGAA AGAGGGCTGG
CTGCTGGCGT CGGCTGAGAT GCAGCACAGC GAATTGCGTG GCGGCGTGTT GCTGCTGGCC
CTGCTGCATT CGCCGCTGCG TTATATACCG CCTGCTGCCG CCCGGCTGTT GACCGGCATT
AACCGTGACC GTCTGCAACA GGACTTTGTG CAATGGACAC AGGAGTCGGC GGAATCAGTC
GTGCCGGATG CAGACGGTAA AGGCGCAGGC ACACTGACGG ACGCTGCTGA CACCCTGCTT
GCCCGCTACG CCAAAAACAT GACCGCAGAC GCCCGTAACG GCAGGCTTGA CCCGGTACTG
TGCCGCGACC ACGAAATCGA CCTGATGATC GACATTCTCT GCCGCCGCCG CAAGAACAAC
CCGGTGGTGG TGGGCGAAGC AGGCGTGGGT AAAAGCGCCT TAATCGAAGG GCTGGCGCTG
CGCATCGTGG CAGGCCATGT GCCGGACAAG CTGAAAAACA CCGATATCAT GACCCTTGAC
TTGGGCGCAT TGCAGGCCGG AGCGTCGGTG AAGGGTGAAT TCGAAAAACG TTTCAAAGGG
CTGATGGCGG AGGTCATTTC CTCCCCGGTG CCGGTCATTC TGTTTATCGA CGAAGCACAT
ACCCTGATTG GCGCGGGCAA CCAGCAGGGC GGGCTGGATA TCTCCAACCT GCTCAAACCG
GCGCTGGCGC GCGGCGAGCT GAAAACCATC GCCGCCACCA CCTGGAGCGA GTACAAAAAA
TACTTCGAAA AAGATGCCGC CCTGTCGCGC CGCTTCCAGT TGGTGAAGGT CAGCGAACCC
AACGCTGCCG AAGCCACCAT TATTCTGCGC GGTCTGTCGG CGGTCTATGA ACAGTCTCAC
GGCGTGCTGA TTGATGATGA CGCCTTGCAG GCCGCTGCGA CATTAAGCGA GCGTTATCTC
TCCGGGCGTC AGTTACCGGA CAAAGCGATT GATGTGCTGG ATACCGCCTG CGCCCGTGTG
GCCATCAACC TGTCGTCGCC GCCGAAGCAA ATCTCGGCGC TGACCACTCT GAGCCACCAG
CAGGAGGCGG AAATTCGCCA GCTTGAGCGC GAGCTTCGCA TCGGACTGCG TACCGACACA
TCACGGATGA CCGAGGTGCT GGTGCAGTAT GATGAAACGC TGACGGCGCT GGATGAACTG
GAAGCGGCCT GGCACCAGCA GCAGACGCTG GTCCGGGAGA TTATTGCGCT GCGCCAGCAG
TTACTGGGCG TGGCAGAGGA CGATGCGGCG CCGTTGCCGG ACGCAGATAC CGTGGAGGAT
ACGCAGCCAG AGTCAGAGTC AGAGTCAGAG CAGGATAATA CCGGTGCCGA ACCGGCTGAT
GAAGCTGGCA GCGAACAGCC GAAAGAGACC GCTGAAACAG TTTCCCCGGT ACAACGTCTG
GCACATCTCA CTGCCGAACT GGACGCCCTG CATAACGACC GGTTGCTGGT TTCCCCGCAC
GTCGATAAAA AACAGATTGC GGCGGTGATT GCCGAATGGA CCGGCGTGCC GCTTAACCGC
CTGTCGCAGA ATGAAATGTC GGTCATCACC GACCTGCCAA AATGGCTCGG CGACACCATC
AAAGGCCAGG ACCTGGCGAT TGCCAGCCTG CACAAACACC TGCTGACCGC ACGCGCCGAC
CTGCGCCGTC CGGGACGCCC GCTGGGCGCG TTCCTGCTGG CTGGCCCCAG CGGCGTGGGT
AAAACTGAAA CCGTCCTGCA ACTGGCAGAG CTGCTCTACG GCGGTCGCCA GTACCTGACC
ACCATCAATA TGTCCGAGTT CCAGGAGAAA CATACCGTCT CGCGACTGAT TGGTTCGCCT
CCGGGCTACG TTGGCTACGG TGAAGGCGGC GTACTGACCG AAGCGATTCG CCAGAAACCC
TACTCGGTAG TACTGCTCGA TGAAGTGGAA AAAGCGCACC CGGATGTGCT CAACCTGTTC
TACCAGGCGT TCGATAAGGG CGAAATGGCA GACGGTGAAG GCCGCCTGAT TGACTGCAAA
AATATCGTCT TCTTCCTGAC GTCCAACCTC GGCTACCAGG TAATAGTCGA GCATGCCGAT
GACCCGGAAA CCATGCAGGA AGTACTGTAT CCGGTGCTGG CCGACTTCTT CAAACCTGCC
CTGCTGGCGC GTATGGAAGT GGTGCCGTAC CTGCCGCTGT CGAAAGAGAC GCTCGCCACC
ATCATTGCCG GGAAACTGGC CCGCCTGGAT AACGTGCTGC GCAGCCGCTT TGGCGCGGAG
GTGATTATAG AACCGGAAGT GACGGACGAA ATCATGAGCC GCGTCACCCG CGCGGAAAAC
GGCGCGAGGA TGCTGGAATC GGTCATCGAC GGCAATATGC TGCCGCCGCT CTCCCTGCTG
CTGTTGCAGA AAATGGCGGC GAATACGGCG GTTGCCCGGA TTCGGTTGTC GGCAGTGGAC
GGCGCATTTA CGGCAGACGT GGAAGATGCT CAGAACGACG AGTCCGTCAC AAAGGATGAA
ACGGTTTTAT GA
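
The GC content reported in the gene information can in principle be recomputed from the sequence above. A minimal sketch, assuming the sequence is pasted as text in the same 10-base grouped layout; `gc_fraction` is a hypothetical helper name, and the example input is only the first two groups of the gene, not the full sequence.

```python
def gc_fraction(grouped_sequence: str) -> float:
    """Fraction of G and C bases in a whitespace-grouped DNA sequence."""
    # Remove the spaces and line breaks used to group bases in blocks of 10.
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base groups of the gene sequence: 8 of 20 bases are G or C.
print(gc_fraction("ATGATCCAGA TTGATCTTCC"))  # 0.4
```

Passing the full 2772 bp gene sequence to the same function would give the whole-gene value to compare against the reported figure.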
 
Protein sequence
MIQIDLPTLV KRLNLFSRQA LEMAASECMS QQAAEITVSH VLIQMLAMPR SDLRVITRQG 
DIGMEELRQA LTVENYTTAR SADSYPAFSP MLVEWLKEGW LLASAEMQHS ELRGGVLLLA
LLHSPLRYIP PAAARLLTGI NRDRLQQDFV QWTQESAESV VPDADGKGAG TLTDAADTLL
ARYAKNMTAD ARNGRLDPVL CRDHEIDLMI DILCRRRKNN PVVVGEAGVG KSALIEGLAL
RIVAGHVPDK LKNTDIMTLD LGALQAGASV KGEFEKRFKG LMAEVISSPV PVILFIDEAH
TLIGAGNQQG GLDISNLLKP ALARGELKTI AATTWSEYKK YFEKDAALSR RFQLVKVSEP
NAAEATIILR GLSAVYEQSH GVLIDDDALQ AAATLSERYL SGRQLPDKAI DVLDTACARV
AINLSSPPKQ ISALTTLSHQ QEAEIRQLER ELRIGLRTDT SRMTEVLVQY DETLTALDEL
EAAWHQQQTL VREIIALRQQ LLGVAEDDAA PLPDADTVED TQPESESESE QDNTGAEPAD
EAGSEQPKET AETVSPVQRL AHLTAELDAL HNDRLLVSPH VDKKQIAAVI AEWTGVPLNR
LSQNEMSVIT DLPKWLGDTI KGQDLAIASL HKHLLTARAD LRRPGRPLGA FLLAGPSGVG
KTETVLQLAE LLYGGRQYLT TINMSEFQEK HTVSRLIGSP PGYVGYGEGG VLTEAIRQKP
YSVVLLDEVE KAHPDVLNLF YQAFDKGEMA DGEGRLIDCK NIVFFLTSNL GYQVIVEHAD
DPETMQEVLY PVLADFFKPA LLARMEVVPY LPLSKETLAT IIAGKLARLD NVLRSRFGAE
VIIEPEVTDE IMSRVTRAEN GARMLESVID GNMLPPLSLL LLQKMAANTA VARIRLSAVD
GAFTADVEDA QNDESVTKDE TVL
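
The protein sequence above follows from the gene sequence by codon-by-codon translation. The sketch below builds the codon table by hand rather than using a library; it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). `translate` and `CODON_TABLE` are hypothetical names chosen for illustration.

```python
from itertools import product

# Codon-to-residue mapping in the conventional TCAG ordering; "*" marks stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped_sequence: str) -> str:
    """Translate a whitespace-grouped DNA sequence, stopping at the first stop codon."""
    dna = "".join(grouped_sequence.split()).upper()
    residues = []
    # Walk the sequence in complete codons; a trailing partial codon is ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGATCCAGA TTGATCTTCC CACGCTGGTA"))  # MIQIDLPTLV
# The last 12 bases end in the TGA stop codon.
print(translate("ACGGTTTTAT GA"))  # TVL
```

Both outputs match the corresponding ends of the protein sequence listed above.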