Gene Ent638_1399 details


Gene Information

Locus tag: Ent638_1399
Symbol: clpA
ID: 5114364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 1531051
End bp: 1533330
Gene Length: 2280 bp
Protein Length: 759 aa
Translation table: 11
GC content: 56%
IMG OID: 640491586
Product: ATP-dependent Clp protease ATP-binding subunit
Protein accession: YP_001176131
Protein GI: 146311057
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID: [TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA
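
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the terminal stop codon is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1531051
end_bp = 1533330
gene_length_bp = 2280
protein_length_aa = 759

# Assumption: coordinates are inclusive on both ends,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the last codon is the stop
# codon and does not contribute a residue to the protein.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print(span_bp, remainder, coding_codons)  # prints "2280 0 759"
```

Under these assumptions the span matches the reported gene length, the length is a whole number of codons, and the codon count minus the stop matches the reported protein length.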


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.103795
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCAATC AAGAACTGGA ACTCAGTTTA AACATGGCTT TCGCCAGAGC GCGTGAGCAC 
CGACATGAGT TTATGACTGT CGAGCACCTG CTTCTGGCAC TGCTCAGCAA CCCATCTGCC
CGTGAAGCGC TGGAAGCGTG TTCTGTGGAT CTGGTGGCGT TACGTCAGGA ACTCGAAGCC
TTCATCGAAC AAACGACACC GGTGCTGCCC GTCAGTGAAG AGGAGCGCGA CACTCAGCCG
ACGCTCAGCT TCCAGCGCGT ATTGCAGCGC GCGGTGTTCC ACGTCCAGTC TTCTGGTCGT
AACGAAGTGA CCGGCGCAAA CGTGTTGGTT GCCATCTTCA GTGAGCAGGA GTCGCAAGCG
GCGTATTTGC TGCGCAAACA CGAAGTCAGC CGACTCGACG TGGTCAACTT TATCTCTCAC
GGAACGCGAA AAGACGAGCC GAATCAGGCG TCAGATTCCA GCAGTCAGGC GAGCAATCCT
GAAGAGCAAG CAGGCGGGGA GGATCGTATG GAAAACTTCA CCACCAACCT TAATCAGCTT
GCTCGCGTTG GCGGAATCGA TCCGCTGATA GGCCGCGACA AAGAGCTGGA ACGCGCGATT
CAGGTGCTGT GCCGTCGCCG CAAAAACAAC CCGCTGCTGG TGGGTGAATC GGGCGTGGGT
AAAACCGCGA TTGCCGAAGG GCTGGCCTGG CGTATCGTGC AGGGCGACGT TCCGGAAATC
ATGTCAGATT GCACCATCTA CTCGCTGGAT ATCGGTTCAC TGCTGGCGGG TACAAAATAT
CGCGGTGATT TCGAAAAACG TTTCAAAGCC TTGCTGAAAC AGCTGGAACA AGACACCAGC
AGCATCCTGT TTATCGATGA AATCCATACC ATTATCGGTG CGGGCGCGGC GTCTGGCGGC
CAGGTGGATG CGGCTAACCT GATTAAACCG CTGCTGTCGA GCGGTAAGAT CCGCGTGATG
GGCTCGACGA CGTACCAGGA ATTCAGCAAC ATTTTCGAGA AAGACCGTGC ATTAGCGCGT
CGCTTCCAGA AAATTGATAT TACTGAGCCG TCCGTTGAAG AAACGGTCCA GATCATCAAC
GGCCTGAAAC CGAAGTACGA AGCGCACCAC GACGTGCGTT ATACCGCGAA AGCGGTGCGT
GCGGCAGTGG AGCTGGCGGT GAAATACATT AATGATCGTC ATCTGCCGGA TAAAGCGATT
GATGTGATCG ACGAAGCGGG CGCGCGTGCG CGTCTGATGC CGATCAGCAA GCGCAAGAAA
ACCGTCAACG TAGCGGACAT TGAATCCGTG GTGGCGCGTA TTGCGCGTAT CCCTGAGAAG
AGCGTTTCTC AGAGCGATCG CGACACGCTG CGTACCCTCG GCAATCGCCT GAAAATGCTG
GTCTTCGGGC AGGATAAAGC GATTGAAGCT CTAACCGAAG CCATCAAAAT GGCGCGTGCG
GGACTGGGCC ACGAACATAA ACCCGTCGGT TCCTTCCTGT TCGCCGGTCC GACCGGTGTG
GGGAAAACCG AGGTGACGGT TCAGCTTTCC AAAGCTCTCG GCATTGAACT GCTGCGTTTC
GATATGTCCG AGTATATGGA GCGCCATACC GTTAGCCGTT TGATCGGTGC GCCTCCGGGA
TACGTGGGCT TTGATCAGGG CGGTTTGCTT ACCGATGCGG TGATCAAACA TCCACATGCG
GTGCTGCTGC TTGATGAAAT CGAGAAAGCG CACCCGGACG TGTTCAATAT CCTGCTGCAG
GTGATGGACA ACGGCACGCT GACCGATAAC AACGGGCGCA AAGCGGACTT CCGCAATGTG
GTGCTGGTGA TGACCACCAA CGCCGGTGTG CGTGAAACCG AGCGTAAATC AATCGGCCTG
ATCCACCAGG ATAACAGCAC CGATGCGATG GAAGAGATCA AGAAGATCTT TACGCCGGAA
TTCCGTAACC GTCTGGATAA CATTATCTGG TTCGATCACT TGTCTACCGA GGTGATTCAC
CAGGTTGTGG ACAAGTTCAT CGTCGAGTTG CAGGTTCAGC TTGATCAGAA AGGCGTATCG
CTGGAAGTGA GCCAGGAGGC CCGCAACTGG CTGGCCGAGA AAGGCTACGA CCGTGCGATG
GGCGCGCGTC CAATGGCGCG AGCCATTCAG GATAACCTGA AAAAACCGCT GGCTAACGAA
CTGCTGTTTG GTTCTTTAGT GGACGGCGGG CAGGTGACTG TGGCGCTGGA TCAGGCGAAG
GGCGAACTGA CGTATGACTT CCAGAGTGCG GCGAAGCACA AGCCGGAAGC AGCACACTAA
 
Protein sequence
MLNQELELSL NMAFARAREH RHEFMTVEHL LLALLSNPSA REALEACSVD LVALRQELEA 
FIEQTTPVLP VSEEERDTQP TLSFQRVLQR AVFHVQSSGR NEVTGANVLV AIFSEQESQA
AYLLRKHEVS RLDVVNFISH GTRKDEPNQA SDSSSQASNP EEQAGGEDRM ENFTTNLNQL
ARVGGIDPLI GRDKELERAI QVLCRRRKNN PLLVGESGVG KTAIAEGLAW RIVQGDVPEI
MSDCTIYSLD IGSLLAGTKY RGDFEKRFKA LLKQLEQDTS SILFIDEIHT IIGAGAASGG
QVDAANLIKP LLSSGKIRVM GSTTYQEFSN IFEKDRALAR RFQKIDITEP SVEETVQIIN
GLKPKYEAHH DVRYTAKAVR AAVELAVKYI NDRHLPDKAI DVIDEAGARA RLMPISKRKK
TVNVADIESV VARIARIPEK SVSQSDRDTL RTLGNRLKML VFGQDKAIEA LTEAIKMARA
GLGHEHKPVG SFLFAGPTGV GKTEVTVQLS KALGIELLRF DMSEYMERHT VSRLIGAPPG
YVGFDQGGLL TDAVIKHPHA VLLLDEIEKA HPDVFNILLQ VMDNGTLTDN NGRKADFRNV
VLVMTTNAGV RETERKSIGL IHQDNSTDAM EEIKKIFTPE FRNRLDNIIW FDHLSTEVIH
QVVDKFIVEL QVQLDQKGVS LEVSQEARNW LAEKGYDRAM GARPMARAIQ DNLKKPLANE
LLFGSLVDGG QVTVALDQAK GELTYDFQSA AKHKPEAAH
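
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes only the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which codons may act as start codons). The helper `translate` and the constant names are hypothetical and written for this example; the two input strings are the first and last lines of the gene sequence above, and each 60-base line starts in frame because 60 is a multiple of 3.

```python
from itertools import product

# Codon table in NCBI order: bases enumerated as T, C, A, G,
# with the first base varying slowest and the third fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    Codons with ambiguous bases such as N raise KeyError in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGCTCAATC AAGAACTGGA ACTCAGTTTA AACATGGCTT TCGCCAGAGC GCGTGAGCAC"
last_line = "GGCGAACTGA CGTATGACTT CCAGAGTGCG GCGAAGCACA AGCCGGAAGC AGCACACTAA"

print(translate(first_line))  # MLNQELELSLNMAFARAREH
print(translate(last_line))   # GELTYDFQSAAKHKPEAAH
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final 19 residues, with the TAA stop codon ending translation.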