Gene ECH_0997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0997 
SymbolhslU 
ID3927521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp1021414 
End bp1022877 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content33% 
IMG OID637902113 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_507784 
Protein GI88657724 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGTTA CACCAAATAA TAAACTCAAA TTGAATAATG ATACAACAAA TAATATAAAT 
GATGAACAAG CTAGTAGTGA AGTACTAAAT AGTGAAAACG CTATTAGCAC AGAAGGATAT
GAAGATGACA TAAATCTAGA TGACTTATAC AACCCACAAG AATTAACTCC TCAGCAAATT
ACACAGGAAC TAGATAGGTT TATTATAGGA CAAGCAGACG CAAAACGTGC TGTAGCCATT
GCCTTAAGAA ATCGCTGGCG TCGTAACAGG GTACCTGAAC CATTAAGAGA AGAGATTATC
CCTAAAAACA TCCTAATGAT AGGACATACA GGTATTGGTA AAACAGAAAT AGCTCGCAGG
TTAGCAAAAC TTGCAAAAGC ACCTTTCATA AAAGTTGAAG CTACAAAATT TACTGAAATA
GGATATGTAG GGAGAGATGT AGACTCTATT ATACGTGACT TAGTTGATGT AGCAATCAAT
CTTGAAAAAG AAAAAAGTCG TAAATTTGTA GAGACAAAAG CAAAATCTTT AGCAGAAAAT
ATAATTCTTG AAGCACTGGT AGGAGCTGAT GCAAGTCAAG AGACAAAAAC TATTTTTCAA
GAAAAGCTAA GAAATGGTGA ATTTGAAAAT TTTGAGATCT CCATATCCAT AAAGGAAAGT
AAAAATGCAA TCCCTTCTAT TGATATTCCA AATATTCCAG GAAATCAAGT TGGCATTATG
AATATCAATG AGATTGTACA TAAAATGCTA GGAAATAACA AACAACTTAA GACTATAAAA
GTTACTGTAA AAGAGGCAAG AGAACTGCTA ATTAATGAAG AAAGCGAAAA ATTAATGGAT
GAAGATAAAA TCATCAAAGA CGCTCTTTTG TTAGCAAGTA ACGACGGCAT CGTATTTCTA
GATGAAATAG ACAAAATTGC AGCTCGTACA GAAATCAGAG GAGAAGTGAA CAGAGAAGGT
GTACAAAGAG ATCTTTTACC ATTACTTGAA GGAACAAGTG TAACAACAAA GTATGGTACT
ATTACAACCG ATCACATTTT ATTCATAGCA TCTGGAGCAT TTCACTTGGC TAAACCTTCT
GACTTATTAC CTGAGTTACA AGGACGTCTT CCTATACGAG TAGAACTGAA ACCGCTTAGT
AAAGATGATT TAGTACGAAT TTTAACCGAA CCAGAATCAA GCTTATTAAA GCAATACTGC
GCATTAATGA AAACAGAAAA TATTACTATT GACTTTACTG ATGAGGGAGT ATCTACTATA
GCTGAAATAG CATCTACAGT TAACAGAGAG GTAGAAAATA TTGGAGCTCG TAGATTACAT
ACCATTTTAG AAAAGCTAAT GGAAGATATC AGTTATACTG CAACAGAAAA TAGTGGTAGG
ACATATGTGA TAGATAGCGA ATATGTAAAG AAAAAGCTAG AAGACATTGC AAAACAATTA
GATTTATCAA AATTTATATT ATAG
 
Protein sequence
MFVTPNNKLK LNNDTTNNIN DEQASSEVLN SENAISTEGY EDDINLDDLY NPQELTPQQI 
TQELDRFIIG QADAKRAVAI ALRNRWRRNR VPEPLREEII PKNILMIGHT GIGKTEIARR
LAKLAKAPFI KVEATKFTEI GYVGRDVDSI IRDLVDVAIN LEKEKSRKFV ETKAKSLAEN
IILEALVGAD ASQETKTIFQ EKLRNGEFEN FEISISIKES KNAIPSIDIP NIPGNQVGIM
NINEIVHKML GNNKQLKTIK VTVKEARELL INEESEKLMD EDKIIKDALL LASNDGIVFL
DEIDKIAART EIRGEVNREG VQRDLLPLLE GTSVTTKYGT ITTDHILFIA SGAFHLAKPS
DLLPELQGRL PIRVELKPLS KDDLVRILTE PESSLLKQYC ALMKTENITI DFTDEGVSTI
AEIASTVNRE VENIGARRLH TILEKLMEDI SYTATENSGR TYVIDSEYVK KKLEDIAKQL
DLSKFIL