Gene Gdia_3083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3083 
SymbolhslU 
ID6976517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3374865 
End bp3376181 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content68% 
IMG OID643392591 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_002277428 
Protein GI209545199 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0175697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATTC CCAACCATTC CCCCCGTGAG ATCGTCTCGG AACTCGATCG CTTCATCATC 
GGCCAGACCG ATGCCAAGCG CGCGGTGGCC ATCGCGCTGC GCAACCGCTG GCGCCGGGCG
CAACTGGCCG ACGGGCTGCG CGACGAGGTG GTGCCGAAGA ACATCCTGAT GATCGGCCCG
ACCGGCTGCG GCAAGACCGA GATCGCCCGC CGCCTGGCCC GTCTGGCGCA GGCGCCGTTC
TTGAAGGTCG AGGCCACCAA GTTCACCGAG GTCGGCTATG TCGGCCGCGA TGTCGAAAGC
ATCGTGCGCG ACCTGGTCGA GGTGTCGCTG AACATGCTGC GCGACCTGCG CCGCCGCGAC
GTGCAGGCGC GCGCCGAACT GGCCGCCGAA AACCGGCTGG TGGACGCGCT GGTGGGCGAG
GGTGCCTCGG CCGACACCAA GGGCAAGTTC CGCCGCATGC TGCGCAATGG CGAGCTGGAG
GAGAAGGAGG TCGAAATCTC GATCGCCGAC ACCCAGGCAC CGGGCGGCAT GGGCGACATG
GGGAACATGA CGTCCGGTAC GGTCATCAAT TTCTCCGACA TGATGAAGGG ATTGATGAAC
CGCGTGCCGC AGCGCCGCCG CATGACCGTC GCCGCCGCGC GCGAGGCCCT GACGCGCGAG
GAAGCGGACA AGATGCTGGA CAACGACGCC CTGACCGGCG AGGCGGTGGC CCATGCCCAG
GATCACGGCA TCGTCTTCCT GGACGAGATC GACAAGGTCT GCGCCCGGTC GTCGGAAAGC
GGGTTCCGGG GCGGCGACGT CTCGCGCGAG GGCGTGCAGC GCGACCTGCT GCCGCTGATC
GAGGGCACGA CGGTATCCAC CAAATACGGG CCGGTGAAGA CGGACCATAT CCTGTTCATC
GCCTCGGGCG CGTTCCACAT CGCCAAGCCG TCGGACCTGC TGCCGGAATT GCAGGGGCGC
CTGCCGATCC GTGTCGAGCT GGCGCCGCTG ACGCGCGAGG ACCTGCGGCG CATCCTGACC
GAGCCGGAAC ATTCGCTGCT GAAGCAATAT ACCGCGCTGC TGGGGACCGA GGGCGTGACC
CTGGAGTTCT CGGACGACGC GGTGGACGCG CTGGCGGAAC TGGCCGCCGA CATCAACGAG
CGGATCGAGA ATATCGGCGC GCGGCGCCTG GCGACGGTGC TGGAGCGGCT GCTGGAAGAG
GTCTCGTTCA CGGCGTCCGA CCGTTCGGGC CAGGCGGTGC GCATCACCGC CGCCGACGTG
CAGGACAAGG TGGCGCCGCT GGCGCGCAAG GGCGACCTCA GCCGCTTTAT TCTTTAA
 
Protein sequence
MDIPNHSPRE IVSELDRFII GQTDAKRAVA IALRNRWRRA QLADGLRDEV VPKNILMIGP 
TGCGKTEIAR RLARLAQAPF LKVEATKFTE VGYVGRDVES IVRDLVEVSL NMLRDLRRRD
VQARAELAAE NRLVDALVGE GASADTKGKF RRMLRNGELE EKEVEISIAD TQAPGGMGDM
GNMTSGTVIN FSDMMKGLMN RVPQRRRMTV AAAREALTRE EADKMLDNDA LTGEAVAHAQ
DHGIVFLDEI DKVCARSSES GFRGGDVSRE GVQRDLLPLI EGTTVSTKYG PVKTDHILFI
ASGAFHIAKP SDLLPELQGR LPIRVELAPL TREDLRRILT EPEHSLLKQY TALLGTEGVT
LEFSDDAVDA LAELAADINE RIENIGARRL ATVLERLLEE VSFTASDRSG QAVRITAADV
QDKVAPLARK GDLSRFIL