Gene Avin_45390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_45390 
SymbolhslU 
ID7763407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4608407 
End bp4609747 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content65% 
IMG OID643807387 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_002801628 
Protein GI226946555 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCATGA CGCCCCGCGA AATCGTCCAC GAACTCAACC GCCACATCGT CGGCCAGGAA 
GATGCCAAGC GCGCCGTCGC CATCGCCCTG CGCAACCGCT GGCGGCGCAT GCAACTGCCG
GCCGAGCTGC GCGCCGAGGT CACGCCGAAG AACATCCTGA TGATCGGCCC GACCGGCGTC
GGCAAGACCG AGATCGCCCG CCGCCTGGCA CGCTTGGCGA ACGCCCCGTT CATCAAGGTA
GAGGCGACCA AGTTCACCGA GGTGGGCTAT GTCGGCCGTG ACGTGGAATC CATCATCCGC
GACCTGGCCG ACGCGGCGGT GAAGATGATG CGCGAGCAGG AGATCCAGCG GGTGCGCCAC
CGCGCCGAGG ACGCCGCCGA GGACCGCATC CTCGACGCCC TGCTGCCGCC GGCGCGTCAG
GGCTTCGGCG ACGAACCGAT CGCCCGCGAG GACTCGAACA CCCGCCAGTT GTTCCGCAAG
CGCCTGCGCG AGGGCCAGTT GGACGACAAG GAAATCGACA TCGAGATCAC CGAAACGCCC
AGCGGCGTGG AGATCATGGC TCCACCCGGC ATGGAGGAAA TGACCAGCCA GTTGCAGAAC
CTGTTCTCCA GCATGGGCAA GGGCCGCAAG AAGACCCACA AGCTGAAGGT CAAGGATGCG
CTCAAACTGG TCCGCGACGA GGAAGCCGCC CGCCTGGTCA ACGAGGAGGA ACTGAAGGCC
CGTGCCCTGG AATCGGTCGA GCAGAACGGC ATCGTCTTCA TCGACGAGAT CGACAAGGTG
GCCAAGCGTG CCAACGTCGG CGGCGCCGAC GTCTCTCGCG AGGGCGTGCA GCGCGACCTG
CTACCGCTGA TCGAGGGCTG CACGGTGAAC ACCAAGCTGG GCATGGTCAA GACCGACCAC
ATCCTGTTCA TCGCCTCTGG CGCCTTCCAT CTCGCCAAGC CCAGCGATCT GGTGCCGGAA
CTGCAGGGCC GCCTGCCGAT CCGCGTCGAA CTCAAGGCGT TGACGCCCGA GGACTTCGAA
CGCATCCTCA CCGAACCCCA CGCCTCGCTG ACCGAACAGT ACCGCGAGCT GCTGAAGACC
GAAGGACTGA ACATCCAGTT CGCCGCCGAC GGCATCAAGC GCATCGCCGA AATCGCCTGG
CAGGTCAACG AGAAGACCGA GAACATCGGC GCCCGCCGCC TGCACACCCT GCTCGAGCGC
CTGCTGGAGG AAGTCTCGTT CAGCGCCGGC GACCTGGCCG CCGACCACAG CGGCCAGCCG
ATCGTGATCG ACGCCGCCTA CGTCAACAAC CACCTCGGCG AACTGGCCCA GGACGAGGAT
CTGTCGCGCT ACATTTTGTA G
 
Protein sequence
MSMTPREIVH ELNRHIVGQE DAKRAVAIAL RNRWRRMQLP AELRAEVTPK NILMIGPTGV 
GKTEIARRLA RLANAPFIKV EATKFTEVGY VGRDVESIIR DLADAAVKMM REQEIQRVRH
RAEDAAEDRI LDALLPPARQ GFGDEPIARE DSNTRQLFRK RLREGQLDDK EIDIEITETP
SGVEIMAPPG MEEMTSQLQN LFSSMGKGRK KTHKLKVKDA LKLVRDEEAA RLVNEEELKA
RALESVEQNG IVFIDEIDKV AKRANVGGAD VSREGVQRDL LPLIEGCTVN TKLGMVKTDH
ILFIASGAFH LAKPSDLVPE LQGRLPIRVE LKALTPEDFE RILTEPHASL TEQYRELLKT
EGLNIQFAAD GIKRIAEIAW QVNEKTENIG ARRLHTLLER LLEEVSFSAG DLAADHSGQP
IVIDAAYVNN HLGELAQDED LSRYIL