Gene Hoch_5083 details


Gene Information

Locus tag: Hoch_5083
Symbol:
ID: 8547494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7007996
End bp: 7009873
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 67%
IMG OID: 646389759
Product: AAA ATPase central domain protein
Protein accession: YP_003269464
Protein GI: 262198255
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0464] ATPases of the AAA+ class
TIGRFAM ID:
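
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(7007996, 7009873)
print(length)                     # 1878
print(protein_length_aa(length))  # 625
```

Both results match the Gene Length and Protein Length fields listed in the record.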


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.322132
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.684028
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACAACT CTGTCGCCAT CATGCCGGCC TGGGCCGAGG AGCTCCGCCA CCGCTACTTG 
CGCGGTGAGG CGTGGCTGTT CGTGGTTCAT GGCAACGTCC ACGACCTCAT CCTGTACCAG
GATAAGCTCG TCGGCGCGGT CGACTTCCTC AGCGAGCACG TGCTCGACAA GAAGGAAAGC
GTCCTGCGCT ACAACGTCTC CACCGGCTGC CGCTTCGTCA AGAAGACGCG CAGCATGGAG
GCGCTCGCCA TCGAAGATCT CATCGCCCAG CGCAACCCCG AGCGGGTGCT GCCGGTGCTC
GAGCAGATCC TCTACTACCA GCGCAATGTC GGCCTGGTGG TCGATCACGC GGAGATGATC
GCGCCCGCGG GCGACCCGAC CTTCTTCTCG CAATCCGACC GCATGTCCGT GGTCACCTTG
CAGCGCTGGT CGCTGGCGCC CGAGATCGAG CGCGCCGACA ACATCGTCAT CCTGGTGACC
GAGTCGGCCG GCGAGCTCAA TCCCAAGATC ATGGCCCACC CGCGGGTGGC CTCGGTCGAG
ATTCCCATGC CCCTGGTCGA CGCCCGGCGG CAGGTCATCC GCGCGGTCAA TCCCGAGCTG
GACGACGCCT GGGTCGAGCG CTTCGCCGAC ATCACCGCCG GCCTGCGCAG CATTCAGATC
AAGTCGATCC TGCAGCCGCC GCCCGAGGGC GAGGAGGACC CCAACGCGCG CTTCAAGTTC
ATCCGCCAGC TCGTCGGCGG CGACGACGCG CGCGCGCGCA AGTTCGCGGC CATCACCCGC
GGCATGCCGC GCGACGAGAT CCGCGCGCTG ATCGGCGGCG TGGCCACGCC CCCGGCCGAG
CGCGAGGACG AGGGCCAGGA GTTCGACGAG GTCCTGGCCC TGATCGCGCG CCGCAAGCGC
GAGATCATCG AGCGCGAGTG CTTCGGCATC ATCGAGTTCG TCGAGCCCGA TCACGACTTC
TCGGTGGTCG GCGGCGTGGA CGAGATCAAG CGCGAGCTCA AGAGCATCGC GCGCAACATC
CGCGAGGGCC GGCGCAGTCG CGTGCCCATG GGCCTCTTGT TCACCGGCCC GATGGGTACC
GGCAAGACCT TCGTGGCCGA GGCCTTTGTC AAGGAGAGCG GGCTCACCGG GCTCAAGCTC
AAGAACTTCC GCTCCAAGTG GGTGGGCGCG ACCGAGTCCA ACCTCGAGCG CATCCTCGGC
GTCATCCGCG CCATCGGCAA CGTCATCGTC ATCATCGACG AGGGCGACCG CTCGTTTGGC
TCGGGCGACG GCGAGGGCGA CGGCGGCACC TCCTCGCGCA TCATCGCGCG GCTCAAGGAG
TTCATGAGCG ACACCTCGAA CCGCGGCCGG GTGCTGTTCG TGCTCATGAC CAACCGCCCC
GACAAGCTCG ATATCGATAT CAAGCGCGCC GGCCGCCTGG ACCGCAAGAT CCCGTTCCTC
TACCCGCACA CGCCCGAGCA GGTGGAGCTG GTGCTCGAGG CCCAGATCCG CAAGCACGAC
CTCGACACCG AGCTGTCGTT TCCGCGCGAT CGCGAAGAGG TCTCGGACAA GCTGGTCGGC
TACTCCAACG CCGACCTCGA GGCCCTGGCG CTGCTGGCCT ACAGCTACGC CAACGACCTC
GGCCCGGAGG AGGGCGCCGG CGCCGGCGAG GGCGCCGCTG AGCGCAAGAT CGGCGCCATC
GACGTCGAGA CCTTTCAGCG CGCGGTGGCC GATTATCTGC CCTCGCGCGA TCGCGACATG
CTGGCGTACA TGGAGCTGTT GGCCGTGTTC GAGGCATCGA ATCGGCGCAT GTTGCCGCCC
AAGTACGCCA ATCAGAGCAT CGAGGAGCTT CAGCAGCAGC TCGAAGAGCT GCGCATCCGC
TGTGCGGGGC GGCGGTAA
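
The GC content field can be recomputed from the gene sequence by counting G and C bases. The following is a minimal sketch, assuming the sequence is supplied as text in the 10-base blocks shown above; `gc_fraction` is a hypothetical helper name. Only the first line of the sequence is used here as sample input, so the result applies to those 60 bases, not to the whole gene.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above (60 bases).
first_line = (
    "GTGAACAACT CTGTCGCCAT CATGCCGGCC TGGGCCGAGG AGCTCCGCCA CCGCTACTTG"
)
print(f"{gc_fraction(first_line):.2f}")  # 0.65
```

Passing the full 1878 bp sequence to the same function gives the figure to compare against the GC content field in the record.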
 
Protein sequence
MNNSVAIMPA WAEELRHRYL RGEAWLFVVH GNVHDLILYQ DKLVGAVDFL SEHVLDKKES 
VLRYNVSTGC RFVKKTRSME ALAIEDLIAQ RNPERVLPVL EQILYYQRNV GLVVDHAEMI
APAGDPTFFS QSDRMSVVTL QRWSLAPEIE RADNIVILVT ESAGELNPKI MAHPRVASVE
IPMPLVDARR QVIRAVNPEL DDAWVERFAD ITAGLRSIQI KSILQPPPEG EEDPNARFKF
IRQLVGGDDA RARKFAAITR GMPRDEIRAL IGGVATPPAE REDEGQEFDE VLALIARRKR
EIIERECFGI IEFVEPDHDF SVVGGVDEIK RELKSIARNI REGRRSRVPM GLLFTGPMGT
GKTFVAEAFV KESGLTGLKL KNFRSKWVGA TESNLERILG VIRAIGNVIV IIDEGDRSFG
SGDGEGDGGT SSRIIARLKE FMSDTSNRGR VLFVLMTNRP DKLDIDIKRA GRLDRKIPFL
YPHTPEQVEL VLEAQIRKHD LDTELSFPRD REEVSDKLVG YSNADLEALA LLAYSYANDL
GPEEGAGAGE GAAERKIGAI DVETFQRAVA DYLPSRDRDM LAYMELLAVF EASNRRMLPP
KYANQSIEEL QQQLEELRIR CAGRR
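
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The gene begins with GTG, which table 11 allows as an alternative start codon. An initiating codon is read as methionine, which is why the protein begins with M and not V. The sketch below illustrates this on the first and last lines of the gene sequence. It is not the database's own pipeline, and the names `translate_table11`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_table11(dna: str) -> str:
    """Translate a CDS fragment; a leading start codon is read as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_line = (
    "GTGAACAACT CTGTCGCCAT CATGCCGGCC TGGGCCGAGG AGCTCCGCCA CCGCTACTTG"
)
last_line = "TGTGCGGGGC GGCGGTAA"
print(translate_table11(first_line))  # MNNSVAIMPAWAEELRHRYL
print(translate_table11(last_line))   # CAGRR
```

The two outputs agree with the first 20 and last 5 residues of the protein sequence above. The last line can be translated on its own because the preceding 1860 bases are a multiple of 3, so it starts in frame.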