Gene Hoch_3830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3830 
Symbol 
ID8546223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5277928 
End bp5279535 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content75% 
IMG OID646388499 
Producthypothetical protein 
Protein accessionYP_003268222 
Protein GI262197013 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1651] Protein-disulfide isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.220897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0399088 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCGGT ACGAGCGCCG TCGCCGGCCC CCGCCCCTGG ACGCTCCCGG CGGCAACTTC 
TATGCTGACG GCATGCGCCG CCTGTCATGT GCTGTGCTCG CCCTGCTCCT GAGCGCGCTC
GGCGCTGCGC GCGAAGGCCG CGCCCAGCCC GCGCCCGCGT CCACACAGCC GGCAACGGCG
CCCGCGGGCG ACGTCGAAGG CGACGCGGCC CATCCCGACG GCGGCGGCCT GCGCCCCCAG
ATCGCCACCG TGCCGCACCG GCGCGAGAGC GCGCATCCCG GCTTCGGCCC GGCGGCCGCG
CTGGTCACGG TCGAGCTGTT CCTGGCCCCG GGCGACCGCG GCAGCCGCCT GGTCGATCGC
CACCTGCGCG AGCTGCAGAC GCGCCACCCC GGGCGCGTGC GCCTCGACTA CCGCCTCACC
GGCGTCGGCC GCGCCCGCGA CCTCAGCGTG GCGCTGCTCG AGGCCCACGA GCAGGGCCGC
TTCGCGCAGC TCTGGGACGC GGTCTCGGGG CGGCGTCGGC CGGTGCTCGA GCGCGACGCG
CTGGCCGCGC TGGCCGGCGA GCACGGGCTC GACCTGGGCA AGCTCGAGGC CGCCTGGAGC
GACGGCCGCC ACGACCGCGC GCTGTACCTC AACGACAGCG AGCGCAAGCG CCGCGCCGAC
GCGGTGCCGG CGGTGTTTTT CAACGGCCAG CTCGCCACCC GCGCGCAGCT CGGCGACGCC
GACGAGGTCG AGCGCCGCTA CGCCGCGGCC CTGCTGCGGG GCCGGCATGC GCGCGCGCGC
GGGGCCTCGC TGGCGCAGGT GCACGCGACC TTGCTGCGCG AGATCGCGGC CCAGCGCGCG
GCCGCGACCA CGGGCGAGTT CCTGGGCGCG ATCGACGGGC TCTCGGCCAG CGCCATCGCC
GAGCAGGAGA TCGCGCCCTG GCTGGTGCGC CAGGACATCG AGGTGCCGGG CCACGATTTC
GGACCGCGCT CCGCAGCGGT CACGCTCGAC GTCTACTGCA ACTTCCTGTC GGCCAACTGC
GCCATGCTCA AGTCGTCGCT CACCACCGCG ATGGAGGAGT TTCCCACCGA GCTGCGCGTG
GTCTTCCACC ACATGTTCCC GCGCGCGGTG CTCGACGACG ATGACGACGA CGATGACGAC
GGCGACGGCG CGCCCGCGGA CCGCGGCGCG GCTACGCTCG ACGAGGACGA GCGCACCGCG
CTCGAGCGCG CGCTGCTGAG CATCCATCAG GCCTCGCTGT GCGCGGCCGA TCAGGGCGCG
TTCTGGGCCT TCTACAAGCG CGCCTATCAG CTCCGCGGCG CCCAATATCG GCATCTGAGC
AGCGACGAGC GCGTCGCCGC CATCGCCGCC GAGCTGCCCG TGGAGCGCGC GCGCTTCGAC
GCCTGCGCGG CCCGGCCCGA GGGCGCGCAG CGGGTGCTCG AGCGGCTCGA GGCGGCCCGC
GAGCTGGGCA TCGTCGACAC CCCGACCGTG GTCGTGGGCG GCCGCGCCTA CCCCGGCTTC
AAGTCCTCGC TCGACCTGCG CCTGCTCATC CAGACCCAGC TCGCGCCCGG CCTGCTCGAG
CGCCTGTTCC CGCACAGCCA GCCCGAGCAC TTCGAGCGCG AACCCTGA
 
Protein sequence
MKRYERRRRP PPLDAPGGNF YADGMRRLSC AVLALLLSAL GAAREGRAQP APASTQPATA 
PAGDVEGDAA HPDGGGLRPQ IATVPHRRES AHPGFGPAAA LVTVELFLAP GDRGSRLVDR
HLRELQTRHP GRVRLDYRLT GVGRARDLSV ALLEAHEQGR FAQLWDAVSG RRRPVLERDA
LAALAGEHGL DLGKLEAAWS DGRHDRALYL NDSERKRRAD AVPAVFFNGQ LATRAQLGDA
DEVERRYAAA LLRGRHARAR GASLAQVHAT LLREIAAQRA AATTGEFLGA IDGLSASAIA
EQEIAPWLVR QDIEVPGHDF GPRSAAVTLD VYCNFLSANC AMLKSSLTTA MEEFPTELRV
VFHHMFPRAV LDDDDDDDDD GDGAPADRGA ATLDEDERTA LERALLSIHQ ASLCAADQGA
FWAFYKRAYQ LRGAQYRHLS SDERVAAIAA ELPVERARFD ACAARPEGAQ RVLERLEAAR
ELGIVDTPTV VVGGRAYPGF KSSLDLRLLI QTQLAPGLLE RLFPHSQPEH FEREP