Gene Information |
Locus tag | Hoch_3830 |
Symbol | |
ID | 8546223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 5277928 |
End bp | 5279535 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646388499 |
Product | hypothetical protein |
Protein accession | YP_003268222 |
Protein GI | 262197013 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1651] Protein-disulfide isomerase |
TIGRFAM ID | |
|
|
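The coordinate and length fields above are mutually consistent, which a short check makes explicit. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends with a stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 5277928
END_BP = 5279535


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1608 535
```

Both results agree with the Gene Length (1608 bp) and Protein Length (535 aa) rows.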
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.220897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0399088 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGCGGT ACGAGCGCCG TCGCCGGCCC CCGCCCCTGG ACGCTCCCGG CGGCAACTTC TATGCTGACG GCATGCGCCG CCTGTCATGT GCTGTGCTCG CCCTGCTCCT GAGCGCGCTC GGCGCTGCGC GCGAAGGCCG CGCCCAGCCC GCGCCCGCGT CCACACAGCC GGCAACGGCG CCCGCGGGCG ACGTCGAAGG CGACGCGGCC CATCCCGACG GCGGCGGCCT GCGCCCCCAG ATCGCCACCG TGCCGCACCG GCGCGAGAGC GCGCATCCCG GCTTCGGCCC GGCGGCCGCG CTGGTCACGG TCGAGCTGTT CCTGGCCCCG GGCGACCGCG GCAGCCGCCT GGTCGATCGC CACCTGCGCG AGCTGCAGAC GCGCCACCCC GGGCGCGTGC GCCTCGACTA CCGCCTCACC GGCGTCGGCC GCGCCCGCGA CCTCAGCGTG GCGCTGCTCG AGGCCCACGA GCAGGGCCGC TTCGCGCAGC TCTGGGACGC GGTCTCGGGG CGGCGTCGGC CGGTGCTCGA GCGCGACGCG CTGGCCGCGC TGGCCGGCGA GCACGGGCTC GACCTGGGCA AGCTCGAGGC CGCCTGGAGC GACGGCCGCC ACGACCGCGC GCTGTACCTC AACGACAGCG AGCGCAAGCG CCGCGCCGAC GCGGTGCCGG CGGTGTTTTT CAACGGCCAG CTCGCCACCC GCGCGCAGCT CGGCGACGCC GACGAGGTCG AGCGCCGCTA CGCCGCGGCC CTGCTGCGGG GCCGGCATGC GCGCGCGCGC GGGGCCTCGC TGGCGCAGGT GCACGCGACC TTGCTGCGCG AGATCGCGGC CCAGCGCGCG GCCGCGACCA CGGGCGAGTT CCTGGGCGCG ATCGACGGGC TCTCGGCCAG CGCCATCGCC GAGCAGGAGA TCGCGCCCTG GCTGGTGCGC CAGGACATCG AGGTGCCGGG CCACGATTTC GGACCGCGCT CCGCAGCGGT CACGCTCGAC GTCTACTGCA ACTTCCTGTC GGCCAACTGC GCCATGCTCA AGTCGTCGCT CACCACCGCG ATGGAGGAGT TTCCCACCGA GCTGCGCGTG GTCTTCCACC ACATGTTCCC GCGCGCGGTG CTCGACGACG ATGACGACGA CGATGACGAC GGCGACGGCG CGCCCGCGGA CCGCGGCGCG GCTACGCTCG ACGAGGACGA GCGCACCGCG CTCGAGCGCG CGCTGCTGAG CATCCATCAG GCCTCGCTGT GCGCGGCCGA TCAGGGCGCG TTCTGGGCCT TCTACAAGCG CGCCTATCAG CTCCGCGGCG CCCAATATCG GCATCTGAGC AGCGACGAGC GCGTCGCCGC CATCGCCGCC GAGCTGCCCG TGGAGCGCGC GCGCTTCGAC GCCTGCGCGG CCCGGCCCGA GGGCGCGCAG CGGGTGCTCG AGCGGCTCGA GGCGGCCCGC GAGCTGGGCA TCGTCGACAC CCCGACCGTG GTCGTGGGCG GCCGCGCCTA CCCCGGCTTC AAGTCCTCGC TCGACCTGCG CCTGCTCATC CAGACCCAGC TCGCGCCCGG CCTGCTCGAG CGCCTGTTCC CGCACAGCCA GCCCGAGCAC TTCGAGCGCG AACCCTGA
|
Protein sequence | MKRYERRRRP PPLDAPGGNF YADGMRRLSC AVLALLLSAL GAAREGRAQP APASTQPATA PAGDVEGDAA HPDGGGLRPQ IATVPHRRES AHPGFGPAAA LVTVELFLAP GDRGSRLVDR HLRELQTRHP GRVRLDYRLT GVGRARDLSV ALLEAHEQGR FAQLWDAVSG RRRPVLERDA LAALAGEHGL DLGKLEAAWS DGRHDRALYL NDSERKRRAD AVPAVFFNGQ LATRAQLGDA DEVERRYAAA LLRGRHARAR GASLAQVHAT LLREIAAQRA AATTGEFLGA IDGLSASAIA EQEIAPWLVR QDIEVPGHDF GPRSAAVTLD VYCNFLSANC AMLKSSLTTA MEEFPTELRV VFHHMFPRAV LDDDDDDDDD GDGAPADRGA ATLDEDERTA LERALLSIHQ ASLCAADQGA FWAFYKRAYQ LRGAQYRHLS SDERVAAIAA ELPVERARFD ACAARPEGAQ RVLERLEAAR ELGIVDTPTV VVGGRAYPGF KSSLDLRLLI QTQLAPGLLE RLFPHSQPEH FEREP
|
| |
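The gene and protein sequences can be related with a small translation sketch. It assumes the sequence is given in space-separated blocks as above, that the first codon is a valid initiator (GTG is one of the start codons of translation table 11 and is read as methionine at the start position), and that the amino acid assignments of table 11 match the standard genetic code. The helper names are hypothetical, and only the first 30 bases of the gene are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq):
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is an initiator and is emitted as 'M'.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "GTGAAGCGGT ACGAGCGCCG TCGCCGGCCC"
print(translate_cds(prefix))          # prints: MKRYERRRRP
print(round(gc_fraction(prefix), 3))  # prints: 0.767
```

The translated prefix matches the first block of the protein sequence, and the GC fraction of this prefix is close to the 75% reported for the whole gene.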