Gene Hoch_2050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2050 
Symbol 
ID8544432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2832515 
End bp2834008 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content69% 
IMG OID646386753 
Productprotein of unknown function DUF404 
Protein accessionYP_003266488 
Protein GI262195279 
COG category[S] Function unknown 
COG ID[COG2308] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.591797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.273192 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCAG AAGCAACCGC CGAAAAACCG CTATTCGACG GGTACCGGCC GGCTGAAGGC 
GTTTTTGACG AAGTTTTCAG CAGCGACGGG CAGCCGCGCG CAGAAGTGCG CCGCGCCATC
GAACTCATCT CGAGCCTGGG TCGCGCCGAG TTCGTCCGCC GCCAGCGCAT CGCCGACACC
TCGTTCTTCA AAGCCGGGGT GACGTTCAAT GTCTACGCCG ACAACCCCGC CGGAACCGAG
CGAATCTTTC CCTTCGACAT GATCCCGCGC ATCATCGACG CCGCCCACTG GCAGCGCATC
GAGCGCGGCC TGGCCCAGCG CGTGGCCGCG CTCAACGCCT TCCTGGGCGA CGTCTACGGC
CCGCAGCGCA TCCTCGACGA GGGCGTGATC CCGCGCGAGC TGGTCGAGGG CGCGGCCGGG
TATCTGCCCG CCATGCGCGC GATCAAGCCG CGCAATGGCG TCTACGTGCA CATCGCCGGC
ATCGACCTCA TCCGCGACCC GGAGGGCGAC TTCCTGGTGC TCGAGGACAA CGCGCGCACG
CCCTCGGGCG TGTCCTACGT GCTCGAGAAC CGGGCGGTGA TGAAGCGCGC CATCTCCAAG
ATCTTCCAGG ACACGCACGT GCGCGCGGTC GACGACTATC CGCGCCAGCT CCGCGACGCG
CTGCTGGCGC TGGCGCCGGC GGCCGACGAG CCGCGCCTGG TGGTGCTGAC CCCGGGGCGG
TGGAACTCGG CCTACTTCGA GCACAGCTTC CTGGCCCGGC GCATGGGCTG CGAGCTGGTC
GAGGCCTCGG ACTTGTTCGT CGAGAACCGG CGCGTGTACG TCAAGACCAC GGCCGGCCCG
CAGCCGGTGC ACGTGATCTA CCGGCGCATC GACGACGAGT TCATCGACCC CGAGGTGTTC
CGCCCCGACT CGCTGCTGGG CGTGCCCGGC CTGGTCGACG CCTACGCGGC CGGCAACGTC
ACCCTGGCCA ACGCGCTGGG CAACGGCGTG GCCGACGACA AGGCGGTGTA TCCCTTCGTC
CCCGATATGA TCCGCTTCTA CCTGTCGGAG GAGCCCATCC TCGGCCAGGT GCAGACCTAC
CGCTGCGGGC TCGACGAGCA CCGCCGCTAC GTTCTCGACC ACCTGGACGA ACTCGTGGTC
AAGGCGGTCG ACGCCTCGGG CGGCTACGGC ATGCTCATGG GGCCGACCGC GAGCGAGACC
GAGCGCACCG ACTTCGCGGC CCGCATCCGC GAAAACCCGC GCGGCTACAT CGCGCAGCCG
CGGGTCGAGC TGTCGAGCTG CCCGACGTGG CAGAACGGCA GCGTGCACCC GCGCCGGGTC
GATCTGCGAC CGTATATCGT CACGGGAACC TCGAGCTGGG TGCTGCCGGG CGGGCTCACC
CGGGTGGCCT TGCGCGAGGG TTCGTACGTC GTCAACTCCA GCCAGGGCGG AGGCTCCAAG
GACACCTGGG TGGTCGAGGA GAAGGCCGAG GAGAAGGCCG AGGTGAACTC ATGA
 
Protein sequence
MTSEATAEKP LFDGYRPAEG VFDEVFSSDG QPRAEVRRAI ELISSLGRAE FVRRQRIADT 
SFFKAGVTFN VYADNPAGTE RIFPFDMIPR IIDAAHWQRI ERGLAQRVAA LNAFLGDVYG
PQRILDEGVI PRELVEGAAG YLPAMRAIKP RNGVYVHIAG IDLIRDPEGD FLVLEDNART
PSGVSYVLEN RAVMKRAISK IFQDTHVRAV DDYPRQLRDA LLALAPAADE PRLVVLTPGR
WNSAYFEHSF LARRMGCELV EASDLFVENR RVYVKTTAGP QPVHVIYRRI DDEFIDPEVF
RPDSLLGVPG LVDAYAAGNV TLANALGNGV ADDKAVYPFV PDMIRFYLSE EPILGQVQTY
RCGLDEHRRY VLDHLDELVV KAVDASGGYG MLMGPTASET ERTDFAARIR ENPRGYIAQP
RVELSSCPTW QNGSVHPRRV DLRPYIVTGT SSWVLPGGLT RVALREGSYV VNSSQGGGSK
DTWVVEEKAE EKAEVNS