Gene Hoch_2384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2384 
Symbol 
ID8544770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3300694 
End bp3302313 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content76% 
IMG OID646387083 
ProductN-acetylglucosaminyl transferase-like protein 
Protein accessionYP_003266814 
Protein GI262195605 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.40493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.336988 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCACGC CGTTGACCCT GGCCCTCTCG TGCGTGCTCG CGCTGGTCAT CGGCGTGTTG 
CTGGGGCGCT TCTATGTCCC GGCCAAGCGC GGCTTGGTGC GCGCGGCGCG CCAGGCCGAG
AGCTACGCCC GGGCCCTCAA CCACACCCTC GAGGACCAGC CCGACGACGC GGTCGAGGCG
CTGCGCCGCG TGGTCGCCGA GGACACCGAC GATCTCGAGC CGTACTTCGC CCTGGGCGCG
CTGTTCCGCC GCCGCGGCGA GTGGGAGCGG GCGGTGCGTG TGCACCAGGC CATCGCCATG
CGCGATCCCA AGAACAAGGC CATCCAGGGG CGCGCGCACT TCGCCCTGGG CCGCGACTTC
ACCTGCGCCG GCATGCCGCG CCGGGCCACG CGCGCCTTCG AGCAGTGCCT GGTGGTCGAC
GGCAAACACC AGCCGGCGCT GCGCGCGCTG GTGGCGCTGT ACGAGGAGCA GGGCCGCTAC
GCCGAGGCCG CCGACGCGCT CGCGCGGCTC GACAAGCTGC GCGAGCAGGG CCCCTCGGCG
CGCGGCCATC ACCTGCTGGT AGCGGCGGCG CAGTCGGCGC TGCGCGGACC CGCCGCCGAT
CTCGACCACG CCAGCCGGCT GCTGCGAGAC GCCCGCCGCG GCAAAGCGCA CAGCGTGCAC
GCGCTGGTCG CCGAGGCCGA GTTGGCGGCC GCGCATCGCG ATCCGGATGC GGCCTGCGAA
CACCTCCTCG ACGCCGTCGA GATGGCGCCC GAGCTGGCCG CGTTCCTGTT GCCCGGGCTG
ATCGAGGCCC AGCGCCAGAG CATGCGACGC GAGCGCGGTG ACAGCGCTGA GCTCGCGGTC
TCGGACGAGG CCGCGGTCGC CGGCGTGGCG GCCAAGCTCG CCGAGCGCCT GGCCACCTCC
GGGCGCAGCG ATGAACCCTT TGCCGGCATG GCGCTGGCCG AGCTGCGCTC GCACTGCGAT
CCCGAGGCCG CGCTGGCCGA CTACCGCGAC CTGGCCGAGC GCTTTCCCGA CCTGCTGCCG
GCGCAGGTGG CGGCCGCCCG CATGGCGCTG GCCGCGGGCG ACGAGGGCGA GATCCGCGAC
GCCCTGCGCC GCTTGAGCGC GGCCGACGGC GTGCTCGCTT GGGCCATGGA GGGCGCCTGG
CGCTGCAGCG GCTGCGGCCA TCGCCAGGAC CTGTTTTTCT GGCGCTGCCC CGCGTGTCGC
GCCTGGGGCA GCGTGCGCCT CGAGCTCGGG CGCGAGGCGC TGGCGCCGCC GCCGCCGCCG
CCCTGGGACG AGCCCGCGCT GGTCCGCGGC GGTGTCGATG CCGCGCTCTC GGGTGCGGCC
GCGAGGCGTA CCCGGGCCTC GGCGATGGTG GCTGCGGGGG CCTCGGCCTC GCACCAGGCC
GCGCCCTGGA TCGACGCCTC CTCGGGCTCG AGCCGCAGCG CCTCGCTGTG GAGTCGCGTG
GGCGCGTGGT TCGGTGGCGT GGGCGCATCG CAGGCGCCCG CCGAGCCGGT CGCCAAAGCC
CCCGCCGGCG CGGCCGCGAG CCCGCGCCCG AGCCCGCCCG CGACGGTCCC GGCGTGGCGC
GAGGACGGCG CGGCCGCCGA CGCTGCGGCC GCCGACAACG CAGAGCAGAG TAGCGTATGA
 
Protein sequence
MITPLTLALS CVLALVIGVL LGRFYVPAKR GLVRAARQAE SYARALNHTL EDQPDDAVEA 
LRRVVAEDTD DLEPYFALGA LFRRRGEWER AVRVHQAIAM RDPKNKAIQG RAHFALGRDF
TCAGMPRRAT RAFEQCLVVD GKHQPALRAL VALYEEQGRY AEAADALARL DKLREQGPSA
RGHHLLVAAA QSALRGPAAD LDHASRLLRD ARRGKAHSVH ALVAEAELAA AHRDPDAACE
HLLDAVEMAP ELAAFLLPGL IEAQRQSMRR ERGDSAELAV SDEAAVAGVA AKLAERLATS
GRSDEPFAGM ALAELRSHCD PEAALADYRD LAERFPDLLP AQVAAARMAL AAGDEGEIRD
ALRRLSAADG VLAWAMEGAW RCSGCGHRQD LFFWRCPACR AWGSVRLELG REALAPPPPP
PWDEPALVRG GVDAALSGAA ARRTRASAMV AAGASASHQA APWIDASSGS SRSASLWSRV
GAWFGGVGAS QAPAEPVAKA PAGAAASPRP SPPATVPAWR EDGAAADAAA ADNAEQSSV