Gene Hoch_2936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2936 
Symbol 
ID8545324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3999475 
End bp4001754 
Gene Length2280 bp 
Protein Length759 aa 
Translation table11 
GC content71% 
IMG OID646387618 
Productglycosyl transferase family 2 
Protein accessionYP_003267346 
Protein GI262196137 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.822223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACG CCACCGCCAT CATCCCCACG ACGGGCCAGC GGCCGGCGCT GCTCCGTCGG 
GCGTTGGCCT CGGTGCACGC GCAGACCCTG GCGCTGCGTG CGGTCTTGGT CGTGGTCGAC
GGGGACGATG TCCAGGCGTC CGAAGTCCGC GACCAGCTCG ACGGGCTGGC CGCTGAGGTG
ATCGCCGTGG GCACGCGACG CGGCGCCGGG GCTGCGCGCA ATCACGGCGC CCGACACGCG
CGCACGCGGC ATCTGTGCTT CCTCGACGAT GACGACGTAT GGAAACCCGA CTACCTCGCG
GCCGTGTTCG CGGCGGGGCC CGACTTCGAC CTTGCACTGA CCGCCTTCGA GAAGCACACG
GTCGCAGGAG CACGGCCCGA AAAAGTGCCG CCCGTCGCGC TGTGCGCCGA CGCGTTCCTG
GTCAAGAATC CGGGCCTGCG CGGCAGCAAT CTCGTGCTCA CCCGGTCCTT GTACTGGGCG
GCTGGCGGGT TCTCCGAGGC GCTGCCGGCG CTGAACGACA TGGACTTTGG CCTGCGCCTG
GTCGCCGCCG AGGTCGGGCG CTATCGTCGT GTCGTCGACC CGCTGGTCGA GTACCATGCG
CACGACGGCG AGCGGCTGTC CGGGCGCGGC GCGCGCGCCA TCCCGCCGGG ACTCAGCGGT
TTCCTCCTGC GCCACGGGCC GCGTATGGAT CACCGCCAGG AGGCGGCGTT CCGCGCCCGC
GCGCTCGCGC TCTGGGGGGT CGATCCCTGG GCGCTCGCGG CGCTCGCGCA GCGCTTTGAC
GAGGCGCGCG CTCAGGGGAC GCTGGCCGCG CACTTCCCGG GGCTGTTGCA CGCGGCCGAA
ACCGCGCTCC TCGAGGCCGC GTGCCAGAGC GATGCCCAAG CAGACGCCTA CCAGGCGTTC
ATCGATCGTC TGTGTCACGC ATTCGAGGAC GTCGCCGACA GCGCGCAGCG TGTGCGTACG
CTGCGTGTCG TGGTCATCAC CACGGACACA CCGGACAGCG TCGCCGGCCT GCTGAGTTCG
CTGGTCCAGA TGCTCACGCG CTCGCGCTGG CGTCACGCGC ACACCGGCCC GCTGGTCGAG
CTGCTGCTGG TGCGTAACGA CGCCGACGCC GAGATCGCGG CCGCCCACGA CGCGGTGCTG
CGAGGCTGGG ACGATCCGCG GGTCGCGGTG TCCGAAAAAA ACGTGCCCGC GGCCGCGCGC
CCGCTGTCGC TCACCGAGGC GCGGATCCAT GCGTTTCGCG CGGCGCGGAC ACGCGGCTGG
TGTCCCTCGC CCGAGGCGCC GGTGTGGTTT CTCGACGAGG ACTTCCGCTT CGAGGTTCTC
GTCCCGTCGG TCGAGCGCTG GTACCGACAG GTTCCGGGCG GCTCGCCGCT CCATCGGCTC
GAGTGTTTGG CCCTGCGCCT GGGCGCGGCC GGCGTGGACG CGCTCGTCTG CGGCAACAGC
GGCGCGCCCC CGGTGCCCGC GCTGGGGACC ATCGGGCGAC AACTCGGCGA TCTCATGTCG
TTGACCGCGG AGGCGCAGCC GTTGCGGCGC GATGTGCTCG CGGCGCTGCT CGAACGGCCT
GATCCGTACT ACGATCTTTC GCGCGATCTT CGAGATGATC TGCGCGCGCC GGTCGCGACC
ACCTGGTGGC GGGATCTCGG GCCGTTGCGT TGGGACGAGG TCGGGACGCG GCTGCTGGGC
GGCTTGCCGG TCACCCGTCC AGCGTTGCCC TCCCTGCACG AGCAGCCGGC GAGCGCCTGG
GGACATCAGA CGCCTGCCGC GGTCGCGGGC GGCAACACAG TCTTGCTCTC GATGCGCGTA
CTGCGCCCGG AGCGCTTTGC ACAGGTCGAG TGGCGGGGCG TTCGCTCGCG GCGCGGCGAC
ACGGTCTGGT GTCTGCGCTG TCAGCGTGAG GGCGCGCGCA TCGTCTGCGC CTCCCTGCCG
CTCTTGCACG CACGCGTCCC CCGGCGCGGC CCGTCTGGCT TCGATCGCGC CCTGCGCGAT
GCCCTCTCCG ACGCCCTCGG GGTCGGGCTC TACGAGACCA TCGCAGCGGG CGAGGGTCTC
GACGTGTCGG CGATCGAGCG GCGAGCAGCC CACCGACTCG CCGCGCTCAC CGACAACCTC
GCGAGCGCGG CCGACGCCTT GCGGAACACC GATGAGGCCA TCTTAGCGGC CCCGCTGCGT
GTGTTGGAGG GCGATTTGCG GGCGCTGTGC GCGTCGCTGC GGGACGCAAA AATCGTCGGC
GTGCACGTTG TCGGTGAACG CAGCGCCGAT CTCGAAGGCT TCTTGACGGT CAGCGTGTAG
 
Protein sequence
MIDATAIIPT TGQRPALLRR ALASVHAQTL ALRAVLVVVD GDDVQASEVR DQLDGLAAEV 
IAVGTRRGAG AARNHGARHA RTRHLCFLDD DDVWKPDYLA AVFAAGPDFD LALTAFEKHT
VAGARPEKVP PVALCADAFL VKNPGLRGSN LVLTRSLYWA AGGFSEALPA LNDMDFGLRL
VAAEVGRYRR VVDPLVEYHA HDGERLSGRG ARAIPPGLSG FLLRHGPRMD HRQEAAFRAR
ALALWGVDPW ALAALAQRFD EARAQGTLAA HFPGLLHAAE TALLEAACQS DAQADAYQAF
IDRLCHAFED VADSAQRVRT LRVVVITTDT PDSVAGLLSS LVQMLTRSRW RHAHTGPLVE
LLLVRNDADA EIAAAHDAVL RGWDDPRVAV SEKNVPAAAR PLSLTEARIH AFRAARTRGW
CPSPEAPVWF LDEDFRFEVL VPSVERWYRQ VPGGSPLHRL ECLALRLGAA GVDALVCGNS
GAPPVPALGT IGRQLGDLMS LTAEAQPLRR DVLAALLERP DPYYDLSRDL RDDLRAPVAT
TWWRDLGPLR WDEVGTRLLG GLPVTRPALP SLHEQPASAW GHQTPAAVAG GNTVLLSMRV
LRPERFAQVE WRGVRSRRGD TVWCLRCQRE GARIVCASLP LLHARVPRRG PSGFDRALRD
ALSDALGVGL YETIAAGEGL DVSAIERRAA HRLAALTDNL ASAADALRNT DEAILAAPLR
VLEGDLRALC ASLRDAKIVG VHVVGERSAD LEGFLTVSV