Gene Hoch_5069 details

Gene Information

Locus tag: Hoch_5069
Symbol:
ID: 8547480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6989607
End bp: 6990857
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 76%
IMG OID: 646389745
Product: glycosyl transferase group 1
Protein accession: YP_003269450
Protein GI: 262198241
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
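
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon (both are assumptions, not stated in the record). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6989607
END_BP = 6990857
GENE_LENGTH_BP = 1251
PROTEIN_LENGTH_AA = 416


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1251
print(expected_protein_length(GENE_LENGTH_BP))  # 416
```

Under these assumptions both computed values agree with the reported gene length and protein length.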


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.338432
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCGTCG GCGTAGTCAC CAGCTCGTAT CCCCGCTGGC CCGGCGACCC GGCCGGAAAT 
TTCGTGGCCG CTCACGCGGG CTGGCTGCGG GATGCCGGAC ACGCGGTCGA GGTCGTGTGC
GCGGGCGAGC CGGGCGCGCG CGCGCGCTGG CAAGAGGGCG TGCGCGTGTT GCCGGTGGCG
GCGCGGCCGG GGCTGTTCTA CGCGGGCGGC GCGCCCGAGG CCCTGTCCAT GTCCCGGTCG
CGGCCGCGAC CGGCGATGGC GGCGGCCGCG CTGGCCTTCT CGCTGTCCCT GCGGCGCGCG
CTCGCGGAGC GCGCTCACTA CTGGGATGCG GTGTTCGCGC ACTGGCTGTT GCCGAGCGCG
GCCGCCGCGG TGCTGGCCCT GCCGCGAAGC CGGCGCGCGG TCGCCATCGC CCACTCGGGC
GACGTGCATC TGGCCCGCGC GCTGGCGCTG TGCACGCCCT TGGCCGCGGC CATGCACGCC
CGCGGCGATC GCGTGTGTTT CGTGAGCGAA CACGTGCGCG CGCGTTTTCT CGCCGGCGTG
TGGCCGCGGG GGCTACGCCG AGCGCTGCGG GCGCGCTCGC TGGTGCGTCC CATGGGCGTG
TCCCTGGCGC GCTGGCAGGC GGCGCGGGCG CGCGCGGACG CGCTGCGAGT CGGGCACGGC
GATGGCGCGT ATCGCGACGA GCGCGCGCGC GTAGTTTTTT TGGGACGACT GGTCCCCATC
AAGGGCGTGG CGGTATTGCT CGAGGCCTGC GCCCAGTTCG CGCGCGCCGG GTTCGCGCTG
GATCTGCTCG TGGCCGGCGA TGGGCCGCTG CGCGCCCAGC TCGCGGCGCG CGCCGAGACC
CTGCGCGCGA GCCTGCCGCC GGGCGCGGCT GCGCTCAGCG TCGAGTTCGC GGGTGAGCTA
CAGGGCACCC GCCTGGGCGA TGCGGTGGCC GCGGCCGACC TGCTGGTGTT GCCTTCGCTG
CCGGTCGCCG GCGGTCGCAG CGAGGGCGCT CCGGTCACCG CGCTCGAGGC CATGGCCGCA
GGGACGGCGG TGTTGGCCTC GCGTACTGGC GGCCTGGCCG AGCTGCCCGA AGACGCCGCG
ACCCTGGTCC CGGCCGGCGA TGTCGACGCG CTCGCCCAGG CGCTGCGCAG GCTGCTGCGC
GATCGCGCAG GGCGGGCGGC GCAGGTGCGG CGGGCGCGCG CGTGGGTGCG ACAACATGAC
TGGGCAGAGG TCGGAGCAGC GCTGTGGTCC CTGTATGAAA ACCCGCGGTA A
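
The GC content reported for this gene can be recomputed from the gene sequence. The sketch below shows one way to do that, assuming GC content is defined as the percentage of G and C among the A, C, G, and T bases; the record does not state which definition was used. The helper name `gc_content` is hypothetical. Spaces and line breaks in the 10-base blocks are ignored.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are skipped, so the blocked
    layout used in this record can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, as an example input.
first_line = "GTGCGCGTCG GCGTAGTCAC CAGCTCGTAT CCCCGCTGGC CCGGCGACCC GGCCGGAAAT"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence instead of the first line gives a value that can be compared against the GC content field of the record.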
 
Protein sequence
MRVGVVTSSY PRWPGDPAGN FVAAHAGWLR DAGHAVEVVC AGEPGARARW QEGVRVLPVA 
ARPGLFYAGG APEALSMSRS RPRPAMAAAA LAFSLSLRRA LAERAHYWDA VFAHWLLPSA
AAAVLALPRS RRAVAIAHSG DVHLARALAL CTPLAAAMHA RGDRVCFVSE HVRARFLAGV
WPRGLRRALR ARSLVRPMGV SLARWQAARA RADALRVGHG DGAYRDERAR VVFLGRLVPI
KGVAVLLEAC AQFARAGFAL DLLVAGDGPL RAQLAARAET LRASLPPGAA ALSVEFAGEL
QGTRLGDAVA AADLLVLPSL PVAGGRSEGA PVTALEAMAA GTAVLASRTG GLAELPEDAA
TLVPAGDVDA LAQALRRLLR DRAGRAAQVR RARAWVRQHD WAEVGAALWS LYENPR
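
The gene sequence begins with GTG, yet the protein sequence begins with M. This is consistent with the record's translation table 11, in which GTG can act as a start codon. The following is a minimal sketch of such a translation, not the pipeline that produced this record. It assumes the codon-to-amino-acid assignments of the standard genetic code (which table 11 shares) and the table 11 start codon set as commonly listed by NCBI; the function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed start codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M.

    Whitespace is removed first; translation stops at the first stop codon.
    """
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCGCGTCG GCGTAGTCAC CAGCTCGTAT"))  # MRVGVVTSSY
```

The printed peptide matches the first ten residues of the protein sequence in this record.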