Gene Hoch_5565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5565 
Symbol 
ID8547979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7638725 
End bp7641157 
Gene Length2433 bp 
Protein Length810 aa 
Translation table11 
GC content70% 
IMG OID646390238 
Productglycosyl transferase group 1 
Protein accessionYP_003269940 
Protein GI262198731 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.896864 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCG AAGCCATCGA AGTGCGGGTG GATCTGCACG TACACTCGTC GTACTCGGAC 
ACGCCGCAGA ACTGGTTTCT GCGCACCGGA GGGGTAGCCG AGAGCTACAC CTCGCCGCAG
ACCGTGTACG AGACCGCGAT GCGCCGGGGC ATGTCCCTGG TCACGCTCAC CGATCACAAC
ACCATCGCCG GCGCGCTCGA GCTGGCGGCG AAGTACGACA ACGTCTTCCT CAGCGAGGAG
ATCACGGCGC GCTTCCCCGA AGATGGTTGC ATCGTGCGCG TCGCCGCCCT GGACATCAGC
GAGGCGCAGC ACGAGGACAT CGCGCGCCTG CGCGACAATG TCTACGATCT GGCCGCGTAC
CTGGCGCAGG AGAGCATCGC GGCGGTGTGG TGCCACCCGC TGTCCGACGT CAACGCGCGG
CTCTCGCGCA GCCACCTCGA GCGCTGTTTC CTGATGTTCC GCGCGCTGGA GCTGCGCAAC
GGCAGCCGCG ACGCCAGCCA CGAGCAGCAC CTGGCCGAGC TGGTGACCGC GCTCTCGCCC
GCGCATCTGG CGCGTTTCGC CGAGGCCCAT CCGCAGACGC CGGCGATCAA CCTCGAGGGT
CGCTACGCCT TCACCGGCGG CAGCGACGAC TACGGCGGCT TGGCCATCGC GCGCGCCTAC
ACCGCGTTCC ACGGCAGCCC CACCGGCGCC AGCGCGGCCT CGGCCGTGCG CTCGCTGACC
TGCGCGCCGG CCGGCGAGTG CGGCGACGCG CCGACCATAG CGCACAACGC CTTCAGCATC
GCGTCGGGCC ACGTGCAGAA GCAGATGGCC CAGAACCCGG CGCAGATGGC CACACCGGCC
AACATCTCGA TCATCACCGA GCTGATGAAG CGCAAGAGCC TGTTCGAGCA GGGCGGCGGT
CGCCTCGACT TCGACGAGAT GAAGGAGAAA GGCCACCAGT CCTCGTTCCA GGACATGCTG
GTGCGCATGG CCGAGCCGGC GCTGGTGCAC GGCTGGCGCG AGGTCCTCGG CGGCTTCTTC
GGCGCGGCCT TTGGCGGGCG CTTTGGCGAG GCCGCCGACG CGGTGTCGCA GGTGCTCAAG
GCGTCGATGT TCGATGTCCC GTACATCATG GCCTATCACG GCTTTGCCCA GGATCGGCTG
GCGGCCGAGC GCCTGTATCG CGACCTGGCC GCGCAGCTCG CCGACGCCAA GGGCACAGAG
GCGGACGGCA GCAAGGACAA GGCCGGGGCC ACCGGCAGCG CGCCGCGCAA CATGCGCGTG
GCCGTCATCT CCGACACCCT CGATCACGTC AACGGCGTGG CCCTGGGGCT GCGCCGCTTG
CACGCGCAGG CCCAGCGCTC GGGCCTCGAG CTCGACCTGG TCGCGGTCGG CGGCTGCGAC
GAGATGTGCG TGGACGCCGA CGGCTTGCAC CGCATCCCGA GCATCTACTC GCACACGCTG
GAAGAGTATC CCGAGCTGCC CTGGAACGTG CCGCACCTGC CCTCGCTGCT GCGCTATCTG
GTCGAGCGCC GCATCGACAT GCTGCAGTGC TCGACCCCGG GCCCGGCCGG TATCGCCGGC
CTCATCGCCG CGCGCCTGCT CGGCATCCCG GTGGTCGGCC AGTACCACAC CGATGTCCCC
GAGTACACGA TGCGCCTGAT GGGCGACCCC ATGCTCGCCG GCGTGGTCCG CATCATCACC
TCGTGGTTCT ACCGCACGGT CGACCGCGCG CTGGTGCCCT CGCAGTGGGT CGCGCGCCTC
ATCAATGATA TGGGCGTGCC GGCCGAACGC ATCACGCGCA TCCCGCGCGG CATCGATCTC
GACCTGTTCC GCCAGGCCGC GCGCGACGAG CACGCGTTTG AAGAGTACGG CCTCAACGGC
GAGCCCAAAG TGCTCTATGT CGGCCGGGTG TCCAAAGAAA AGGGCCTGTC GCATCTGGCC
GCGGGTTTCC GCCGGCTCAG CTCCGAGCTG CCGGGCGCGC GCCTGGTGGT CATCGGCGAC
GGCCCCTACG CCGACGAGCT GGCCACGCAG ATGCCGGCCG ACAAGGTGAT CTTCACCGGC
CCGGTCACCG GCGAGAAGCT GGCCCGCCTG TACGCGTCCA GCGATGTCTT CGCCTTCCCC
AGCGAGACCG AGACCTTCGG CAACGTGGTC GTCGAGGCCC AGGCCACCGG CCTGCCCGTG
GTGGTCGCCG ACCGCGGCGC CGCGCGCGAG AACATGCGCG AGGGCGTCAC CGGCATGGTC
GTTGATCCCC GCGATCCCGA GGCCTGGTGC AGCACGCTCA AGCGCCTGCT CGAGGACAGC
GCGCTGCGCA AGCAGATGAG CTCGGCGGCG CAGGAGTTCG CCCAGCGCTA CCGCATGGAC
GCGGCCGCGC ACGGCACCTT CGAGGAGTAC GCGCGCATCC TCGACGAGCT GCGCGCCGGC
CAGCCCGCGG CGCCGACCAG CGCCGCCGAC TGA
 
Protein sequence
MKTEAIEVRV DLHVHSSYSD TPQNWFLRTG GVAESYTSPQ TVYETAMRRG MSLVTLTDHN 
TIAGALELAA KYDNVFLSEE ITARFPEDGC IVRVAALDIS EAQHEDIARL RDNVYDLAAY
LAQESIAAVW CHPLSDVNAR LSRSHLERCF LMFRALELRN GSRDASHEQH LAELVTALSP
AHLARFAEAH PQTPAINLEG RYAFTGGSDD YGGLAIARAY TAFHGSPTGA SAASAVRSLT
CAPAGECGDA PTIAHNAFSI ASGHVQKQMA QNPAQMATPA NISIITELMK RKSLFEQGGG
RLDFDEMKEK GHQSSFQDML VRMAEPALVH GWREVLGGFF GAAFGGRFGE AADAVSQVLK
ASMFDVPYIM AYHGFAQDRL AAERLYRDLA AQLADAKGTE ADGSKDKAGA TGSAPRNMRV
AVISDTLDHV NGVALGLRRL HAQAQRSGLE LDLVAVGGCD EMCVDADGLH RIPSIYSHTL
EEYPELPWNV PHLPSLLRYL VERRIDMLQC STPGPAGIAG LIAARLLGIP VVGQYHTDVP
EYTMRLMGDP MLAGVVRIIT SWFYRTVDRA LVPSQWVARL INDMGVPAER ITRIPRGIDL
DLFRQAARDE HAFEEYGLNG EPKVLYVGRV SKEKGLSHLA AGFRRLSSEL PGARLVVIGD
GPYADELATQ MPADKVIFTG PVTGEKLARL YASSDVFAFP SETETFGNVV VEAQATGLPV
VVADRGAARE NMREGVTGMV VDPRDPEAWC STLKRLLEDS ALRKQMSSAA QEFAQRYRMD
AAAHGTFEEY ARILDELRAG QPAAPTSAAD