Gene Hoch_4036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4036 
Symbol 
ID8546437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5540106 
End bp5541884 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content75% 
IMG OID646388713 
Productglycosyl transferase group 1 
Protein accessionYP_003268428 
Protein GI262197219 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.212035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.445852 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCGCA TCCTGATGAC CGGCTTCTTC TGCCTGCCTG CGCCGGATCG CGTAGCGGTG 
CAAATGCACC ACCTGCTGCG CGCGCTGTCG CGGCATCACG AGGTGGATGT GTTGGTCGCC
CGCGCGGACG ATCACGCCTA TGTCGAGCGC AGCGGCGGGG CCTCGATCCT GCGCGTCCCC
ACCCGCGAGG ACAGCATCGA TGGACGCCTG GCTAGCTTCC GGCGGGCGCT GCGTCGCCAG
CTCGGCAGCG CCGACTACGA TCTGGTGCAC TTCCGCGACG GCTGGTGCGG CTCGGTGGTG
GTCGAGCTGC GCCAGCGCTA CGGCTACATC ACTGTGTTCG ACGCGGCGCG CGCGCCGCTC
GCCGGGCCGC CGATCCTCGA CCTCGAGGTG AGCGCGGCGC TGGCGCGCGA CGAGGAGCTG
TGCCTGCAGC AGGCCGACCA CGTGCTGGTG CCGACCGCGC TGGCCCGCGC CCACCTGCTG
GAGCAGCGCG GCGCCGGCGT GCACCTGGTG CCGCCGGGCG TCGATGTCGA TCTCTTCGAC
TGGCTGCCGG CGCGTCCCGG GCCGCCGCTG GTGCTGTACG CGGGCGCGGT CGAGGCCGGC
CGCGGTCTGC GCGTGCTGCT GCGGGCCTTC GCCCGGCTGG CGCCGCATTC GGACGCGCGT
CTGGTCATCG CCGGGCGACC CAGCGGCAAC GCGGCCTCGT CGCTCAACGC GGCGATCGCC
GAGCTGGGCA TCGAGGAGCG CGTGACCCTG GAGCCGGCGG TGGCCAATGA GGACATGCCC
GAGCTCATCG CCCGCGCGGC GGTGTGCGTG GCGCCCTCGG CGGCCGAGGT GTCGGTGCAG
CCCATGGCCC TGTACCCGAC CAAGATCCTC GAGTACATGG CCTGTCGCCG GGCCGTGGTC
GCGGCCCGCC GCGGCGCCGC CAGCCTGCTC ATCGAGGACG GGGTGCACGG TGTCTTGTTC
CGCCCCGGCG ACGCCGAGGA TCTGGCCGAC AAGCTGCTGC TCGTGCTCGA GGACGCGGCG
CTGCGCGAGC GCCTGGCCGC GGCCGGCTAC CGCCGCGTGC GCGACGAGCA CACGGCCAGC
AACACCCGCC GCGCGGTGCG CGCCGCCTAC GCCGGCATGC AGGTCGACAC CAGCGAGTAC
CGCACCCTGA CCCTGAGCAG CGTCGACATC ATCGCCCCGG AGCTGCCGGG CTCGCTGCCC
GAGGGCTCGG TGCGCGTGGT CGAGCTGGCC GGCGGCAACC GGGCGACCAC CGACGAGGAG
AGCGTGTTGC CGCGCGCGTT CGGGCCGGTG GCCAACCTCG ACACGCTCAC CGATGGCCGC
GGGCCGCAGG CGGCGAGCGA CGCCGGCCCG ACGCGCGAGC GCCCCTCGCT GCAGACGCGC
GATACCTACC GCATGCCGGC CTTCGAGGTG GCGGAGCTGG AGCCCGAGGG CGAGGGCGAG
GGCGAGGACG AGGACGAGAT CCGCGCCGAT ACCGACCCGG TGGACGAGGA CAGGGACAGA
GATAGAGACG GGGACGAGGT CGGCGCGCAC GCCGACGTGG TCGGCGCGGT CGCCGCCGCC
ATCGCGTCGC CGACGCCGCC GCCGCCGCCG CCGCCGCCGC GAGCGCGGGC AGCGGCCCGG
AGCCGCGACG CCCGCAGCGA GCGCGACTCG CTCAGCGCTG CGGGCGAGCT CGAGGTGCGG
CCCGTGCGCG TGCCGCTGAG TCCCGACGAG CGCCCGACCA CGCCGAGCAT GCCGGCCATC
CAGATCCCCG ACGACGATCT CGGCGACGAG CCCGCGTGA
 
Protein sequence
MSRILMTGFF CLPAPDRVAV QMHHLLRALS RHHEVDVLVA RADDHAYVER SGGASILRVP 
TREDSIDGRL ASFRRALRRQ LGSADYDLVH FRDGWCGSVV VELRQRYGYI TVFDAARAPL
AGPPILDLEV SAALARDEEL CLQQADHVLV PTALARAHLL EQRGAGVHLV PPGVDVDLFD
WLPARPGPPL VLYAGAVEAG RGLRVLLRAF ARLAPHSDAR LVIAGRPSGN AASSLNAAIA
ELGIEERVTL EPAVANEDMP ELIARAAVCV APSAAEVSVQ PMALYPTKIL EYMACRRAVV
AARRGAASLL IEDGVHGVLF RPGDAEDLAD KLLLVLEDAA LRERLAAAGY RRVRDEHTAS
NTRRAVRAAY AGMQVDTSEY RTLTLSSVDI IAPELPGSLP EGSVRVVELA GGNRATTDEE
SVLPRAFGPV ANLDTLTDGR GPQAASDAGP TRERPSLQTR DTYRMPAFEV AELEPEGEGE
GEDEDEIRAD TDPVDEDRDR DRDGDEVGAH ADVVGAVAAA IASPTPPPPP PPPRARAAAR
SRDARSERDS LSAAGELEVR PVRVPLSPDE RPTTPSMPAI QIPDDDLGDE PA