Gene Hoch_6500 details

Gene Information

Locus tag: Hoch_6500
Symbol:
ID: 8548917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 8929274
End bp: 8931658
Gene Length: 2385 bp
Protein Length: 794 aa
Translation table: 11
GC content: 77%
IMG OID: 646391163
Product: O-antigen polymerase
Protein accession: YP_003270862
Protein GI: 262199653
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
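
The coordinates and lengths in this record agree with one another, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention, so both are assumptions. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 8929274
end_bp = 8931658

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)
```

Under these assumptions the span works out to 2385 bp, or 795 codons, leaving 794 residues once the stop codon is excluded, matching the listed gene and protein lengths.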


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCCGTC GCGACCTGGT CGCGCTGCTG GCGATCGCGC TGAGCCTGAG CCTGAGCCTG 
GCCGCGCTCG GTGGCGTGCC GCGCTGGGCC GCGTGCACGG GCGCGGCCCT GGGTCTGGCC
TGCGCGCTGC CCTACCTCGG CGCCCGCCGC CGCATCGATG CGCGCGCGCC GCTGGTGCTC
GCGGCCGGTT TCGCCGCCCT GGCCACGGCC GTCCAGTGCC TGCCGCTGCC GGCCGGCCTG
GTCGCCTGGC TGTCGCCCGC GCGCCACGCC GCCAGCGCCG AGGTCGCGCG TCTGCTCGGC
GAGCCGCTGC CGGGTTTCCT GGCGCTGTCG CACGAACCCG CCGCCACCCT GGTGCACCTG
GCCAGCGCCG CCGGCTTGCT GGCGCTGGCG TACGCGGCCG TGCGCGTGGC CGGCCGGCGC
AGCGGCGCGC TGTGGCTGAT GCGCATCGCG GCCGGCTGCG GTGCGGCCAT GGCGCTGTGC
GCGCTGATCC ATGGCTTCTT CGGCGTCGAC GCGCTGTTCG GCGTGTACCG CCCCAGCCAG
GCCACGCCCG CGTTCCCGGC GCCGCTGCTC AACGACAACC ACCTGGCCGG CTTCCTGTCG
CTGTGCGCGC CGCTGGCCCT GGCCCTGGCG GTCGCGCGTC CCGGCGCCGA ACGCGCCGGC
TGGCTGCTGG CCGTGCTCGC CATCGCCGCC ACCAACCTGC TCGTGGCCTC GCGCATCGGC
GCGCTGTCGC TGTGCGCCGG GCTGCTGGTC GCGGGCGCGC TGCTGGTGGC CGAGGCGCAG
CGGCGACCGG GCCGCCGCAT CGAGCGCGCC GTGCTCATCC CGGGCGCCAT CATCGCCGCG
TGCGCGGCCG TCCTGGTGGT GAGCAGCGCG GGCGCGCGCG TGGGCCAGGA GCTGGCCGAG
ACCACGCGCG CCGAGCTCGA CGACCCGCGC AGTAAGTTCC AGGTGTGGCA GCGCTCCATC
GACCTGGTGT CCGAACACCT GTGGACCGGC GTCGGCAGCG GCGGCTTCGA GCCCACCTTC
ACCCGCCTCG ACGGCACCAG CATCAAGACC TACTCGCATC TCGAGAATGC CTACCTGCAG
ATCCTGGTCG ATTTCGGCGC CCCGCTCGCG GCCGGTTTCG CGATCCTGGG CCTGTGGCTG
GCCCTGGCCG CGCTGCAACG CCGCCGCCAG GATCCCCTGT GCGCCGGCGC GCTGGCCGGA
TGCACGGCCA CGGCCGTGCA CGCCTGCGGC GATTTCCACC TGGCGCTGCC GGGCGTGGCC
GCCACCCTGG TGATCGTGCT GGCCGTGCTC GTGCCCGCGC GCCTGCGCTC GCCGGCGCCG
CGGCGCCGCG TGCTGACGCC GCGCATCGCC GGCCTGCTGC TGGGCGCGGG CGCGGTGGCG
CTGGCCGCGT CGCCGCTCGG CCAGGGCGCG CGCGCCGCCG AGGACCGCGT CCGTGCCGAA
CTCGATCGCT CGTCCGAATT ACGCGCGCGC GGCGAGCGCG AGGGTGCGGC CGCGGCCGAG
ACCCGCGCGC TGCGCATCGG CCGCGCCCAG GTCGCCCGCC ACCCCAGCGA CTACCTGCTC
GCCGGCCTGC TCGCCGAGGC CCACTTCCGC CGCCGCGACC CGGCCTCGGT GGCCTGGATC
AACCGCGCTC TGCAGCTCAA CCCCAGCCAC GCCGGGCTAC ACGTGCTGGC CGCGCGCATG
CTGCTCACCG CCGGACACCG CGACCAGGCC CTGCTCGAGT ACGCGGCCGC GCTGCGCCAC
ACGCTCACGC CGCGGCCACT GCTGGGCGAC CTGGTGCGCT GGTTCGCCGA CCCGAGCGAG
GCCGCGCGCG GCATCCCGGC CAGTCGCGAG CGGCTGCCCA TTCTCAGCTC GTGGCTGCTG
GCCATGGAGC GCGGCGACGT CGCCCTGGCC TACGCGCGCC GCGTCTACGC CGAACACGCG
GACGATCACG AGGTGCAGCG CGTCGTCGCC GATCTGGCCT GGAGGCAGGG TGACCATGTC
CTGGCCCGGA GCGCGGCCGA GCCCGCCTAC GCGGCCACCG GCGCGCTCGC CGAAGCTCTG
ATCCTCGGCC AGGCGCTGCA CGCCCAGGGA CGCGAGGACG AGGCCGCCCG CGTGCTCGTC
GAGGCCATCG CCGCGCGCCG CTACGACGAG CTGTGGCAGC TCACGCGCGC GCACCAGGTC
CTCGCCGAAG TCGAAGACGC CCGCGGCGAC CCTCTGGCCG CGCGCACGCA CCTGCGCACC
GCCATCCGCC TGGCGCCAGC CCACGCCGGG CCCGCGGTGC GCGCCGACCT GCAGCGCCGG
CTGGCGCGTC TCGAGGAGCG CCTGGGCAAT CGCAGCGCCG CGGCCCAGGC GCGCGCCCTG
GCGGCGGAAC TCGACGCGCT GTCCGAAGCC GTACGCGAGC CCTGA
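
The gene sequence above starts with GTG rather than ATG and ends with TGA. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon that is read as methionine when it initiates translation, and TGA is a stop codon. The sketch below checks both ends and provides a helper for the GC fraction of a sequence written in space-separated blocks of ten. The helper `gc_fraction` and the codon sets are illustrative names, and the start set lists only the three most common table 11 initiators, not the complete list.

```python
# Illustrative subset of table 11 start codons (the full table allows more).
COMMON_TABLE11_STARTS = {"ATG", "GTG", "TTG"}
# Stop codons in translation table 11.
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and newlines."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First and last blocks copied from the gene sequence above.
first_block = "GTGCGCCGTC"
last_block = "CCTGA"

start_codon = first_block[:3]
stop_codon = last_block[-3:]

print(start_codon in COMMON_TABLE11_STARTS, stop_codon in TABLE11_STOPS)
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content listed in the Gene Information section.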
 
Protein sequence
MRRRDLVALL AIALSLSLSL AALGGVPRWA ACTGAALGLA CALPYLGARR RIDARAPLVL 
AAGFAALATA VQCLPLPAGL VAWLSPARHA ASAEVARLLG EPLPGFLALS HEPAATLVHL
ASAAGLLALA YAAVRVAGRR SGALWLMRIA AGCGAAMALC ALIHGFFGVD ALFGVYRPSQ
ATPAFPAPLL NDNHLAGFLS LCAPLALALA VARPGAERAG WLLAVLAIAA TNLLVASRIG
ALSLCAGLLV AGALLVAEAQ RRPGRRIERA VLIPGAIIAA CAAVLVVSSA GARVGQELAE
TTRAELDDPR SKFQVWQRSI DLVSEHLWTG VGSGGFEPTF TRLDGTSIKT YSHLENAYLQ
ILVDFGAPLA AGFAILGLWL ALAALQRRRQ DPLCAGALAG CTATAVHACG DFHLALPGVA
ATLVIVLAVL VPARLRSPAP RRRVLTPRIA GLLLGAGAVA LAASPLGQGA RAAEDRVRAE
LDRSSELRAR GEREGAAAAE TRALRIGRAQ VARHPSDYLL AGLLAEAHFR RRDPASVAWI
NRALQLNPSH AGLHVLAARM LLTAGHRDQA LLEYAAALRH TLTPRPLLGD LVRWFADPSE
AARGIPASRE RLPILSSWLL AMERGDVALA YARRVYAEHA DDHEVQRVVA DLAWRQGDHV
LARSAAEPAY AATGALAEAL ILGQALHAQG REDEAARVLV EAIAARRYDE LWQLTRAHQV
LAEVEDARGD PLAARTHLRT AIRLAPAHAG PAVRADLQRR LARLEERLGN RSAAAQARAL
AAELDALSEA VREP