Gene Hoch_0024 details

Gene Information

Locus tag: Hoch_0024
Symbol:
ID: 8542394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 33933
End bp: 35303
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 65%
IMG OID: 646384812
Product: O-antigen polymerase
Protein accession: YP_003264559
Protein GI: 262193350
COG category:
COG ID:
TIGRFAM ID:
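
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(33933, 35303)
print(length, protein_length(length))  # prints: 1371 456
```

With the values listed for Hoch_0024, this gives 1371 bp and 456 aa, matching the Gene Length and Protein Length fields.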


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCACGC TCCCCGGCCT ATCCGCCCTC GTCACCTTCA TCTATCTGCG GCCGCAGGAA 
TTCGTCCTGA TCCTGCAGAA GCTGCCGCTG CTCTACATCT TCTTTGCGGC GGCCCTGGGC
GGCCTGGTCA TCGACCTCAA GCTGCGGCTC ATCAAGCCCA TCCCGGCGCG CACGCTGCTG
ATCGCGACGC TGTTCTTCGG CTGGATTCTG CTCAACATCG GCGTCAAGGT CCCGGCCGCG
TCCAAGGTCT CGGAGATCAT CACCTTCACG ATCGTCTTCG TCACCTATGT GCTCATCGCC
CAGGGCATAC AGAGTTTCCG CGCCCTGCGG GTGATGGCCT GGGTGCTGCT GTTTAGCTTC
CTGTTCCTCA CCTACACCGG CATCCGCCAG GCCCACGGGC CCATGGGATG TCTGATGATC
GAGAACTACA TGGCCCGGCT CGGCACGCCC GACGGCCGCT CGTGCGAGAC CGCGCTCGAC
TGCGGCGGCG TCGAGGCCGA ACCCGGCGAG GCCTACGACT GCCAGCGCAT CGGCCCCTTC
GGCACCACGG CGCTCAACGA GCGCGTGCGC TATCGCGGCT CGCTGCAGGA CCCCAACGAG
CTGTCCACCG TGATCTGCGG CGGCGTCGCC CTGATCATCG GTCTGGTGGC GATGTCGCGC
AAGCTGCGCT GGCGGGTGCT GGCCGTCATC GGCGCCGTAG CCATATTCAT GTGCGTGATC
TACACGCAGT CGCGCGGCGG CATGCTGGCC TACATGGCCG TCATCGGCGT GTACTTCATA
CGAAGATACG GCGTGAAATA CGCGGTCATC GGCGTGCTGT GCATGCTGCC GCTCATGGCC
CTGGGCGGGC GCGGCGGCGA GGCCGCGTCG GCGTCCACCG AGGGCCGCTA CGAGGCCTGG
CGCTCGGGCC TCGACATGCT GAAGATGGAC CCGGTGTTCG GCGTGGGCAA GGGCATGTTC
ACCGAGTACC ACCACCTCAC CGCCCACAAC TCGCACGTGC TCACCTTCGC CGAGCTCGGC
CTGCTCGGCA TGTTCCTGTG GGTGAGCACG CTGTACCTGG CGTTCAAGCC CTCGGTCGTC
GCGCTGCGCG ACTTCGCCGA CGATCCCAGC GCCGGCTCGG TGCGCACCTG GGCGCTGGCG
GTGCTGGCCA TGGGCATGCC GCTCATCGTG CAGATGATGT TCCTGTCGCT CACCTACCAC
TTCATGACCT GGGTGTGGGT GGGCATGACC GGCGGCTTCT ACTCGGCGGT CAAATCGCAC
GTGCCCGACT GGGAAGTGAA GTTCGGGCTC TTCGACATCT TCATCATCGC CGGCATCGCC
TACGGCTTCA TCGCGGTCCT GCCCTTCTTC CTGCTGCTCA AGGGTTTCTG A
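
The GC content field reported for this gene can be recomputed from a sequence laid out in blocks of ten as above. The following is a minimal sketch, assuming the sequence contains only A, C, G and T plus whitespace; `gc_percent` is a hypothetical helper name, not part of any database tooling.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()  # remove the spaces and line breaks between blocks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Example on the first row of the gene sequence only; pass the full
# sequence text to obtain the figure for the whole gene.
first_row = "ATGTTCACGC TCCCCGGCCT ATCCGCCCTC GTCACCTTCA TCTATCTGCG GCCGCAGGAA"
print(round(gc_percent(first_row), 1))
```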
 
Protein sequence
MFTLPGLSAL VTFIYLRPQE FVLILQKLPL LYIFFAAALG GLVIDLKLRL IKPIPARTLL 
IATLFFGWIL LNIGVKVPAA SKVSEIITFT IVFVTYVLIA QGIQSFRALR VMAWVLLFSF
LFLTYTGIRQ AHGPMGCLMI ENYMARLGTP DGRSCETALD CGGVEAEPGE AYDCQRIGPF
GTTALNERVR YRGSLQDPNE LSTVICGGVA LIIGLVAMSR KLRWRVLAVI GAVAIFMCVI
YTQSRGGMLA YMAVIGVYFI RRYGVKYAVI GVLCMLPLMA LGGRGGEAAS ASTEGRYEAW
RSGLDMLKMD PVFGVGKGMF TEYHHLTAHN SHVLTFAELG LLGMFLWVST LYLAFKPSVV
ALRDFADDPS AGSVRTWALA VLAMGMPLIV QMMFLSLTYH FMTWVWVGMT GGFYSAVKSH
VPDWEVKFGL FDIFIIAGIA YGFIAVLPFF LLLKGF
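
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table from the standard NCBI base ordering (T, C, A, G) and the amino acid string shared by tables 1 and 11. It is a simplified illustration: it assumes an unambiguous A/C/G/T sequence in frame, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here).

```python
from itertools import product

BASES = "TCAG"  # NCBI ordering; the first base varies slowest
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame CDS, ignoring whitespace; a trailing stop is dropped."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First seven codons of the gene sequence.
print(translate("ATGTTCACGC TCCCCGGCCT A"))  # prints: MFTLPGL
```

Translating the full gene sequence this way should reproduce the protein sequence listed above, since the gene is unspliced and ends in a TGA stop codon.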