Gene Hoch_1599 details

Gene Information

Locus tag: Hoch_1599
Symbol:
ID: 8543981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2189210
End bp: 2190346
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 70%
IMG OID: 646386307
Product: polyA polymerase related protein
Protein accession: YP_003266042
Protein GI: 262194833
COG category: [R] General function prediction only
COG ID: [COG4639] Predicted kinase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
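
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count of a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2189210, 2190346)
print(length_bp)                  # 1137
print(protein_length(length_bp))  # 378
```

Under these assumptions the values reproduce the listed Gene Length (1137 bp) and Protein Length (378 aa).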


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.29317
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGTGC TCGCCCACTG CCCCGCGCCC CCGCACTGGC GGCTCGACTG GCGCGCGCTG 
CGCGACGCTT ACCCCTGGGT CGATGCGCTG CACGCGTGCC CGCAGGATCC CGGCTTTCAC
GCCGAGGGCG ACGTCGGCAT CCACACCGAG ATGGCGTGCC AGGCGCTGGC CGCCTCGGCC
GCATTTCGCG CCCTCCCCGC GGAAGAACGC GCGATCGTGT TCGCCGCCGT GCTCTTGCAC
GACGTCGCCA AACCCGCGTG CACCAAACAC GAGGACGACG GCCGCATCAG CTCGCGCGGC
CACAGCGGCC GCGGCGATAT CCTGGCGCGG CGCATCCTGT GGCGGCAGGG CGTGCCCTTT
GCCACCCGCG AGGCCATCTG CGGGCTCATC CGTCATCACC AGGTGCCGTT TTTCCTGGTC
GATCGCGAGG ACTCTCGCAA GCTCGCCTAT CGCGTCAGCC ACATGGCGCG CTGCGATCAC
CTCGCGCTGG TGGCCTGGGC CGACGGCTTC GGCCGACGCT GCGCCGACGA CGCCGACCAG
CGCCGCATCC TCGACAACGT CGAGCTGTTC CGCGAGTACT GCGACGAACA GGGCTGCCTC
GCGCAGCCGC GGCGCTTCGC CTCAGACCAC TCGCGCTTTC TCTACTTCCA CAAGGACAGT
CGCGACCCCG ACTACCACGC ACACGATGAC ACCGGCTGCC AGGTGACGCT GATGTCGGGC
CTCCCCGGCG CCGGCAAAGA CCACTGGATC CGCCACGCTG CCGGCGATCT GCCCGTGGTC
TCACTCGACG CCATCCGCCT CGAACGCGGC ATCGACCCGG CCGCACCGCA GGGCCGCGTC
ATCGACGAGG GCCGGCAGCG CGCCAAAGAA TACCTGCGCC GCCAGCAGTC CTTTGTGTGG
AACGCGACCA ACCTCAGCCA GCAGATCCGC GACCAGCTCA TCGCGCTGTT CAACGACTAC
GGCGCCCGCG TGCGCATCGT CTACGTCGAG GCGTCCGAAA CCCACATCCG CAGCCGCAAC
CGCGCCCGCG AGAGCCCCGT GCCGTCGCGG GTCATCGACA AGCTGCTCGA GCGCTGGACC
GTGCCCACCA CCGTCGAGGC CCACGAGATC GTCTGTCACG TGGACGCCGA CCAATAG
 
Protein sequence
MSVLAHCPAP PHWRLDWRAL RDAYPWVDAL HACPQDPGFH AEGDVGIHTE MACQALAASA 
AFRALPAEER AIVFAAVLLH DVAKPACTKH EDDGRISSRG HSGRGDILAR RILWRQGVPF
ATREAICGLI RHHQVPFFLV DREDSRKLAY RVSHMARCDH LALVAWADGF GRRCADDADQ
RRILDNVELF REYCDEQGCL AQPRRFASDH SRFLYFHKDS RDPDYHAHDD TGCQVTLMSG
LPGAGKDHWI RHAAGDLPVV SLDAIRLERG IDPAAPQGRV IDEGRQRAKE YLRRQQSFVW
NATNLSQQIR DQLIALFNDY GARVRIVYVE ASETHIRSRN RARESPVPSR VIDKLLERWT
VPTTVEAHEI VCHVDADQ
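
The gene sequence starts with GTG while the protein starts with M. This is expected under translation table 11, where GTG is one of the permitted initiation codons and is read as methionine at the start position. The following is a minimal sketch of how the protein sequence and GC content could be recomputed from the gene sequence. The helper names are hypothetical, and the start-codon set is a simplifying assumption covering only the most common initiators.

```python
from itertools import product

# Codon table for translation table 11, in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the most common bacterial initiation codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # An initiation codon at position 0 is read as methionine.
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene sequence above.
print(translate_cds("GTGAGCGTGC TCGCCCACTG"))  # MSVLAH
```

Passing the full gene sequence to `translate_cds` and `gc_percent` should allow a comparison against the listed protein sequence and GC content fields.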