Gene Hoch_5614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5614 
Symbol 
ID8548028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7708575 
End bp7710293 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content70% 
IMG OID646390285 
Productthymidylate kinase 
Protein accessionYP_003269987 
Protein GI262198778 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0125] Thymidylate kinase 
TIGRFAM ID[TIGR00041] thymidylate kinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.2019 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGTCG TATTCGAGGG GATTGACGGC AGCGGCAAGA CCACGGTCTC CAACAAGGTG 
GCCAAGGTCT TGCGACGCCG TGGTTTGGCG GTAGAGCACG TGCGCGAAGG CGGAGAGTTC
GCCTCGCCGC TGGTCGGCCG GATGCGCGAG TTCGGCAAGG ATCCGCGCAA CATGGCGATG
GCGCCGCTCA CCGAGCTTCT GTTCTACGTG GCCCGGGACG CGCAGCTTCT GGCCGAGTGC
ATTCAGCCGG CGCTGCGCCG GGGGGGCCTG GTGTTCGCCG ACCGCTATCT GTACTCCTAC
GAGGTGCTCA GCCACCACGG GCGCGGTGTG CCCATGGACC AGGTGCGGCC CATCCTCGAC
GCGGTGTCCG GGGGCGTGTG GCCCGATCTC GTGGTGTATC TCGACGTCGA GCCCACCCTG
GCGCGCGCGC GCCGCAAAGT CGGCAAGCTG ATCAAGAAGT CCAAGGGCGG CAAACCCGGC
GGCGGCAGCC GCAAGGGCCT GCAGGGCGTC GGCACCCAGC ATCGGCTGCG CGCCGGCTAT
CTGGAGTTGG CCGAGCGCGA TCCCGAGCGC TGGCTGGTCG TGGACAACGC CGACGTGCTC
AGCCCGGAGA GTCGCCTCGA CGCCATCGTG CAGCGCATCG CCGGCGCCAT CGAGCGCATC
TGGCAGGGGC AGCGCATCAG CGAGGTCGTG CGCAACGCCG CGGTCGAGCC CGGTCCGCGC
ATCGCGGCCG AGACCCCCGA GCTCGAGGCC GGGCGCGATG CCTTCTTCCA GTTGATCGAG
GCCCGCGCCG CGCGCGAGCT GCCCATCGCC GCCTACCTGC TCGCCGGCAT CGGCGGCGAG
CGCGCCCACG ATCTGCGCGA GAGCTGGGCA CAGAAGTCGC CGCACATGGT CGCCTACGGC
CTGCGCGGGC TGGCCGACGA TCGCGCCTGG GAGCTGCGCG AGCGGCTGCG CGAGCGCACC
CCGTACTTCG TCGCCCGTTC GCTCGACGGC CGCGCCGTGC AGCTCGACCC GCGGGTCGAG
CAGATGCGCC GCGAGTTTGT CGACAGCCAG CCGCAGGCGG TGCTCAGCAC CATCGACAGC
GTCGACACCG AGGCCGCCTG GGAGCTGCGC GAGCGCCTGG CCGAGAGGGC TCTGCCCGAG
GTGCTCACCA CGCTCAAGCG ATTGCCCAGC GATCGCGCCT GGGAGCTGCG TTCGCGCCTG
GAGTCGCAGG TCAGCGATCC CATGGCCATG GCGCCGCTGG TGAGTTCGAT CCGCGGCCTC
GCCGACGATC GCGCGTGGGC CATTCGCGAC CGCTATCTCG ACGTCTTGCC CAGTCAGGTG
CTGGCGTCGC TGAGCGGTGT CGAGGACGCG CGCTCGTGGG TGCTGCGCTC GCGCTTCGCC
AGCCAGGCGC CCAAGATCGT CCTGCGCACC ATCGACGGCA TGGATGACGC GCGCGCCTGG
GCGCTGCGCC GCGTGTACGC GCCGCGCGTC AAAGAGGCCC TCGACTCCAT GGTCGGCCTC
GACAGCGATG TCGCGTGGGC CATCCGCGAC GAGTGTCTTT CTATCTGGCC ATCGACGACG
ACCAAGTCGT TGGGATTGCT GGCCTCGAGC GAACGCGGCA GGGAGATGGT GCGCGAGGCA
CTCGCGCAGC ATCCCGAGGA CATCTCCCTG CTCAAACACG TGACACGTCT GGCCGGTCGC
ACCGCGGGAA GGCTGCCCGG TCTCGAGAAG GTGGGGTGA
 
Protein sequence
MFVVFEGIDG SGKTTVSNKV AKVLRRRGLA VEHVREGGEF ASPLVGRMRE FGKDPRNMAM 
APLTELLFYV ARDAQLLAEC IQPALRRGGL VFADRYLYSY EVLSHHGRGV PMDQVRPILD
AVSGGVWPDL VVYLDVEPTL ARARRKVGKL IKKSKGGKPG GGSRKGLQGV GTQHRLRAGY
LELAERDPER WLVVDNADVL SPESRLDAIV QRIAGAIERI WQGQRISEVV RNAAVEPGPR
IAAETPELEA GRDAFFQLIE ARAARELPIA AYLLAGIGGE RAHDLRESWA QKSPHMVAYG
LRGLADDRAW ELRERLRERT PYFVARSLDG RAVQLDPRVE QMRREFVDSQ PQAVLSTIDS
VDTEAAWELR ERLAERALPE VLTTLKRLPS DRAWELRSRL ESQVSDPMAM APLVSSIRGL
ADDRAWAIRD RYLDVLPSQV LASLSGVEDA RSWVLRSRFA SQAPKIVLRT IDGMDDARAW
ALRRVYAPRV KEALDSMVGL DSDVAWAIRD ECLSIWPSTT TKSLGLLASS ERGREMVREA
LAQHPEDISL LKHVTRLAGR TAGRLPGLEK VG