Gene Hoch_4947 details


Gene Information

Locus tag: Hoch_4947
Symbol:
ID: 8547355
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6821986
End bp: 6823224
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 68%
IMG OID: 646389621
Product: aspartate kinase
Protein accession: YP_003269329
Protein GI: 262198120
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00656] aspartate kinase, monofunctional class; [TIGR00657] aspartate kinase
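
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming inclusive 1-based coordinates and assuming the gene length includes a terminal stop codon (neither convention is stated on this page). The function names `span_length` and `coding_codons` are illustrative, not part of any database API.

```python
# Values copied from the record above.
START_BP = 6821986
END_BP = 6823224
GENE_LENGTH_BP = 1239
PROTEIN_LENGTH_AA = 412


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons print True when the record is internally consistent
    # under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```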


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.599309
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGATCG TCCAGAAGTA CGGCGGTACC TCGGTGGCCG ACATCGACCG CATCCAGGCG 
GTCGCCAAAC GCTGCCTCGA AACCCAACGC GAGGGCCACC AGGTGGCCGT GGTGGTCTCG
GCCATGGCTG GCGAGACCAA TCGCCTGCTC GGGCTCGCCA GCCAGCTCCA CCCCGAACCC
CACGATCGCG AGATCGACGT CATCGTGTCC ACCGGCGAGC AGGTGAGCGT GGGCCTGCTC
GCGCTCGCCA TCCGTTCGCT CGGCGGTCAG GCGCAGTCCT TTCTCGGACA CCAGGTCCAG
ATCGTCACCG ACAGCCGCTA CTCGCGCGCG CGCATCCAGT CGATCGACGC CGAGGCCGTG
CGCGCCTGCT GGGAGGCGGG CCGCATCGCG GTCATCGCCG GCTTTCAAGG CGTCGACGCC
AAGGGCAGCA TCACCACCCT GGGGCGCGGC GGCTCCGACA CCACCGCGGT GGCCATCGCC
GCGGCCATCG ACGCCGACGT CTGCGAGATC TACACCGACG TCGACGGCAT CTACACCGCC
GATCCGCGCC TGGTCTCGAG CGCGCGCAAG GTCGAGCGCA TCGGCTACGA GGCGATGCTC
GAGCTGGCCT CGGTGGGCGC CAAGGTGCTG CAGATCCGCT CGGTCGAGAT GGCCATGAAA
TACGGCGTGC CCATCCACGT GCGCTCGAGC TTCAATCACC AACCCGGCAC CTGGGTCGTG
CCCGAGGAAC AGTCCATGGA ACATGTAGCC GTAGACGGCG TCGCCCTGGT GCGCGACGAG
TCCAAGATCA CGGTCCGCGC GCTGCCCGAC ATCCCGGGCG TGGCCGCGCG CCTGCTGTCG
CCGCTGGCCG ATGCCGGCAT TGTCATCGAC ATCATCGTGC AGAACGCCAG CGCCGACGGC
AGCACCGACA TCTCGTTCAC CGTGGCCCGC TCCGAGCGCG CGCGCGCCAT CGAGCTGCTC
GGCAAAGAGG CCGGCGACCT GTGCACCTCC GAGCGCCTGG CCTACGCCGA CGACGTCGCC
AAGGTCAGCG TGGTCGGCAT CGGCATCCGC TCGCACGCCG GCGTGGCCCG GCGCATGTTC
GAGCTGCTGG CCGGCGAGAA CATCAACATC GAGCTCATCT CCACCAGCGA AATCAAGATC
ACCTGCGTGA TCAACGAGAA GTACGCCGAG CTGGCGCTGC GCGTGTTGCA CATGGGCTTT
GAGCTCGACC TGCCGCCCGA AGATCGGCCG ACGCCTTAG
 
Protein sequence
MLIVQKYGGT SVADIDRIQA VAKRCLETQR EGHQVAVVVS AMAGETNRLL GLASQLHPEP 
HDREIDVIVS TGEQVSVGLL ALAIRSLGGQ AQSFLGHQVQ IVTDSRYSRA RIQSIDAEAV
RACWEAGRIA VIAGFQGVDA KGSITTLGRG GSDTTAVAIA AAIDADVCEI YTDVDGIYTA
DPRLVSSARK VERIGYEAML ELASVGAKVL QIRSVEMAMK YGVPIHVRSS FNHQPGTWVV
PEEQSMEHVA VDGVALVRDE SKITVRALPD IPGVAARLLS PLADAGIVID IIVQNASADG
STDISFTVAR SERARAIELL GKEAGDLCTS ERLAYADDVA KVSVVGIGIR SHAGVARRMF
ELLAGENINI ELISTSEIKI TCVINEKYAE LALRVLHMGF ELDLPPEDRP TP
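
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to estimate GC content for comparison with the "GC content" field. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a blocked sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C characters in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record above.
    first_line = (
        "ATGTTGATCG TCCAGAAGTA CGGCGGTACC TCGGTGGCCG ACATCGACCG CATCCAGGCG"
    )
    seq = join_blocks(first_line)
    print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full gene sequence into `join_blocks` in the same way gives a string whose length and GC fraction can be compared with the "Gene Length" and "GC content" fields.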