Gene Hhal_1040 details


Gene Information

Locus tag: Hhal_1040
Symbol:
ID: 4709610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1122884
End bp: 1124320
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 71%
IMG OID: 639855511
Product: pyruvate kinase
Protein accession: YP_001002618
Protein GI: 121997831
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0469] Pyruvate kinase
TIGRFAM ID: [TIGR01064] pyruvate kinase
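
The length fields in this record follow from the coordinates. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with one stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 1122884
end_bp = 1124320

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the results agree with the Gene Length and Protein Length fields.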


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCCCC GCAGAACCAA GATCCTGGCG ACCCTCGGGC CGGCGACGGA TCGCCCCGAA 
GGGCTGGAGG CGGTCCTCGC CGCCGGGGTG GATGTGGTCC GCATCAACCT CTCCCACGGC
GAGGCCAGCG ACCACCGCCG GCGGGTCGAG GCCGTCCGGA CCTGGGGCGA AGCCCACGGC
CGGCGGGTGG GCGTGCTGGT GGACCTGCAG GGGCCGAAGA TCCGGATTGA GCGCTTTCGC
AATGGCCCGG TCCTCCTGCG CCGCGGGGAC CCCTTCATCC TCGATCCTAC CGTGGATCCC
GATGCCGGGG ACCAGACGCA GGTCGGGGTC GCCTACCGGG CCCTGCCCGA CGACCTGCAC
CCCGGCGACG AGCTGCTGCT CGACGACGGC CGCCTGGTGG TGCGGGTCGA ACGCATCGAG
GGGACCGCGG TGCACACCAG CGTGCAGGTC GGCGGTGAAC TCAGCGATCG CAAGGGCATC
AACCGCCGCG GTGGCGGGCT TTCGGCGCCG GCGCTCACCG ACAAGGACCG GGTCGACATC
GTCACCGCCG CCGAACTGGC CGCCGACTAC GTGGCGGTGT CGTTCCCGCG GTGTGCGGAC
GATATCCACG AGGCCCGCCG GCTGCTGACC GAGGCCGGTG GGCGGTCCGG GATCGTGGCC
AAGATCGAGC GGGCCGAGGC CCTGGACGCC GCCGAGGAGA TCATGGATGC CGCCGACGCC
ATCATGGTCG CCCGCGGCGA TCTCGGCGTG GAGATCGGCG ACGCCGCCCT GCCCGAGGTT
CAGAAGCAGC TCATCCAGAC CGCCCGGGCC CGGAACCGGG TCGCCATCAC GGCGACCCAG
ATGATGGAGT CGATGATCGA CAATCCCATC CCCACCCGGG CCGAGGTCTT CGACGTTGCC
AACGCCGTGC GGGATGGCAC CGACGCCGTG ATGCTCTCGG CAGAGACCGC CACCGGCTCG
TTCCCGGCCG AGACCGTGGC CGCCATGGAC CGCGTCTGCC GGGCCGCCGA GCGCAGCCGT
ACGGTGACGG TCTCCCACCA CCGGCTGGAC GAGCACTTCA CGCAGGTCGA CGAGACGGTG
GCCATGGCGG CGATGTACGC CGCCAACCAC TTCGCCATCA AGGCGCTGAT CGCCATCACC
GAGTCCGGAG GGACGGCCTC GTGGATGTCG CGGATCAGCT CCGGGCTGCC GATCTACGTC
CTCACCCGCC ACCCGGAGAC CCGGGGCCGG GTCACCCTCT ACCGTGGCGT CTACCCGCTG
GAGTTCGATG CCGAGCAGGA GGACCTCAAC GCCCTCAAGA GTCGGCTGCT GGCCCGTCTC
TCCAAGCAAG GGCTGGTCAG CCAGGGGGAC TACGTCCTGG TGACCCACGG CGAGGAGCTC
GGCGCCGCAG GGGGCACGAA CACGTTGCGC ATCGTCTGCG TGGACGATTA TCTCTAG
 
Protein sequence
MMPRRTKILA TLGPATDRPE GLEAVLAAGV DVVRINLSHG EASDHRRRVE AVRTWGEAHG 
RRVGVLVDLQ GPKIRIERFR NGPVLLRRGD PFILDPTVDP DAGDQTQVGV AYRALPDDLH
PGDELLLDDG RLVVRVERIE GTAVHTSVQV GGELSDRKGI NRRGGGLSAP ALTDKDRVDI
VTAAELAADY VAVSFPRCAD DIHEARRLLT EAGGRSGIVA KIERAEALDA AEEIMDAADA
IMVARGDLGV EIGDAALPEV QKQLIQTARA RNRVAITATQ MMESMIDNPI PTRAEVFDVA
NAVRDGTDAV MLSAETATGS FPAETVAAMD RVCRAAERSR TVTVSHHRLD EHFTQVDETV
AMAAMYAANH FAIKALIAIT ESGGTASWMS RISSGLPIYV LTRHPETRGR VTLYRGVYPL
EFDAEQEDLN ALKSRLLARL SKQGLVSQGD YVLVTHGEEL GAAGGTNTLR IVCVDDYL
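
The gene and protein sequences above can be checked against each other with a short script. This is a minimal sketch, not the database's own tooling: `clean`, `translate`, and `gc_percent` are hypothetical helper names. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and it assumes the sequence is given 5' to 3' in the coding frame.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order; '*' marks a stop codon.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence listed above.
first_row = "ATGATGCCCC GCAGAACCAA GATCCTGGCG ACCCTCGGGC CGGCGACGGA TCGCCCCGAA"
print(translate(first_row))  # MMPRRTKILATLGPATDRPE
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare with the Protein sequence block and the GC content field.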