Gene Hhal_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1041 
Symbol 
ID4709793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1124353 
End bp1125537 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content70% 
IMG OID639855512 
Productphosphoglycerate kinase 
Protein accessionYP_001002619 
Protein GI121997832 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0126] 3-phosphoglycerate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAGCGA AGCGCATGAC CGATCTGGAC CTGACGGGCA AGCGGGTGCT CATCCGCGAG 
GATCTGAACG TGCCGATCAA GGACGGCCAG GTTGCCGACG ACACCCGCGT GCGCGCTGCC
GCCGAGAGCA TCCGCCAGGC CATGCAGGCC GGTGGCCGGG TGCTGGTGAT GTCCCACCTG
GGGCGGCCCA AAGAGGGCGA GTACGACGCC GAGGCCTCGA TGGCCCCGGT GGCCCGCCGC
CTGGGCGAGA TTCTCGGCTG TGAGGTGCCG GTGGTGCGTG ACTGGCTGGA GGGCGTGGAT
GTCCCCGAGG GCGGGGTCGC GCTGGCGGAG AACGTGCGTT TCCAGCCCGG CGAGACCAAG
GACGACGAGG CCCTGTCCCG GCGTATGGCA GCGCTCTGTG ACGTCTTCGT CATGGACGCC
TTCGGCACTG CGCACCGGGC CCAGGCCTCC ACCCACGGTG TGGCCCGCTT CGCCCCCGAG
GCCTGTGCAG GCCCGCTGCT CAGTGCCGAG CTGGAGGCCT TGGGCAAGGC TCTGGATAAC
CCGGCCCGGC CGATGATCGC CATCGTTGGC GGTTCCAAGG TCTCCGGCAA GGTCCAGGTC
CTGGAGGCGC TCACCCACAA AGTCGATCAG CTCATCGTCG GCGGCGGGAT TGCGAACACC
TTCATCGCCG CGGCGGGCTA CTCTGTGGGC AAGTCCCTCT ACGAGGCCGA CTTCGTCGAC
ACCGCCAAGC GTCTGATGGA AGAGGCGCGG GCCAAGGGCG GCGAGATCCC GATCCCCGAG
GACGTGGTCA CGGCCAGGGA TTTCTCGGCG GACGCCGAGG CCCACGTCCA TCCGGTGGAC
GCCGTGCCCG ACGACGAGAT GATCCTCGAC GTTGGGCCGC AGACCCGGGC CCGCTACGAC
GGCATGTTGC GCAACGCCGG TACGGTGGTC TGGAACGGGC CGGTGGGGGT CTTCGAGATG
GCGCCCTTTG CCGGCGGCAC CCGGGCGCTG GCCGAGGCCA TTGCCGCCAG TGACGGTTTC
TCCATTGCTG GTGGCGGGGA CACGCTGGCC GCGGTGGAGC AGTTCGGCAT CACCGACCAG
GTCTCGTACA TCTCCACCGG CGGTGGCGCC TTCCTGGAGT TCCTCGAGGG GCGCGTTCTG
CCCGGCGTTG CGGCCCTGGA GCAGCACGCG GCAGCGCACT CGTGA
 
Protein sequence
MKAKRMTDLD LTGKRVLIRE DLNVPIKDGQ VADDTRVRAA AESIRQAMQA GGRVLVMSHL 
GRPKEGEYDA EASMAPVARR LGEILGCEVP VVRDWLEGVD VPEGGVALAE NVRFQPGETK
DDEALSRRMA ALCDVFVMDA FGTAHRAQAS THGVARFAPE ACAGPLLSAE LEALGKALDN
PARPMIAIVG GSKVSGKVQV LEALTHKVDQ LIVGGGIANT FIAAAGYSVG KSLYEADFVD
TAKRLMEEAR AKGGEIPIPE DVVTARDFSA DAEAHVHPVD AVPDDEMILD VGPQTRARYD
GMLRNAGTVV WNGPVGVFEM APFAGGTRAL AEAIAASDGF SIAGGGDTLA AVEQFGITDQ
VSYISTGGGA FLEFLEGRVL PGVAALEQHA AAHS