Gene Dret_1551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1551 
Symbol 
ID8419381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1793814 
End bp1794992 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content64% 
IMG OID645038124 
ProductPhosphoglycerate kinase 
Protein accessionYP_003198413 
Protein GI258405671 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0126] 3-phosphoglycerate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGTA TTGAGGATAT CGAGGTCGGC GGAAAAACAG TCCTGGTCCG GGTGGATTTC 
AACGTCCCGG TGGATGCCCA GCAGCGGATC ACGGACGACA ACCGCATCCG GGCCACGCTT
CCCACGATCC AGGCCTTGGT CGACAAGGGC GCCAAGGTCG TCCTCATGTC CCACATGGGC
AAGCCCAAGG GCAAACGGGT TCCGGAACTC TCCCTGGCCC CGGCGGCGCA GCGCCTCGGT
GAGCTGCTCG GCCAGGATGT CGCCCTGGCC CCGGACTGCA TCGGCGACGA GGTCGCGCCG
CTGGTCGACG GCCTGGCCCC GGGCGGCGTG CTCCTGCTGG AAAACCTGCG CTTTCACGAC
GGGGAGACCA CAAACGATCC CGAGTTCAGC CGCGAACTCG CCCGCTGGGG CGAGATCTAC
GTCGATGACG CGTTCGGTGT CGCCCACCGC GCCCACGCCT CGGTGGTCGG CGTCACCGAG
CACATCGACA CCTGCGTGGC CGGTCTGCTG TTGAAAAAAG AAGTCGACTA TCTCGGCACG
GCCCTGGACA ACCCGGCGCG TCCCTTTGTC TGCATCGTCG GCGGGGCCAA AGTCTCCTCC
AAGCTCGGCA TCCTGGAAAA CCTCATGGGC CGCGTGGACC GCTTCATCGT CGGCGGGGCG
ATGGCCAACA CCTTTTTGAA GGCCCAGGGC TATAACGTCG GCGCTTCTCT GGTCGAGGAC
GATCTGCTGG ACACGGCCCG GGACATCATG GAGCGCGCCA AGGCAGCGGG CGTGAGTTTC
TACCTCCCGG TGGACGGGAT CCTGGGCACT GGACCGCAGG GCAAACTGGC CAGCGGGGTT
TGCCCCTTCC AGGACATCCC CGATGGGGAA ATGGTCCTCG ATATCGGACC GGCCACGCAC
ACCCTGTTCG CCGAAGTCCT CAAGGACGCC AAAACCGTGG TCTGGAACGG TCCGATGGGC
GCGTTCGAGA ACCAGGCCTT TTCCCAGGGC TCGGTCGGGT TGACCCATTT CGTGGCCGGA
TTGGAGGCGA TGACCATCGT CGGCGGTGGC GATACCGACG CCCTGGTCCA TCTGTGCAAG
ATGACCCACA AATTCAGCTT TATTTCCACC GGTGGCGGCT CCTTCCTTGA ATTCATGGAG
GGCAAGAAAC TGCCGGCGCT GCAGGTACTG CAGGGCTGA
 
Protein sequence
MRSIEDIEVG GKTVLVRVDF NVPVDAQQRI TDDNRIRATL PTIQALVDKG AKVVLMSHMG 
KPKGKRVPEL SLAPAAQRLG ELLGQDVALA PDCIGDEVAP LVDGLAPGGV LLLENLRFHD
GETTNDPEFS RELARWGEIY VDDAFGVAHR AHASVVGVTE HIDTCVAGLL LKKEVDYLGT
ALDNPARPFV CIVGGAKVSS KLGILENLMG RVDRFIVGGA MANTFLKAQG YNVGASLVED
DLLDTARDIM ERAKAAGVSF YLPVDGILGT GPQGKLASGV CPFQDIPDGE MVLDIGPATH
TLFAEVLKDA KTVVWNGPMG AFENQAFSQG SVGLTHFVAG LEAMTIVGGG DTDALVHLCK
MTHKFSFIST GGGSFLEFME GKKLPALQVL QG