Gene Dret_1930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1930 
Symbol 
ID8419775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2212082 
End bp2213278 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content65% 
IMG OID645038518 
Productphosphonopyruvate decarboxylase-related protein 
Protein accessionYP_003198792 
Protein GI258406050 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3635] Predicted phosphoglycerate mutase, AP superfamily 
TIGRFAM ID[TIGR00306] 2,3-bisphosphoglycerate-independent phosphoglycerate mutase, archaeal form
[TIGR02535] proposed homoserine kinase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCCCA AAATCGTTTT TTGTATTGCC GACGGCATGG GCGATTATCC CGTCCCCTCC 
CTTGGTGGAA AGACTCCCCT CGAAGCCGCG GAGACCCCGG AACTGGACGC GCTCTGCCCC
AACGGTCTGC AGGGGCAGGC CCAGACCATC CCCGAGGGCA TGGGCCCGGG CTCAGACGTG
GCCAATATGG CCCTGCTCGG GTATCCCCCG CGGACGTACC ACACCGGCCG GGGGCCCATC
GAAGCCGCGG CCCAGGGACT TACGCTCGCC TCCAACGACA TTGTCTGGCG GCTCAACCTG
GTCACCACCG ATGGGTGGAC GCCGGAAGCG GTCCTGCGCG ATTACGCCGC TGGCCATATC
GATACGGCCA CGGCGCGGGA TATGATCCTT GAACTGAACG AGACCTTCGG CGACAGCACC
TGGCAACTCG TCCCCGGCAT CCAGTACCGC CATCTCCTGG TCCAAAAAGG GGGTGCGGAC
AGTCCGGAAG CTGAACTCGC CATCCGCCCG CCCCACGATA TCCTCGACCA GCCCATTGGC
GCTGATCTCG AGGCTTTTGG TTCCAGTCCG GCGCTCAAGA CGTTGCTTGC CTCCTCGGCC
GAACGTTTGG CAAGCCGGGG CGGAACCGCC ACGGCCCTTT GGCCCTGGGG CCAGGGCAAG
TCGCTGCATC TGCCCGATTT CAGTGAACGG TTCGGCCTCA AGGGCCAAGT GGTATCGGCT
GTGGATCTGG TCAAGGGACT GGGACGCGCC GCCCAGATGG ACGTGGCGGA AGTGCCTGGG
GCCACTGGTC TGCTGGACAC CAACTACGCC GGGAAAGTCG AGGCCGCGCT GGAATTTCTC
GACCAGGGCG ATTTTGTCTA CCTCCATGTC GAGGCCCCGG ATGAATGCGG TCACGCGGGC
GACCCCGAGG CCAAGCAAGA GGCCATCGCC CGCTTTGACA GCCGCGTTCT CGCGCCCCTG
CGCCAGGCTC TGGGGCAGAC CGCCTATTTC ATGGTCGCTT GCGATCACCT CACACCGGTC
CGGGAACGCA CCCACACCAG CGATCCTGTC CCCTTTCTGC TCTCCGGGCC GGGGCTGCGC
CCGAACACCG CGCACAGCAC GTTCACCGAA GCCACGGCCG ACAGCGCCGA ACTCAGCCTG
TCCGCGGGAG AGGACCTGCT GCCGTTTGTC CTCAAAACCA TCGCCGATCT GAAATGA
 
Protein sequence
MLPKIVFCIA DGMGDYPVPS LGGKTPLEAA ETPELDALCP NGLQGQAQTI PEGMGPGSDV 
ANMALLGYPP RTYHTGRGPI EAAAQGLTLA SNDIVWRLNL VTTDGWTPEA VLRDYAAGHI
DTATARDMIL ELNETFGDST WQLVPGIQYR HLLVQKGGAD SPEAELAIRP PHDILDQPIG
ADLEAFGSSP ALKTLLASSA ERLASRGGTA TALWPWGQGK SLHLPDFSER FGLKGQVVSA
VDLVKGLGRA AQMDVAEVPG ATGLLDTNYA GKVEAALEFL DQGDFVYLHV EAPDECGHAG
DPEAKQEAIA RFDSRVLAPL RQALGQTAYF MVACDHLTPV RERTHTSDPV PFLLSGPGLR
PNTAHSTFTE ATADSAELSL SAGEDLLPFV LKTIADLK