Gene Dret_1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1954 
Symbol 
ID8419799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2236942 
End bp2237919 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content63% 
IMG OID645038542 
ProductPorphobilinogen synthase 
Protein accessionYP_003198816 
Protein GI258406074 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.635001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAA CAGACTTCTT CAGAGGCCGT CGTTTGCGAC AGACGGCCAC AATGCGAGCC 
CTGGTCCGCG AAACCGTGCT CCGGCCCGAA GACCTGATTC AGCCCTATTT CGTGGCCGAG
GGCGACCGGC ACGCCAGCCG GCCTATCGGG TCCATGCCCG GCCAGCACCA ACTCGGTCTG
GAGGCCCTGA CGGCCCGGGT GAGTCAGGCC GTGGACAACG GCCTCCACGC GGTCATGCTC
TTCGGCATCC CTGAGTCCAA GGATCCGCAA GGGTCTCAAG CCTATGCCCC GGAGGGCATC
GTCCAGGAAG CGGTCCGCCG CCTCAAAGCT GCCGTGCCGG AGCTGACGGT CATCACTGAC
GTCTGCCTCT GCGAATTCAC CTCGCACGGC CATTGCGGTC TCGTGGACGG CCAGCAGGTC
CTCAACGACC CGACCCTGGA ACTCTTGGCT GCCACCGCTG TCTCCCACGC CGAGGCCGGA
GCGGACATCG TCGCCCCCTC GGACATGATG GACGGACGGG TTCAGGCCAT CCGGCTCGCT
TTGGATACCG CCGGTTTTCA ACAGACACCG ATCATGTCCT ATGCGGTCAA ATACGCTTCG
GCCTTCTACG GCCCGTTTCG GGAGGCGGCG GAAAGCGCGC CGCAATTCGG CGACCGCAAG
ACCTATCAAA TGGATCCGTC CAACGCTCGG GAAGGGCTCC GCGAAGCCGA GGCGGACACC
GCTGAGGGCG CTGATTTTCT GATGGTCAAA CCGGCTCTGG CCTACCTCGA TATTCTGAGT
CAACTGCGGC AGCGCAGTAC ACTTCCCCTG GCCGCCTATC ACGTCAGCGG CGAATACAGC
CTGATCAAGG CGGCTGCCCA GCAGGGGTGG ATCGATGAAC AGGCCGTGGC CCTGGAATCC
CTGACAAGCA TCAAACGGGC CGGCGCTGAT CTGATCCTGA CCTATTTTGC GGAACAGGCT
CTGGACTGGT TGTCCTGA
 
Protein sequence
MAKTDFFRGR RLRQTATMRA LVRETVLRPE DLIQPYFVAE GDRHASRPIG SMPGQHQLGL 
EALTARVSQA VDNGLHAVML FGIPESKDPQ GSQAYAPEGI VQEAVRRLKA AVPELTVITD
VCLCEFTSHG HCGLVDGQQV LNDPTLELLA ATAVSHAEAG ADIVAPSDMM DGRVQAIRLA
LDTAGFQQTP IMSYAVKYAS AFYGPFREAA ESAPQFGDRK TYQMDPSNAR EGLREAEADT
AEGADFLMVK PALAYLDILS QLRQRSTLPL AAYHVSGEYS LIKAAAQQGW IDEQAVALES
LTSIKRAGAD LILTYFAEQA LDWLS