Gene Dret_0538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0538 
Symbol 
ID8418346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp647671 
End bp648882 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content61% 
IMG OID645037102 
Product2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 
Protein accessionYP_003197413 
Protein GI258404671 
COG category[I] Lipid transport and metabolism 
COG ID[COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase 
TIGRFAM ID[TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCT GGACCATCAT CCTCGCCGGC GGTAGCGGCT CACGCCTCGC CGAAGCCACA 
AGCGGGGTCA AGAAACAATT CCTGCACTAT CTTGGCCGGC CGCTATTGTG GCACAGTGCC
GCCACGTTTG CCGCCATGCC AAGCATCGAA GGCATTGTCA TGGTTGCTCC GGCAGAGGAA
TTGGAAACCG CGCGCGCGCT GTTCAACGAG TGTGCGGCCC AATCGCCCCT GGGAGTCCCG
GTGCGGTGGA CCGTTGGCGG CAGACGCCGC CAGGATTCCT CGGCCCAGGG GCTGGCTTCC
CTGCCGGCCG AGTGCCGCCG CGTCCTGATC CACGACGCTG CCCGCCCCTT TGTCAGTGCC
CCCCTCACGC AACGCGTCCT GGACGCCTTG GAATGTTTTG ACGGTGTCGT GCCCGGGATT
CCAGTCACCG ACACCATCAA ACAGGCCCAA CAAGACTTGG TCAGCACCAC CCTCCCTCGC
CATGAACTCT TTGCAATTCA AACCCCTCAG GGGTTTCGCA CCGCTGCGCT TGACCAGGCC
CACAAAACAG TCGCTGACCA CGGCATTGAC GTTACCGATG ACGCCTCCAT GCTCGAGTAT
AGCGGCGGTC GTGTCGGGGT CGTTGCCGGA GAGCGGAGCA ATTGCAAAAT CACCACCGCC
GAGGACTTGC GTATGTTGAC CGCTTCTCCT TCGACCCGGA TCCCGTGCAC CGGTTGGGGC
TACGATGTAC ACCGCTACGG AGCGGGCAGG CCCATGAAAC TCGGCGGGAT CCCGATCACC
AATGGCCCGG AGATCATCGC CCATTCCGAC GGCGACGTCC TGCTCCACGC TCTGATGGAT
GCCCTGCTTG GATGTTTGGG GGCCGGAGAT ATTGGCGAAC ATTTTCCCGA CACGGACCCC
CGGTGGGACA ACGCCAACAG CAGTGCCCTG CTTACCGACG TTTTGGACTG GTGCCGTTTC
AACGGACTCA TCCTGCGGCA CGTCGATATG ACTGTGGTTT GTCAAACCCC GAAACTGCAA
CCCTGGAAAC ACCAGATCCG AAAAACCGTG GCCGCGCTCC TCGGTCTGGC TGAACACCAT
TGCAATCTGA AGGCGACTAC TGAAGAAGGG CTCGGATTTA CTGGACACAA AGAAGGCATC
AAAGCGATCG TCCTGGTGAC AGGAGAACGG GAAACCCCGA CCCACGAATC CACTTTTCCA
GAGTCCCGGT AA
 
Protein sequence
MSTWTIILAG GSGSRLAEAT SGVKKQFLHY LGRPLLWHSA ATFAAMPSIE GIVMVAPAEE 
LETARALFNE CAAQSPLGVP VRWTVGGRRR QDSSAQGLAS LPAECRRVLI HDAARPFVSA
PLTQRVLDAL ECFDGVVPGI PVTDTIKQAQ QDLVSTTLPR HELFAIQTPQ GFRTAALDQA
HKTVADHGID VTDDASMLEY SGGRVGVVAG ERSNCKITTA EDLRMLTASP STRIPCTGWG
YDVHRYGAGR PMKLGGIPIT NGPEIIAHSD GDVLLHALMD ALLGCLGAGD IGEHFPDTDP
RWDNANSSAL LTDVLDWCRF NGLILRHVDM TVVCQTPKLQ PWKHQIRKTV AALLGLAEHH
CNLKATTEEG LGFTGHKEGI KAIVLVTGER ETPTHESTFP ESR