Gene Dret_2135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2135 
Symbol 
ID8419985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2432610 
End bp2433671 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content69% 
IMG OID645038728 
Producttetraacyldisaccharide 4'-kinase 
Protein accessionYP_003198997 
Protein GI258406255 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1663] Tetraacyldisaccharide-1-P 4'-kinase 
TIGRFAM ID[TIGR00682] tetraacyldisaccharide 4'-kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000208743 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.172594 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGATC CCGACTGGTC CCACTGGCAG GCCCGCCTGG CCCCGCTTCT CCGCCCCGCC 
GGCCGGCTCT ATGCCCAGGC CATGCGCTGC CGCGAACACG CCTACGCCGC CGGCTGGCTC
ACCTCCTGGC GCCCGCCCGT GCCTTGCGTC AGCGTCGGCA ACATCGGTTG GGGCGGCTCC
GGCAAGACAC CGCTTTGCCA GTATCTGCTC CGCCGGGCCG CAGAGCACGG CCAGCGAGCG
GCCCTGCTCA CCCGTGGGTA CCGCGCCCAC CCGCCCCATC TGCCCTACCT GGTCCGCGCG
GACAGCCCTC CCGACGAGGC CGGTGACGAA CCGCTTTTAC TGGCCCAGAG CTGCCCCGAA
GCGCACGTCT GGGTCGATCC CCTCCGGCCC CGCGCCGGCG CCCGGGCCTG GCGCACGAGC
CAGCCCGATC TCTACCTCCT GGACGACGGT TTCCAGCACC TCGCGGTCAA ACGGGATATC
GATCTGGTCC TGCTCCGGCC AGAGGACCTG GACAGTCAAT GGGACACCGT CCTGCCCGGA
GGGTCCTGGC GGGAAGGGCG CCGCGGACTC AGACGCGCCG ACGGGTTCTG CATCCAGGCC
CCGCCCGCGC AGTGGCCCCG TCTGCGGGCC AAATTTATCG ACCGCCTCGG AGCACTGCAA
AAGCCCCTGT TTTCCTTTTC TCTCACCGTC CAGGGCATCC GCCACTGCGA AACCGGTCTG
CTGCGCGCGG CCCCGGGGAC CCCGTATTTG CTTATAAGCG GCGTGGCCGG ACCACAGCGG
GTCCTGCGCA CCGCAACCGA CACCTTCGGC CCTCCAGTCA GACACCTCTG CTACCCGGAC
CACCACCAGT TTACGGCTGC GGACTGGGAG ACCATCCAGC GCACCGCCCG CGACCACGCT
TGCCCCACCG TCCTGTGCAC CCCCAAAGAC GCGGTCAAAT TGCGCTCCCT GGCCGACGAC
TCCCTTTTCG CCTTTGACCT GGACCTCGCT TTTGGGCCAC AGTGGCTGGC GGCAGCCCCC
TTTTCCAAGT GGCTGGAACA CCGGCTCGTC CAAAAAGTGT AA
 
Protein sequence
MPDPDWSHWQ ARLAPLLRPA GRLYAQAMRC REHAYAAGWL TSWRPPVPCV SVGNIGWGGS 
GKTPLCQYLL RRAAEHGQRA ALLTRGYRAH PPHLPYLVRA DSPPDEAGDE PLLLAQSCPE
AHVWVDPLRP RAGARAWRTS QPDLYLLDDG FQHLAVKRDI DLVLLRPEDL DSQWDTVLPG
GSWREGRRGL RRADGFCIQA PPAQWPRLRA KFIDRLGALQ KPLFSFSLTV QGIRHCETGL
LRAAPGTPYL LISGVAGPQR VLRTATDTFG PPVRHLCYPD HHQFTAADWE TIQRTARDHA
CPTVLCTPKD AVKLRSLADD SLFAFDLDLA FGPQWLAAAP FSKWLEHRLV QKV