Gene Dole_2837 details

Gene Information

Locus tag: Dole_2837
Symbol:
ID: 5695695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 3419170
End bp: 3420339
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 59%
IMG OID: 641265452
Product: lipid-A-disaccharide synthase
Protein accession: YP_001530717
Protein GI: 158522847
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0763] Lipid A disaccharide synthetase
TIGRFAM ID: [TIGR00215] lipid-A-disaccharide synthase
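
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3419170, 3420339)
print(length, protein_length_aa(length))  # 1170 389
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of this record.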


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGATAAGC ATGCTTTGCC GAACAGGGTG CCAGTGGGCC GGTGCGTGAT GATCATTGCC 
GGCGAGGCCT CCGGCGACCT GCACGGCGCC AACCTGATCC GAAACATGCG CGAACAGATT
AAGGACCCTC TTTTTTTCTG CGGCATTGGA GGGGCGGCCA TGCGCCGGGC CGGCGCCAAG
ATTCTGGTGG AGGCGGAGCG GCTTTCGGTG GTGGGAATCA CCGAGGTGAT TGCCCGCATG
CCGGATATCC TAAGCGGCAT GAAAACGGCC AAAAGGATGC TGGCCTCCCG CATTCCCGAT
CTGCTGGTGC TTATCGATTT TCCCGATTTT AACCTGAGAA TGGCCGCAAC GGCCAAAAAG
CACGGCATTC CCGTTTTTTA CTATATTTCT CCCCAGGTAT GGGCCTGGCG AAAAGGCAGG
GTGCGCACCA TTCGAAAACG GGTGGATCAC ACGGCGGTGA TTCTTCCCTT TGAGGCCGAT
TTTTTTAAGG CCCACGATGT CCCCGTGACC TTTGTGGGCC ATCCCCTGCT GGACGCCGGA
TACGGTCCGG CGCCGTTATA CGAGAGAACA GAAGGGCGGA CAGTGGTGGG CCTGCTGCCC
GGTTCCAGGG GCAGCGAGGT GGCACGACAC CTGCCTGTAA TGATGGAAGC CGGGGCCCGG
ATCAGCCGTC GCCATCCCCA TGTCACTTTC ATGGTCTCCT GCGCGCACTC GATTCCGGTG
GAAAGCATGG CTTCAATCAC GGAAAAGTAT ATCGGCACCG TTCCTTTTAC CATTGTTCCC
GGTGACGTGA CCCAGGTGTT GAAGAGGAGC ACCTGCGTTG TGGCGGTGTC CGGCACCGTG
TCCCTTGAAA CGGCCCTGTA CGGCGTTCCC ATGGTGGTGA TTTACAAGGT GTCGTTTCTC
AGTTACTGGC TGGCAAAGGC ATTGATCCGG CTGGAGCACA TCAGCCTGGT GAACCTGATC
GCCGGAAAAG CGGTTGTGCC GGAGCTGATT CAGAAAGATG CGTCGGCGGA GCATATTGCC
GCGCGCATCA TGTCGATGAT TTCTGATCCC CAGGAACTGG AGACCGTTCG AAAGGAGCTT
GCCGAAGTTC GGAAGCGCCT GGGCGGTCCC GGGGCATCTG CCCGGGCCGC CGGAATCGCG
GCCCGGCTGC TGAATGAAGG GGTAACCTGA
 
Protein sequence
MDKHALPNRV PVGRCVMIIA GEASGDLHGA NLIRNMREQI KDPLFFCGIG GAAMRRAGAK 
ILVEAERLSV VGITEVIARM PDILSGMKTA KRMLASRIPD LLVLIDFPDF NLRMAATAKK
HGIPVFYYIS PQVWAWRKGR VRTIRKRVDH TAVILPFEAD FFKAHDVPVT FVGHPLLDAG
YGPAPLYERT EGRTVVGLLP GSRGSEVARH LPVMMEAGAR ISRRHPHVTF MVSCAHSIPV
ESMASITEKY IGTVPFTIVP GDVTQVLKRS TCVVAVSGTV SLETALYGVP MVVIYKVSFL
SYWLAKALIR LEHISLVNLI AGKAVVPELI QKDASAEHIA ARIMSMISDP QELETVRKEL
AEVRKRLGGP GASARAAGIA ARLLNEGVT
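
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, assuming NCBI translation table 11 (the table named in this record): its codon assignments match the standard code, and an alternative initiation codon such as TTG is read as methionine at the first position. The names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Codon assignments for translation table 11, listed in NCBI's T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by table 11; each is translated as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, ignoring whitespace; stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGGATAAGC ATGCTTTGCC GAACAGGGTG"))  # MDKHALPNRV
```

The printed residues match the first block of the protein sequence listed above, including the leading M derived from the TTG start codon.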