Gene Daro_3208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3208 
Symbol 
ID3566593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3459152 
End bp3460153 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content62% 
IMG OID637681679 
Productlipid-A-disaccharide synthase 
Protein accessionYP_286408 
Protein GI71908821 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1663] Tetraacyldisaccharide-1-P 4'-kinase 
TIGRFAM ID[TIGR00682] tetraacyldisaccharide 4'-kinase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.0000403525 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.19888 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCGC GCTGGCTACA GCGACAGTGG TTCGATCAAC GCCGGCGTCA GCCGGCGTTG 
TGGCTTTTGC TGCCGCTTGC CTGGCTGTAT GCCGGCTTGA GTGCGCTCAA TCGCCTGTTG
GCCAAACCCA AACACTTGCC GGTGCCGGTC ATCGTGGTGG GCAACATCAT CGTCGGTGGG
GCGGGCAAGA CACCGCTGAC CCTGTGGCTG GCTCGTCAGT TGCGCGATCG TGGCTGGCGA
CCGGGCATTG TCAGCCGTGG CTATGGCCGT TCCGGGGATG AGGTCAGGAC GGTTTCCGCA
CAATCGCGAC CGGAGGAAGT TGGCGACGAG CCCTTGTTGC TCGCCCGGCG CAGCGGGGTT
CCCGTCTGGG TTGGACGCCA TCGGGCAGTT GCCGGCGAGG CCTTGCTGGC TGCCCACCCG
GAAGTGAATG TCTTGCTCTG CGACGACGGT TTGCAGCATT ACGCCCTGGC GCGGGACGTC
GAACTTGTCG TCTTTGATGT CCGTGGTGCC GGCAATGGCT GGCGGCTGCC GGTCGGGCCA
TTGCGTGAGC CGGTGTCGCG GCTGGCGAGC GCCGATGCAG TCATTTGCAA CGGCCAGCCG
GAAACCCTCC TGCCGACTGC GACGCCGAGC TTTGAGATGA GTCTCAAGCC CGGCTTGTTC
TATCGCGTCG ATGTTGCTGG CCAGTCAGCT TCTGCCGAGA GTCTGCGTGA CCGGGGGCGG
CTTTATGCGC TGGCCGGTAT TGGCAATCCG GAGCGCTTCT TCCGGACGCT GGAATCCTTG
GGGTTGTCGT GCGAAACCCG TCCGTTTCCG GATCATCATC GCTATGTTGC GGCCGATCTG
GCATTTGCCA AAGACGGCAT CCTGCTGATG ACCGAGAAGG ATGCAGTAAA ATGTGCAGGA
ATGACGGCGG GTGAAACCTG GGTTTTGCCT GTCCAGGCCG AGCTTTCGCC GGCCCTGATT
GATTTAATCG TGGAGAAACT TCGTGGACGC CAGGTTGCTT GA
 
Protein sequence
MLARWLQRQW FDQRRRQPAL WLLLPLAWLY AGLSALNRLL AKPKHLPVPV IVVGNIIVGG 
AGKTPLTLWL ARQLRDRGWR PGIVSRGYGR SGDEVRTVSA QSRPEEVGDE PLLLARRSGV
PVWVGRHRAV AGEALLAAHP EVNVLLCDDG LQHYALARDV ELVVFDVRGA GNGWRLPVGP
LREPVSRLAS ADAVICNGQP ETLLPTATPS FEMSLKPGLF YRVDVAGQSA SAESLRDRGR
LYALAGIGNP ERFFRTLESL GLSCETRPFP DHHRYVAADL AFAKDGILLM TEKDAVKCAG
MTAGETWVLP VQAELSPALI DLIVEKLRGR QVA