Gene Daro_3296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3296 
Symbol 
ID3567163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3545447 
End bp3546460 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content64% 
IMG OID637681769 
Productlipopolysaccharide heptosyltransferase II 
Protein accessionYP_286496 
Protein GI71908909 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID[TIGR02195] lipopolysaccharide heptosyltransferase II 


Plasmid Coverage information

Num covering plasmid clones68 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.295722 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGG CCCTCATCGT CGCCCCCTCC TGGATTGGCG ACACGATCAT GGCGCAACCG 
CTGTTCGCGC GCTTGCACGC CAATCATCCC GGCTTGCAGC TCGACGCCCT GGCGCCACGC
TGGGTAGCCC CTGCCCTGCA GCGAATGGAT GAAATCCGCG ATGTCGTCGA CAGCCCGTTC
GGCCACGGCC AGCTATCGCT GAAGGCCCGC TGGCGACTGG CCCGCGACCT TGCCGCCCGC
CATTACGATA CCGTTTACGT CCTGCCCAAT TCGCTAAAAT CGGCACTCGT GCCGTGGATG
GCCGGCATTC CACAACGCAT CGGGTTCACT GGCGAATCCC GCTTTGGCCT GATCAACGTT
CGCCACACGC TCGACAAGCA GGCGCTGCCG CTGATGCTTG AGCGCTTTAC CCAGCTGGCC
GAACGACCGG GCGCGCCGCT ACCCAAGCCG ATCGCCCACC CCAGAATTCG CTCGAGCGCA
GCCGATCAGG CAAAAACGCT GGTTGAACTA GGTCTGGAGC GCCCGGCCCG CATCGTCGCC
TTCTGTCCCG GCGCCGAATA CGGCCCGGCC AAGCGCTGGC CAGCAGCCCA TTTTGCTGCA
CTGGCCAGGC AACTGGCTGA AACCGGCCAC TCCATCTGGC TTTTCGGTTC GCCGAAAGAC
CGCGCCGTCG CCGAAGAAAT CTCACTGCTT GCCCCAGGCC TCTGCCGCAA CCTGTGCGGC
GCCACTTCGC TAACCCAGGC CATCGACCTG CAGGCCATGG CCGAACTGGT CGTCTGCAAC
GACTCCGGCC TGATGCACGT TGCCGCCGCG CTCGACCGGC CGATCGTCGC GCTGTATGGT
TCGTCGTCAC CGGGTTTCAC GCCGCCGCTC TCCGACAAGG CCGACATCCT CAGCCTGCAA
CTCGACTGCA GCCCCTGCTT CAAGCGTGAA TGTCCGCTCG GCCATCTCGA CTGCCTGAAC
AAACTGTCCC CCGAACAGGT TTTCGCCGCC TGCCAGAAAT GGATCTCGCG ATGA
 
Protein sequence
MNKALIVAPS WIGDTIMAQP LFARLHANHP GLQLDALAPR WVAPALQRMD EIRDVVDSPF 
GHGQLSLKAR WRLARDLAAR HYDTVYVLPN SLKSALVPWM AGIPQRIGFT GESRFGLINV
RHTLDKQALP LMLERFTQLA ERPGAPLPKP IAHPRIRSSA ADQAKTLVEL GLERPARIVA
FCPGAEYGPA KRWPAAHFAA LARQLAETGH SIWLFGSPKD RAVAEEISLL APGLCRNLCG
ATSLTQAIDL QAMAELVVCN DSGLMHVAAA LDRPIVALYG SSSPGFTPPL SDKADILSLQ
LDCSPCFKRE CPLGHLDCLN KLSPEQVFAA CQKWISR