Gene Daro_3337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3337 
Symbol 
ID3566321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3591244 
End bp3592194 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content64% 
IMG OID637681809 
Productcation efflux protein 
Protein accessionYP_286536 
Protein GI71908949 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1230] Co/Zn/Cd efflux system component 
TIGRFAM ID[TIGR01297] cation diffusion facilitator family transporter 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00037106 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCAGC CACAACACGA CGTTTCGCGC TGGGCCCACC AACACCAGTA CGGCACCGGC 
AATGCGGCGG CCGAACGCGG CACGCGGGCA GTCATGTGGA TCACCATCGC CACCATGCTG
GTCGAAATCA TCGCCGGCTG GTGGTTCAAC TCGATGGCCG TGCTGGCCGA CGGCTGGCAC
ATGAGCTCGC ACGCGCTGGC CATTGGCCTC TCGGCCTTCG CTTATGGCGC GGCGCGCAAG
TACGCCAGCG ACCCCAGTTT CGCCTTCGGC ACCTGGAAGA TCGAAGTACT GGCCAGCTAT
ACCAGCGCCA TCTTCCTGCT CGGCGTGGCT GGTGCGATGG TCTTCGGTTC GCTGGAGCGC
CTGTGGCAGC CGCAAACAAT CCACTACCCG GAAGCGATGG GCGTTGCCAT CTTTGGCCTG
GCGGTCAATC TGGTTTGTGC GCTGATCCTC GGCCAGGCTG GAGATCACGG CCATCACCAC
CACGACGATG GCCATGCCCA CCATCATCAC CACGACCTGA ACCTGAAAGC CGCGTATATC
CACGTCATCA CCGATGCGCT GACCTCAGTG CTGGCGATTG CCGCGCTGGC CGGCGGCTGG
TTCTACGGCT GGGCCTGGCT CGACCCGGCG ATCGGACTGG TCGGCGCCGT GCTGGTTGCG
CTCTGGGCGA AAAACCTGAT TTTACAAAGC GGCCGCGTGC TGCTCGACCG CGAGATGGAC
CATCCGGTGG TCGCCGAAAT CCGCGAGGTC ATCGAACAAC TACCGCTAGC CGGCAGCACG
CAACTGACTG ACCTGCACGT CTGGCGCGTC GGCAACGGCG CCTACGCCTG CGCACTGAGC
CTCCTGACCC ACGATCAGGC CCTGACACCG TTGCAAGTTC GCAGCGCCCT GGGCGTGCAT
GAGGAAATCG TGCATGCCAC GGTCGAAATC CACCGCTGCG ACCTTTGCTA G
 
Protein sequence
MKQPQHDVSR WAHQHQYGTG NAAAERGTRA VMWITIATML VEIIAGWWFN SMAVLADGWH 
MSSHALAIGL SAFAYGAARK YASDPSFAFG TWKIEVLASY TSAIFLLGVA GAMVFGSLER
LWQPQTIHYP EAMGVAIFGL AVNLVCALIL GQAGDHGHHH HDDGHAHHHH HDLNLKAAYI
HVITDALTSV LAIAALAGGW FYGWAWLDPA IGLVGAVLVA LWAKNLILQS GRVLLDREMD
HPVVAEIREV IEQLPLAGST QLTDLHVWRV GNGAYACALS LLTHDQALTP LQVRSALGVH
EEIVHATVEI HRCDLC