Gene Daro_0700 details


Gene Information

Locus tag: Daro_0700
Symbol:
ID: 3569012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 767883
End bp: 768950
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 59%
IMG OID: 637679148
Product: bile acid:sodium symporter
Protein accession: YP_283926
Protein GI: 71906339
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID: [TIGR00832] arsenical-resistance protein
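
The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(767883, 768950)
print(length)                  # 1068
print(protein_length(length))  # 355
```

Both results agree with the Gene Length and Protein Length fields of the record.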


Plasmid Coverage information

Num covering plasmid clones: 76
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGCGC AATGCGAAGT GACGATGAAA CGGGCGGAAG GGCTGTCGAT GAGTGGGTTT 
GAGCGTTACC TGACGATCTG GGTATTCCTG TGCATCGTGA CCGGCATCGT GTTTGGCCAG
GTGTTCCCCG GGTTTTTTCA GGCAGTGGGC GGCATGGAGG TGGCACGTGT CAATCTGCCC
GTTGGCCTGC TGATCTGGGT GATGATCATT CCGATGCTGG TCAAGGTGGA TTTCGGCGCG
CTGAGCGAAA TGAAGCAGCA CGCCAGGGGT ATTGGCGTCA CGCTGTTCGT CAATTGGCTG
GTCAAACCGT TCTCGATGGC TTTTCTCGGC TGGCTGTTTG TACGCCAGCT GTTTGCTGCC
TATCTGCCGG CCGATCAGCT CGATAGTTAC ATTGCCGGTC TGATCCTGCT CGCTGCCGCG
CCGTGCACGG CGATGGTCTT CGTCTGGAGC CGGCTGTCGA ATGGTGATCC GCTGTTCACG
CTGTCGCAGG TGGCGGTCAA CGACACGATC ATGGTTTTTG CCTTTGCCCC CATCGTCGCC
TTCCTGCTCG GCATCTCGGC TATCACCGTG CCGTGGGAAA CGCTGCTTAC CTCGGTCGTG
CTCTACATTG TCATTCCAGT TGCGCTGGCT CAGTTCTGGC GCAGGTCGTT GTTGGCTCGA
GGCCAGGCCG TCTTCGATGC GGCAATGGCG AAAATCGGTC CGTGGTCGAT CTGCGCGCTA
TTGCTGACCT TGGTCTTGCT GTTTGCCTTC CAGGGCGAGG CGATCCTGCG TCAACCACTG
GTCATCGCGC TACTCGCCGT GCCCATCCTG ATTCAGGTCT TCTTCAACTC GGCGCTGGCC
TACTGGCTGA ATCGGGCGGT TGGCGAAAAG CACAACATCG CGTGCCCATC GGCGCTGATC
GGTGCTTCCA ATTTCTTTGA GCTGGCGGTG GCTGCGGCGA TCAGCCTGTT CGGTTTCGAA
TCCGGTGCAG CCTTGGCGAC GGTGGTCGGC GTGCTGATTG AAGTGCCGGT CATGTTGCTG
GTCGTGCGCG TGGTCAATGC CAGCAAGGGG TGGTACGAGG CAAAATAA
 
Protein sequence
MSAQCEVTMK RAEGLSMSGF ERYLTIWVFL CIVTGIVFGQ VFPGFFQAVG GMEVARVNLP 
VGLLIWVMII PMLVKVDFGA LSEMKQHARG IGVTLFVNWL VKPFSMAFLG WLFVRQLFAA
YLPADQLDSY IAGLILLAAA PCTAMVFVWS RLSNGDPLFT LSQVAVNDTI MVFAFAPIVA
FLLGISAITV PWETLLTSVV LYIVIPVALA QFWRRSLLAR GQAVFDAAMA KIGPWSICAL
LLTLVLLFAF QGEAILRQPL VIALLAVPIL IQVFFNSALA YWLNRAVGEK HNIACPSALI
GASNFFELAV AAAISLFGFE SGAALATVVG VLIEVPVMLL VVRVVNASKG WYEAK
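
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The sketch below is an illustration under stated assumptions, not the database's own pipeline: it builds the codon table by hand in the usual TCAG order (the amino-acid assignments of translation table 11 match the standard code; the two differ only in which codons may act as start codons, which does not matter here because the gene begins with ATG), strips the spaces used to block the sequence into groups of ten, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAGTGCGC AATGCGAAGT GACGATGAAA"
print(translate(first_blocks))  # MSAQCEVTMK
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein sequence and the GC content field to be checked in the same way.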