Gene Daro_1096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1096 
Symbol 
ID3569369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1198750 
End bp1199949 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content69% 
IMG OID637679558 
Productmajor facilitator transporter 
Protein accessionYP_284322 
Protein GI71906735 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.454099 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAG CTCCGGAAGC CCAACACCGC CGCGCTGTCA TGCTGCTCTC CGTCGCCGCC 
TTCGCCAGCG CGGCGGCAGC GCGGCTGTGC GACCCGATGC TGCCCGACCT CGCTCGCAGC
TTCACGGCCA GCCCGACTGC CGTCGCCTCG GTCATCTCCT CGTTCGCCAT CGCCTATGGC
CTGACCCAGG CCATGTTCGG CCCGCTCGGC GACCGCCTCG GCAAATACCG CCTGATCGCC
CTGACCACCC TGCTCAGCAC GCTGGGCGCA CTCGGCTCGG CCATCGCCTG GTCGCTCGAT
GCGCTGGTCG TCTCCCGCGT GCTGCTCGGC GCCACGGCGG CCGGCATCAT CCCGCTCTCC
ATGGCGTGGA TCGGCGACAC GGTGCCCTAC GAACAACGGC AAGCCACGCT CGCCCGCTTC
CTGGGCGGGC AGATTCTCGG CGCCATCGGC GGCCAGTTCA TTGGTGGCGT GTTCACCGAC
ACGCTCGGCT GGCGCTGGGC CTTCGCCTTC CTGGCCGGGC TCTACCTGAT CATCGGCGCC
GTCGTCCTGC TCGAATCGCG GGCCAACCCG AGCACCCATC ACCGCCATGC CGATACCCCC
CGCCAGGGCA TTCTCGGCCA GGCCGCGCAG GTCTTCGCCC AACCGTGGGC CCGGGTCATC
CTGAGCATTG TTTTCCTCGA AGGCATGCTG GTGTTCGGTG CCCTGGCCTT CGTGCCCTCC
TACCTGCACG AACATTTCGG CCTCAGCCTC ACCATGGCCG GCGCCGCCAT GGCCTTCTTC
GGCCTCGGCG GCCTGTCGTA CATCCTTGCC GCCCGGCATT TTGTCCGCCT CCTCGGCGAA
GTCGGGCTGG CCACCGGCGG CGGCATTCTC ATCGCGCTCG GCTGGGCGAT GCTGGCCTGG
GGCACCACCT GGCTATGGGC CCTGCCGGCC AGCTATTTTG TCGGCCTCGG CTTCTACATG
CTGCACAACA CCCTGCAGAC CAACGCCACC CAGATGGCCC CCGCCGTACG CGGCACCGCG
GTCTCCTTGT TCGCCTCCAG TTTCTTCCTC GGCCAATCGC TCGGCGTCAC CCTCGCTGCC
CACATCCTGG CCGCATCCGG CATCCTCCCC ATGCTGCTGA TCGCTGCCAT CGGCACACCG
CTGGTTGGTG GTACGCTGGC CTGGCTTCTC TCCCGGCACC ACCGCATACA TTCCGCCTAG
 
Protein sequence
MPEAPEAQHR RAVMLLSVAA FASAAAARLC DPMLPDLARS FTASPTAVAS VISSFAIAYG 
LTQAMFGPLG DRLGKYRLIA LTTLLSTLGA LGSAIAWSLD ALVVSRVLLG ATAAGIIPLS
MAWIGDTVPY EQRQATLARF LGGQILGAIG GQFIGGVFTD TLGWRWAFAF LAGLYLIIGA
VVLLESRANP STHHRHADTP RQGILGQAAQ VFAQPWARVI LSIVFLEGML VFGALAFVPS
YLHEHFGLSL TMAGAAMAFF GLGGLSYILA ARHFVRLLGE VGLATGGGIL IALGWAMLAW
GTTWLWALPA SYFVGLGFYM LHNTLQTNAT QMAPAVRGTA VSLFASSFFL GQSLGVTLAA
HILAASGILP MLLIAAIGTP LVGGTLAWLL SRHHRIHSA