Gene Daro_1932 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1932 
Symbol 
ID3569589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2076951 
End bp2078381 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content61% 
IMG OID637680403 
ProductGntR family transcriptional regulator 
Protein accessionYP_285148 
Protein GI71907561 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.00018992 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.474146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAC AGCTATTACG TTATGAACAA CTGGCCAGCG AATTGGCTGG CATGGTCGAA 
AGCGGCGTGC TCAGTCGCGG TGATCGGCTG CCATCGGTCA GGCAACTCTC GAGAGAGCGG
CGCCTTTCTG TATCTACCGT GCTTCAGGCC TTACACCAAC TTGAAGACAG GGGGCTGGTC
GAGGCGCGGC CTCAGTCGGG GTATTTCGTT CGGCAGGCAA AGACAAGCCA CGCCGAACCG
CAGCTGCGTT CGACCCCTGA AGCGCCCATG CCGGTTGATA TCTCGCAGCG CCTGGTCAGG
GTCTTGCAGG CCGGCACCGG GCCGGGAGTT GCCCCGCTGG CTGCGGCGCT ACCAGCCTCA
GCGCTGTTGC CGGTGGCCGC TTTGAACCGC CTTTACGCCG GAGTCGTCCG GCGCCATCCT
GAACTGTTGT CGGGTGGTAG CCACATCAAT ATGGACGAAC CGGCCTTGGT TCGCCAACTG
GTGCGCCGTT CCCTGGCTTG GGGTGGGCCC GTGGCGGCTG AAGAATTAAT TATCACCAAT
TCCTGCACCG AATCGCTTGG GTTGTGCCTG AGGGCGGTCA CCAAGCCCGG TGACACAGTG
GCCGTCGAGT CGCCGGCCTA TTATCTGATG CTCCAACTGC TGGAGACTTT GGGGCTGAAG
GCGCTGGAGA TTCCTACCGA TCCGCGCACC GGCATGTCGC TCGATGCGCT GGAACTTGCG
ACTCGTCAGG GCCGGGTTGC CGCCTGTCTG CTCGTCTCGA ATGGCAGCAA TCCGCTCGGT
TGCGTCATGC CAGACGAAAG GAAACGCCTG TTCGCGACGC TGACCGCCGC GCGCGGCGTC
GCAGTGATTG AAGACGACAT CTACGGTGAT TTGCACCTTG GCAATGAACG GCCCTGGCCA
ATCAAGGCCT ACGACAAGAC TGGTAACGTG TTGCTTTGTT CGTCGTTTTC GAAAAGCCTG
TCGCCGTCGT TGCGCATCGG TTTCGTTGCA GCCGGTCGCT ACCGTTCGGC GGTTGCCTTG
CACAAGACAA TTTCGAGCGG GGGGACCAAT CCGATCACCC AGCATGTGCT GGCCGAATAT
CTGGAGTCGG GGGCCTATGA TCGCCACCTG CGAACGCTGC GCCGGGCCTA CGAGCGCCAG
GTGGAGGCCA TGCGGCTGGC CGTCAGCCGC TATTTTCCGG CAGCGACCCG CATCGCCCAG
CCACAGGGGG GGTATGTGCT ATGGGTCGAG TTGCCGGAAG CCTTCGATAC CACGCTCCTT
TACGAGCAGG CGATTGCGGA AAATTTGGCC TACGTGCCGG GGGAGCTGTT TTCACCCAGC
GGCATGTATC GCAACTGCCT GCGCCTCAAT TGCGGCAACC CGCACACGCC AGAAATCGAG
GATGCCGTGC GACGCCTCGG CGCTATCTTT TCCCGCCCCC AAATCAAATG A
 
Protein sequence
MSEQLLRYEQ LASELAGMVE SGVLSRGDRL PSVRQLSRER RLSVSTVLQA LHQLEDRGLV 
EARPQSGYFV RQAKTSHAEP QLRSTPEAPM PVDISQRLVR VLQAGTGPGV APLAAALPAS
ALLPVAALNR LYAGVVRRHP ELLSGGSHIN MDEPALVRQL VRRSLAWGGP VAAEELIITN
SCTESLGLCL RAVTKPGDTV AVESPAYYLM LQLLETLGLK ALEIPTDPRT GMSLDALELA
TRQGRVAACL LVSNGSNPLG CVMPDERKRL FATLTAARGV AVIEDDIYGD LHLGNERPWP
IKAYDKTGNV LLCSSFSKSL SPSLRIGFVA AGRYRSAVAL HKTISSGGTN PITQHVLAEY
LESGAYDRHL RTLRRAYERQ VEAMRLAVSR YFPAATRIAQ PQGGYVLWVE LPEAFDTTLL
YEQAIAENLA YVPGELFSPS GMYRNCLRLN CGNPHTPEIE DAVRRLGAIF SRPQIK