Gene Daro_1351 details


Gene Information

Locus tag: Daro_1351
Symbol:
ID: 3569241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1469367
End bp: 1471016
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 54%
IMG OID: 637679819
Product: transcriptional regulator LysR
Protein accession: YP_284570
Protein GI: 71906983
COG category:
COG ID:
TIGRFAM ID:
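
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon, which is the usual convention for such records but is not stated in this one. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1469367, 1471016)
print(length)                     # 1650
print(protein_length_aa(length))  # 549
```

Both results agree with the Gene Length and Protein Length fields of this record.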


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.0000706303
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000188076
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATCAAC AGAGGTCAGT GCGCGCTACA TCGCCTGGGC GTCTCACATA TCTGGCGCTT 
GCCGTTACTC TCGTATCGCC GGTTCTTTGT GCAGCAGAGG AAGGGGGGGC TTTCGACAGT
TTCAAGGTTG ATGGTTACCT GCGCGAAGAA GTGTCGGTCA ATACAAAGAA CTGGGCGGAT
ACGCCGAACT ACAACGACCG AGGCAAGGTG TCGATGGCGC GCACAACGTT GCGCCTGAAC
GCTGACTGGA AAGCGACGGA CACCCTGTCC ATCGTGGCCA AATGGCGCGG CTCGCAGGAA
CTAAAGACAC CGTTCCTGAA GCATCTTGAA CAAATGGGGG CCAGCAACTA CACCTCTGGT
GGGGCTGGAA CCCAAGCTGA CATCATGGAT CTCTACAACA AGTCCGATTT CCGCGAGCTT
TATGTCGACT GGCAGGCCAC CGATCGCGTG AAATTCCGCT TCGGCCGGCA ACAGATCGTC
TGGGGCGAGA CCGATTTCTT CAACGCCAAC GACATGGTGC ATGGCTTCAA CTTGACCTGG
CGTTCTTTTC TTGAGCCGGC CAATGAAGAA CTGCGCAAGC CGTTGATCAT CTTGAAGACC
AACATCGATT TACCGGAAGC CAATGGAGCT ATCGAAGCTT TCGTTCGTCC GGGTTGGGAT
CGCAAAAAGG ATATCGGGAC CGAACTGGAC ATCTACGGCG GTCGCTGGTC CAGCCAGCCG
TACGCCGGCG TCGACTTCCG CAACATCGAC CCCTATAACC TGGACAACAA GGAAGGCGAT
TACAAGAAGG TTACCGGTGG TATTCGCTGG AACGGAACGA CTGATCATTT CAATTATTCG
CTGTCGTACC TGAAGACCTT CTGGCAAAAC CCCATTTTGA ATCCCAGTTC GACTGACTTT
GCTGGGGGGG CTTTCTATGT TCCGACCACG CAGGGCGCCT CGAATATCAA GCCTCAACAA
GGCGCCATCT TTGGCGAAAT CATCTATCCG CTCGTCGATG TCTTCGGCGC CACCGCTTCC
GGCTATGCCG ATTGGGCTGA TGCCGTGTTC AGCACCGAAG TCGCCTACAT CAAGGATGCG
CCTTATCAGT TCAACAATTT TCCGAACAAT TCACTGGCAT CGACGGTTGT GGCGCCTGGC
TTCGACGGGT TCAAAAAGAA AAACGTGATC GCCTGGATGT TGCGCATGGA TAAAAACATC
GCAGCCACCC AGAGCCTGCT CGGCTCAGAG AAGCCGATGT TCTTCTCGGT TCAGCTCTTC
GATAAGTGGA TTCAGGACTT CAACGAGAAC GAAGGCCTGC TGAATAGTGT TGGTTGGGGC
GCACGCACCA AGGAACACTC GTTTTTGCTG ACCGGTATTT TTAGCCTGAG CTACAACAAT
GGCCGCATCA AACCTGAACT GGTCGTTGGT ACAGATCTGA CCTACAAAGG TGGATTCTTC
GCACCGTCGG TCACCATGGA GCTATCCAAG AGTCTGAAGT GGAAAATCGA ATACGACGGC
TTTTGGGATG GTGGTCGTTG GCGTGATACC GGACCGGGCA GCCGGTGCGC CAGCCCCGGC
GCCAATCAGT CCAACTGCGA TAGCGCCGGT TTGTTTGGCT ATTTCCACAA CCGCGACCAG
CTGTACACCA GCCTGACTTA TCAGTTCTGA
 
Protein sequence
MNQQRSVRAT SPGRLTYLAL AVTLVSPVLC AAEEGGAFDS FKVDGYLREE VSVNTKNWAD 
TPNYNDRGKV SMARTTLRLN ADWKATDTLS IVAKWRGSQE LKTPFLKHLE QMGASNYTSG
GAGTQADIMD LYNKSDFREL YVDWQATDRV KFRFGRQQIV WGETDFFNAN DMVHGFNLTW
RSFLEPANEE LRKPLIILKT NIDLPEANGA IEAFVRPGWD RKKDIGTELD IYGGRWSSQP
YAGVDFRNID PYNLDNKEGD YKKVTGGIRW NGTTDHFNYS LSYLKTFWQN PILNPSSTDF
AGGAFYVPTT QGASNIKPQQ GAIFGEIIYP LVDVFGATAS GYADWADAVF STEVAYIKDA
PYQFNNFPNN SLASTVVAPG FDGFKKKNVI AWMLRMDKNI AATQSLLGSE KPMFFSVQLF
DKWIQDFNEN EGLLNSVGWG ARTKEHSFLL TGIFSLSYNN GRIKPELVVG TDLTYKGGFF
APSVTMELSK SLKWKIEYDG FWDGGRWRDT GPGSRCASPG ANQSNCDSAG LFGYFHNRDQ
LYTSLTYQF
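
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate` and `gc_percent` are hypothetical, and the example uses only the first and last few codons of the gene sequence above.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First seven codons of the gene sequence, grouped as in the record.
print(translate("ATGAATCAAC AGAGGTCAGT G"))  # MNQQRSV
# Last four codons, ending in the TGA stop codon.
print(translate("TATCAGTTCTGA"))             # YQF
```

Applying `translate` to the full gene sequence should reproduce the protein sequence listed above, and `gc_percent` can be compared against the GC content field.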