Gene Daro_1487 details

Gene Information

Locus tag: Daro_1487
Symbol:
ID: 3568997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1603477
End bp: 1605126
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 64%
IMG OID: 637679955
Product: helix-turn-helix, Fis-type:Nif-specific regulatory protein
Protein accession: YP_284706
Protein GI: 71907119
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID: [TIGR01817] Nif-specific regulatory protein
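
The coordinates and lengths listed above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1603477, 1605126)
print(length)                     # 1650
print(protein_length_aa(length))  # 549
```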


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.659732
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCGAT CTAAGAAAAA CCACGAGAAC AGCATGCACG AACACACCAT CCACCCGCTT 
GGCGATGAAT GCCCGCCCAC CGGCGGCTTC TGCCGAACGC ACGAGCTGAA CATCCAGCTG
CTGGCCGCGC TCTACGCCGT CAGCCGGGTG CTCAGCCGCT CCCTCGACTT CAACGAAACG
CTGCGCGATG TGCTGCGCGT GCTGCACGAC GAGGCCGGAC TGACCCGTGG CCTGATCAGC
GTGGTGGACC CGGACAGCGG CAAGCTGAAC ATCCACACCA TCTACACGCC GGAAGGCCCG
ATCCTCGACG ACAACCAGTA CGGCCCGGGC GAAGGCGTCA TTGGTCTGGT CCTCGAAAAA
CCACGCACCA TCAAACTGGC CCGCGGCGCC GATGAGCCGC ATTTCCTGAA CCGCCACGGT
GTCTATCAGC CCGACTTGCC CTTTATCGCC GTACCGATCA AGGTCGGCGG CGACCTGAAG
GGCGTCCTCG CCGTCCAGCC GGAAGCGCCG GAAGACGGCT TACTGGAAGA GCGCGCCCAG
TTCGTCGAAA TGGTTTCCAA CCTGATCGGC CAGAGCTTGC GGCTGGCGAT GGACGTTGCC
CAGGAAAAAT CGACCCTGCT CGAGGAGCGT GACCTGCTCC GGCGCACCGT GCGCCACCAG
TTCGGCTTCG ACAGCATGGT CGGTCGCTCG GCCGTGATGC GCCGCGTCTT CGACCAGGCA
CGCATGGTCG CCAAGTGGAA TACCACGGTA CTGATCCGCG GCGAAACCGG CACCGGCAAG
GAACTGATCG CCAATGCCAT CCACTACAAC TCGCCGCGCG CCCGCAACGC GCTGGTCAAG
CTCAACTGCG CCGCGCTGCC GGAAAACCTG CTTGAGTCGG AGCTCTTCGG CCACGAGCGC
GGCGCCTTCA CCGGTGCCGT CGAGTCACGC AAGGGTCGCT TCGAGCAAGC CCACGGCGGC
ACCCTCTTTC TCGATGAAAT CGGCGAGGTT TCGCCAGCTT TCCAGGCCAA GATGCTGCGC
ATCCTGCAGG AAGGCGAATT CGAGCGAGTC GGCGGCAGCA AGACGATCAA GGTCGATGTC
CGCATCATCG CCGCCACCCA CCGCGATCTG GAAACCGCCG TTGAAATGGG CGACTTCCGC
GAAGACTTGT TCTACCGCCT CAACGTCATG CCCCTCTTCC TGCCGCCGCT GCGCGAACGG
ATCGAGGACA TCCCGGAAAT CGCCCGTCAC CTGCTCGGCA AGATCGGCAA CGACCAGAAG
CGCAAGCTGA CCCTGACCGA CATGGCCAAT CGCCGCCTGG CCAGCCACGA ATGGCCGGGC
AATGTCCGTG AGCTGGAAAA CTGCCTGGAA CGCGCTGCGG TGTTGTCCGA CGATGGCCAC
ATCGATGTCG ACCTGATCCG CTTCCCCAGC GCCCGCGAAC GAAGCAGCCC CCGCCCCCTG
CGCACCGCCC CATCAGCGTT CAGCCCGGCT TCTGCCTCAA GCCCGGAAAT CGACATCGAC
GACCCGAACC TCTCGGAAAA AGAACGCGTC ATTGCGGCGC TGGAACAGGC AGGCTGGGTC
CAGGCCAAGG CGGCGCGCAT TCTCGGCATG ACGCCTCGGC AGATCGCCTA CCGGATTCAG
ACGCTGAATA TCGAGGTCAA GCAGTTCTAG
 
Protein sequence
MTRSKKNHEN SMHEHTIHPL GDECPPTGGF CRTHELNIQL LAALYAVSRV LSRSLDFNET 
LRDVLRVLHD EAGLTRGLIS VVDPDSGKLN IHTIYTPEGP ILDDNQYGPG EGVIGLVLEK
PRTIKLARGA DEPHFLNRHG VYQPDLPFIA VPIKVGGDLK GVLAVQPEAP EDGLLEERAQ
FVEMVSNLIG QSLRLAMDVA QEKSTLLEER DLLRRTVRHQ FGFDSMVGRS AVMRRVFDQA
RMVAKWNTTV LIRGETGTGK ELIANAIHYN SPRARNALVK LNCAALPENL LESELFGHER
GAFTGAVESR KGRFEQAHGG TLFLDEIGEV SPAFQAKMLR ILQEGEFERV GGSKTIKVDV
RIIAATHRDL ETAVEMGDFR EDLFYRLNVM PLFLPPLRER IEDIPEIARH LLGKIGNDQK
RKLTLTDMAN RRLASHEWPG NVRELENCLE RAAVLSDDGH IDVDLIRFPS ARERSSPRPL
RTAPSAFSPA SASSPEIDID DPNLSEKERV IAALEQAGWV QAKAARILGM TPRQIAYRIQ
TLNIEVKQF
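
The sequences above are laid out in blocks of 10 characters separated by spaces, so they need to be cleaned before any calculation. The following is a rough sketch, not the database's own method, of how one might strip that layout, compute a GC fraction comparable to the "GC content" field, and run a basic sanity check on the coding sequence. All function names are hypothetical, and the stop-codon check assumes the standard TAA/TAG/TGA stops.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C, ignoring layout whitespace."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    seq = clean_sequence(seq)
    return len(seq) >= 6 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First 10-base block of the gene sequence above.
print(gc_fraction("GTGACCCGAT"))  # 0.6
```

To apply this to the full record, paste the complete gene sequence (with its spaces and line breaks) into a string and pass it to `gc_fraction` and `looks_like_complete_cds`.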