Gene Daro_0526 details

Gene Information

Locus tag: Daro_0526
Symbol:
ID: 3567356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 586121
End bp: 588094
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 60%
IMG OID: 637678969
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_283753
Protein GI: 71906166
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain; [TIGR02937] RNA polymerase sigma factor, sigma-70 family
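
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function name `check_record` is hypothetical and not part of the source database.

```python
def check_record(start_bp: int, end_bp: int, gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if the coordinates, gene length and protein length agree.

    Assumptions (not stated in the record itself):
      - start_bp and end_bp are 1-based, inclusive coordinates;
      - the gene is unspliced, so the span equals the CDS length;
      - the last codon is a stop codon and is excluded from the protein length.
    """
    span = end_bp - start_bp + 1           # inclusive span in base pairs
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                      # a CDS should be a whole number of codons
        return False
    return span // 3 - 1 == protein_length_aa  # drop the stop codon


# Values copied from the Gene Information fields of this record.
print(check_record(586121, 588094, 1974, 657))
```

With the values listed for Daro_0526 the three fields are mutually consistent under these assumptions: 588094 - 586121 + 1 = 1974 bp, and 1974 / 3 - 1 = 657 aa.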


Plasmid Coverage information

Num covering plasmid clones: 61
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAGG CAAAAGACAC CGCTAAGGAA GCCCCGAAGG CTCGCACCAG CAAGGCCAAG 
GATAAAGCCG CCGAAAAGGC GCTGCTCCAG GGCCAGATGG CTGAGACGCC GGAACCGCTT
GACGCCGAAG CGCGCAAAAT GCGCCTCAAG GCCCTGATCA AGCTGGGCAA GGAACGCGGC
TACCTGACTT ACGCCGAAAT CAACGATCAC CTGCCGGATG ACGTCGTCGA TGCCGAAAGT
ATCGAAGCGA TCATCTCGAC TTTCAGCGAA ATGAGCATCC AGGTCTTCGA CGAAGCCCCG
GCTGCAGAAG ACCTGCTGAT GTCCGACACT GCTGCGGTTG CCGCCGACGA CGAGGAAGTC
GAAGCGCAGG CCGAGCAGGC CCTGTCCACT GTCGACTCCG AATTCGGCCG CACCACCGAC
CCGGTCCGCA TGTACATGCG TGAAATGGGC TCGGTCGAAC TCCTCACCCG CGAAGGCGAA
ATCGAGATCG CCAAGCGAAT CGAGGAAGGC CTGAAGCACA TGATCCAGGC GATTGCCGCC
TGCCCGACGA CGATTTCCGA CATTCTCGAA TTGGCCGCCA AGGTCGAATC CGATGAAATG
CGCATCGACG AACTGGTCGA CGGCCTGATC GACCCGAACG CTGTCGAAGA AGCCCCGGCG
GCCGAAATGC CGGAAGCCGA AGAAGACGAA GAAGAAGACG GCGATGACGA CGGCGAGGGT
GCTGGCGGCG GCGCCGCTGC GGCCTCCCTG CTGCAACTGA AGACCGATGC CTTGGAGCGC
TTCAAGAACA TCAAGGGCAT CCACGCCAAG ATGCAGAAGA TGCTCCCGAC CAAGGGTTCG
CACGACAAGG TGTACATCAA GCTGCTGCAA CAGGTTTCCG ACGAACTGAT GAACATCCGC
TTCACCTCGC GTTCGATCGA GCGCCTGTGC GATAGCGTGC GCGGCATGGT CGAGCAGGTT
CGCGGTTGCG AGCGCAAGAT CCAGCAAATC TGCGTGGACC GCGTCAAGAT GCCGCGCCCG
CACTTCATCC AGTCCTTCCC GGGCAATGAA ATCAACCTCG ACTGGGCCGA TGCCGAAGTC
GCTGCCGCCC CCAAGACCTA CGTTGCCATC CTGACCCGCA ACGCGCCGGA CATCAAGGAA
GAGCAGAAGA AACTGCTCGC CCTGCAGGAC CGTATCGGCA TTCCGCTCAA GGACCTGAAA
GACATCAACA AGCAGATGTC CACCGGCGAA GCCAAGGCCC GCCGCGCCAA GCGCGAAATG
ACCGAAGCCA ACCTGCGTCT GGTCATCTCG ATCGCCAAGA AATACACCAA CCGCGGCCTG
CAGTTCCTCG ACCTGATCCA GGAAGGCAAT ATCGGCCTGA TGAAGGCGGT GGACAAGTTC
GAATACCGCC GCGGCTACAA GTTCTCCACC TACGCCACAT GGTGGATCCG CCAGGCCATC
ACCCGCTCCA TCGCCGACCA GGCACGGACC ATCCGTATCC CCGTGCACAT GATCGAAACG
ATCAACAAGA TGAACCGGAT CAGCCGCCAG ATCCTGCAGG AAACGGGTGC CGAACCCGAT
CCGGCGACGC TAGCCAAGAA GATGGACATG CCGGAAGACA AGATTCGCAA GATCATGAAG
ATTTCCAAGG AGCCAATCTC CATGGAAACC CCGATCGGCG ACGACGACGA CTCGCACCTC
GGCGACTTCA TCGAAGACTC AGGCACCCTG GCCCCGGCCG ATGCCGCGAT GTATTCCAGC
CTGCGCGGCG TCACCAAGGA AATCCTCGAC ACGCTGACGA CGAGGGAAGC CAAGGTACTG
CGCATGCGCT TCGGCATCGA AATGAACACC GATCACACGC TGGAAGAAGT CGGCAAGCAA
TTCGACGTCA CCCGCGAACG TATCCGTCAG ATAGAAGCCA AAGCCCTGCG CAAGCTGCGT
CACCCGACCC GCTCCGACAA GCTGCGCAGT TTCGTCGACA CGAACAACAG TTAA
 
Protein sequence
MAKAKDTAKE APKARTSKAK DKAAEKALLQ GQMAETPEPL DAEARKMRLK ALIKLGKERG 
YLTYAEINDH LPDDVVDAES IEAIISTFSE MSIQVFDEAP AAEDLLMSDT AAVAADDEEV
EAQAEQALST VDSEFGRTTD PVRMYMREMG SVELLTREGE IEIAKRIEEG LKHMIQAIAA
CPTTISDILE LAAKVESDEM RIDELVDGLI DPNAVEEAPA AEMPEAEEDE EEDGDDDGEG
AGGGAAAASL LQLKTDALER FKNIKGIHAK MQKMLPTKGS HDKVYIKLLQ QVSDELMNIR
FTSRSIERLC DSVRGMVEQV RGCERKIQQI CVDRVKMPRP HFIQSFPGNE INLDWADAEV
AAAPKTYVAI LTRNAPDIKE EQKKLLALQD RIGIPLKDLK DINKQMSTGE AKARRAKREM
TEANLRLVIS IAKKYTNRGL QFLDLIQEGN IGLMKAVDKF EYRRGYKFST YATWWIRQAI
TRSIADQART IRIPVHMIET INKMNRISRQ ILQETGAEPD PATLAKKMDM PEDKIRKIMK
ISKEPISMET PIGDDDDSHL GDFIEDSGTL APADAAMYSS LRGVTKEILD TLTTREAKVL
RMRFGIEMNT DHTLEEVGKQ FDVTRERIRQ IEAKALRKLR HPTRSDKLRS FVDTNNS
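
The relationship between the two sequence blocks can be illustrated by translating parts of the gene sequence and comparing them with the protein sequence. The following is a minimal sketch in plain Python, not the database's own method. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). The helper names `clean` and `translate` are hypothetical. Only the first and last rows of the gene sequence are used, copied verbatim from the listing above; because each full row holds 60 bases, the last row begins in frame.

```python
from itertools import product

# Standard codon table built in TCAG order; the third base varies fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; stop codons are rendered as '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last rows of the gene sequence, copied from the listing above.
first_row = "ATGGCTAAGG CAAAAGACAC CGCTAAGGAA GCCCCGAAGG CTCGCACCAG CAAGGCCAAG"
last_row = "CACCCGACCC GCTCCGACAA GCTGCGCAGT TTCGTCGACA CGAACAACAG TTAA"

print(translate(first_row))  # MAKAKDTAKEAPKARTSKAK
print(translate(last_row))   # HPTRSDKLRSFVDTNNS*
```

The first result matches the first 20 residues of the protein sequence, and the second matches its final 17 residues followed by the stop codon TAA.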