Gene Dde_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDde_2052 
Symbol 
ID3757060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. G20 
KingdomBacteria 
Replicon accessionNC_007519 
Strand
Start bp2094872 
End bp2096641 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content52% 
IMG OID637782940 
Productsigma-70 factor 
Protein accessionYP_388544 
Protein GI78357095 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0289381 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAACA TTAAGGAAAT CCAGCAAATC AAGACCCTGA TTGCCAAAGG CAAGGTCGCG 
GGCTTTCTCA CGTTTGAGGA AGTTAACAAA GCTTTGCCGG CAGAGGTCAA CACGCCTGAG
CAGATTGAAG AAATCATCGG GATTTTCGAT CAGCTTGATA TTGCGATTGT CGATTCTGAA
AAAGACGGCA AAAAAATAAG CGTTTCCCCT GCCGAGGCAG ATGACGAAGG GGGTGACGGA
GGGCTGGAGC TGACCGATGA AGACGATCCT GCCGACTATT CTTCGCGCAG TACAGACCCT
GTAAGGATGT ACCTGCGGGA AATGGGCGCT GTGCCGTTGC TCGACCGTGA CGGTGAAGTG
AACATTGCCA AGAAGATAGA GACCGGAGAG CAGGATGTTC TGTACGCTCT GGTGGAAGTT
CCCATCGCTG TGGAAGAGCT TATTTCCGTG GGCGAAGACC TGAAGGAAAA CCGCATCAAA
CTTAAGGATG TGGTTAAAAC CATCGAGGAA GACGACCCCA GCGAAGACGA GATGAACCAG
CGGCAGCGGG TTATTTTGCT GCTGGATGAA ATCCAGTCTA CATTCAAGAA AAAACGCAAG
GTTTATCAGC GTCTTGACGA GTGCTGTACG CTTGAGCGCC GCGTATATGG TATCCAGAAG
GAAATTATCG CCTATAAAGA GGAGATAGTC TCCCGACTGC GCGACATCAA GCTGGAAAAA
ACGCTTATCG ACCGTATTAT AGAAACGGTT GAAGACTATG TGCGTCAGAT GCATAACTGC
CAGCGGGACC TTTCGGCATA TATATTGTCC ACCGGCAAGA CCCAGACGGA AATGCGCGAG
CTGTTCCGTC AGCTGGACGA CCGCGAAATC AATCCGGTTG TGGCGGCAGA CAGCCTTAAT
CTGACGGTTG AAGAGCTGTT TTCATTCAAG GAAATGATCA TGGGCAAAAT GGAAATCCTG
AACAGGCTTC AGGAGAAATG CTGCCACAAC GTGACTGACC TTGAAGAAGT GCTCTGGCGT
ATCAAGCGCG GAAACAGTCA TGCCATGCGT GCCAAGCAGG AACTTATCCG TTCCAACCTG
CGCCTTGTGG TGAGCATCGC CAAAAAATAC ACCAACCGCG GTTTGCAGTT TCTTGACCTG
ATTCAGGAAG GGAACATCGG CCTGATGAAA GCTGTGGACA AGTTCGAGTA TCAGCGCGGG
TACAAGTTTT CGACCTACGC CACGTGGTGG ATCCGTCAGG CCATCACGCG CGCCATTGCA
GATCAGGCGC GGACAATCCG TATTCCTGTG CATATGATCG AAACCATCAA CAAGCTGATC
CGTACATCGC GTTATCTTGT TCAGGAACTC GGGCGTGACC CCACACCTGA AGAAATCGCC
GAACGTATGG ATTACCCCAT CGACAAGGTG AAAAAGGTTC TGAAAATTGC CAAGGAACCC
ATCTCCCTTG AAACGCCTAT CGGCGATGAA GAAGATTCGA GCCTCGGTGA TTTTATTGAA
GACAAGAAAG CCGTTGCTCC TGCCGAAGAA GTGGTCAACA CCAAGCTGAG CGAGCAGATT
GCCGCAGTGC TGGCTGATCT GACTCCCCGC GAAGAGCAGG TGCTGCGTAA GCGCTTCGGC
ATCAACGAGA AGTCCGATCA CACTCTTGAG GAGGTCGGAA AGCTGTTTAA CGTGACACGA
GAGCGCATCA GGCAGATAGA AGCCAAGGCG CTGCGCAAGC TCAGGCATCC GGTAAGGAGC
CAGACCCTGC GCTCGTACTA CGAAAGCTAG
 
Protein sequence
MGNIKEIQQI KTLIAKGKVA GFLTFEEVNK ALPAEVNTPE QIEEIIGIFD QLDIAIVDSE 
KDGKKISVSP AEADDEGGDG GLELTDEDDP ADYSSRSTDP VRMYLREMGA VPLLDRDGEV
NIAKKIETGE QDVLYALVEV PIAVEELISV GEDLKENRIK LKDVVKTIEE DDPSEDEMNQ
RQRVILLLDE IQSTFKKKRK VYQRLDECCT LERRVYGIQK EIIAYKEEIV SRLRDIKLEK
TLIDRIIETV EDYVRQMHNC QRDLSAYILS TGKTQTEMRE LFRQLDDREI NPVVAADSLN
LTVEELFSFK EMIMGKMEIL NRLQEKCCHN VTDLEEVLWR IKRGNSHAMR AKQELIRSNL
RLVVSIAKKY TNRGLQFLDL IQEGNIGLMK AVDKFEYQRG YKFSTYATWW IRQAITRAIA
DQARTIRIPV HMIETINKLI RTSRYLVQEL GRDPTPEEIA ERMDYPIDKV KKVLKIAKEP
ISLETPIGDE EDSSLGDFIE DKKAVAPAEE VVNTKLSEQI AAVLADLTPR EEQVLRKRFG
INEKSDHTLE EVGKLFNVTR ERIRQIEAKA LRKLRHPVRS QTLRSYYES