Gene Dshi_2261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2261 
SymbolrpoD 
ID5713914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2381601 
End bp2383595 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content61% 
IMG OID641268183 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_001533598 
Protein GI159044804 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.435799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCA AAGACACTGA CGACCGCAAG ACCGATGACC AGGACGCCGA GGCAATGCTC 
GACATGAGCC AAGCGGCGGT CAAAAAGATG ATCGCCGAAG CCCGGACGCG CGGCTACATA
ACCTACGATC AGCTTAATCA GGTTCTGCCG CCGGATCAGG TCAGCTCGGA ACAGATCGAG
GACGTGATGT CCATGCTGTC CGAGATGGGC ATCAACATCA TCGAGGAAGA CGAGGCCGAG
GACGGCGATG CGCCCGCGAA TGCATCGACG CAGGTCGTCG AATCCTCCAG TTCCCGCGAA
GTAGCGGTCT CTACCTCCTC GTCGGAGACG CTCGACCGCA CCGACGACCC GGTCCGGATG
TATCTGCGCG AGATGGGGTC TGTCGAACTG CTGAGCCGCG AGGGCGAGAT TGCCATCGCC
AAGCGGATCG AAGCGGGCCG GAATACCATG ATCGCGGGGC TCTGCGAGAG CCCGCTGACC
TTTCAGGCGA TCACGATCTG GCGCGACGAA CTGCTGTCCG AGGACATCCT GTTGCGGGAC
GTCATTGACC TGGAGGCCAC CATGGGTGGG GCCATGGACG AGGACGAAGA AAGTTCCAGT
GTCGTGGAAA TGGCAGCCAA CAGCCCGGCC GCCCCGTCAG AGAAGTCCGA CGAGCCGGAA
CTGGACGCGG ACGGCAACCC GATTGCCAAG ACCGACGATG AAGACGACGA GGACGATCAG
GCCAACATGT CCCTCGCTGC CATGGAAGCG GCCCTCAAGC CCAAGGTGCT GGAGACGCTC
GACCTGATCG CGCGCGACTA TGCACAGTTG AGCGAGATGC AGGATCTGCG CATTTCGGCC
ACGCTGAACG AGGATGAGAG CTTCTCGGTC GAGGACGAGT CGAACTACCA GAAGCTGCGG
GCCGAGATCG TTCTCCTCGT CAACGAGTTG CACTTGCACA ACAACCGTAT CGAGGCGCTG
ATCGACCAGC TGTATGGGAT CAACCGTCGG ATCATGGCCA TCGACTCGAG CATGGTGAAG
CTTGCCGATC AGGCGCGGAT CAACCGCCGC GAGTTCGTCG AAGCCTATCG CGGCTACGAG
TTGGACCCGA CCTGGTTGGA GCGTATGGCC GAAAAGCCCG GGCGCGGATG GCAGGCTTTC
ATCGAACGCT CCACAACCAA GATCGAAGAG CTGCGCTCGG ACATGGCCCA GGTCGGCACC
TATGTCGGCG TGGACATCAC CGAGTTCCGT CGCATCGTGC AGCAGGTGCA GAAAGGCGAG
AAGGAGGCCC GGCAGGCGAA GAAGGAAATG GTCGAGGCGA ACCTGCGCCT CGTAATTTCC
ATTGCCAAGA AATACACCAA TCGCGGCCTG CAGTTCTTGG ACCTTATCCA GGAAGGCAAT
ATCGGCCTGA TGAAGGCCGT CGACAAGTTC GAGTATCGCC GCGGCTACAA GTTTTCCACC
TATGCGACCT GGTGGATCCG TCAGGCGATC ACCCGGTCCA TTGCGGACCA GGCGCGCACG
ATCCGGATCC CAGTGCATAT GATCGAGACG ATCAACAAGC TGGTGCGCAC CGGGCGCCAG
ATGCTGCATG AGATCGGCCG CGAGCCGACA CCGGAGGAAC TGGCCGAAAA GCTCCAAATG
CCGCTGGAAA AGGTGCGCAA GGTGATGAAG ATCGCCAAGG AGCCGATCAG CCTGGAGACG
CCCATCGGCG ACGAGGAAGA CAGCCAGCTT GGGGACTTCA TCGAGGACAA GAATGCGGTT
CTGCCGCTGG ACTCGGCCAT TCAGGAGAAC CTGAAAGAGA CCACGACCCG GGTGCTGGCT
TCGCTGACCC CGCGCGAAGA ACGCGTGTTG CGCATGCGGT TCGGAATCGG GATGAACACC
GACCATACGC TCGAAGAGGT TGGTCAGCAG TTCAGCGTGA CCCGCGAACG CATCCGGCAG
ATCGAGGCGA AAGCGCTGCG CAAGCTCAAG CATCCCAGCC GGAGCCGGAA GCTGCGCAGT
TTCCTGGACC AGTAA
 
Protein sequence
MAAKDTDDRK TDDQDAEAML DMSQAAVKKM IAEARTRGYI TYDQLNQVLP PDQVSSEQIE 
DVMSMLSEMG INIIEEDEAE DGDAPANAST QVVESSSSRE VAVSTSSSET LDRTDDPVRM
YLREMGSVEL LSREGEIAIA KRIEAGRNTM IAGLCESPLT FQAITIWRDE LLSEDILLRD
VIDLEATMGG AMDEDEESSS VVEMAANSPA APSEKSDEPE LDADGNPIAK TDDEDDEDDQ
ANMSLAAMEA ALKPKVLETL DLIARDYAQL SEMQDLRISA TLNEDESFSV EDESNYQKLR
AEIVLLVNEL HLHNNRIEAL IDQLYGINRR IMAIDSSMVK LADQARINRR EFVEAYRGYE
LDPTWLERMA EKPGRGWQAF IERSTTKIEE LRSDMAQVGT YVGVDITEFR RIVQQVQKGE
KEARQAKKEM VEANLRLVIS IAKKYTNRGL QFLDLIQEGN IGLMKAVDKF EYRRGYKFST
YATWWIRQAI TRSIADQART IRIPVHMIET INKLVRTGRQ MLHEIGREPT PEELAEKLQM
PLEKVRKVMK IAKEPISLET PIGDEEDSQL GDFIEDKNAV LPLDSAIQEN LKETTTRVLA
SLTPREERVL RMRFGIGMNT DHTLEEVGQQ FSVTRERIRQ IEAKALRKLK HPSRSRKLRS
FLDQ