Gene TM1040_2141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2141 
Symbol 
ID4076455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2245649 
End bp2247643 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content59% 
IMG OID638007461 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_614135 
Protein GI99081981 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.320176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.578732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCA AAGATACTGA CGACCGCAAG CCTGACGATC AGGACGCCGA AATTTCGCTC 
GATATGAGCC AGGCGCAAGT CAAGAAGATG ATCGCCGAAG CCCGCGAGAA GGGCTACATC
ACCTACGATC AGCTCAATCA GGTTTTGCCG CCGGATCAGG TCTCCTCCGA ACAGATCGAA
GACGTGATGT CTATGCTGTC GGAAATGGGC ATCAACATCA TCGAAGATGA AGAGGCCGAG
GAAGAAGAGA ACAAAGGCAC CACCGAACTG GTGACCACGG AATCCAACCG TGAAGTCGCA
GTGGCTGGCG CTGCGACTGA GAAGCTCGAC CGGACCGACG ATCCGGTCCG CATGTACCTG
CGCGAAATGG GCAGCGTCGA GCTGTTGTCG CGCGAGGGCG AGATCGCGAT TGCAAAGCGC
ATCGAGGCCG GGCGCAACAC CATGATCCTC GGGCTCTGCG AAAGCCCGCT GACATTTCAG
GCGATCACCA TCTGGCATGA CGAACTCCTC TCCGAGGATA TTCTTCTGCG CGATGTGATT
GATCTTGAGG CGACCTTTGG CAATCAGATG GACGAGGACG GTGACGTCGC CGAGCCGGTT
GTCGATCCCT CTGCAGTGTC TGGCGCTGCA AAGCCAGAAA AAGACTCCGG GCCCGAGCTT
GATGCCGATG GCAACCCGAT CCTCAACAAC GACGACGATG ACGACGACGA CGATGATCAG
GCCAACATGT CCCTTGCGGC CATGGAAGCG GCGCTCAAGG ATCGGGTTCT GGAAACGCTC
GAGCGGATTT CCAGTGACTT TGCGATGCTG TCGGAAATGC AGGATCTGCG GATCTCTGCG
ACGCTCAATG AGGATGGGTC CTTTTCGGCT GATGACGAAG CCAAGTACCA GCAGCTGCGC
TCTGAAATCG TGGAACTGGT GAACGGTCTT CACCTGCACA ACAACCGCAT CGAGGCGCTG
ATCGACCAGC TTTATGGTAT CAACCGCCGC GTCATGCAGA TCGATAGCGC CATGGTCAAA
CTCGCTGACC AGGCCCGTAT CAACCGCAAG GAATTCGTCG AAGCTTACCG TGGTCGCGAA
CTCGACCCCA ACTGGCTTTC CGAGATGAGC GAAAAGCCCG GTCGGGGCTG GCAGATGTTC
ATCGAGCGCT CGACCGAGAA GGTCGAAGAG CTGCGCGCCG ACATGGCACA GGTCGGTCAG
TACGTCGGTC TGGACATCTC GGAATTCCGC CGCATCGTGC AGCAGGTGCA AAAAGGTGAA
AAAGAGGCCC GTCAGGCCAA GAAGGAAATG GTCGAGGCCA ACCTGCGCCT CGTGATCTCA
ATCGCCAAGA AATACACCAA CCGGGGCCTG CAGTTCCTCG ACCTCATTCA GGAAGGCAAC
ATCGGCCTGA TGAAGGCGGT CGACAAGTTC GAATACCGTC GCGGCTATAA GTTCTCGACC
TATGCAACCT GGTGGATCCG TCAGGCGATC ACCCGCTCGA TCGCCGATCA AGCGCGCACC
ATCCGTATCC CGGTGCATAT GATCGAGACC ATCAACAAGC TGGTCCGCAC CGGCCGTCAG
ATGCTCCACG AAATCGGCCG CGAGCCGACG CCGGAAGAGC TGGCAGAAAA GCTGCAGATG
CCGCTCGAGA AGGTCCGCAA GGTGATGAAG ATCGCCAAGG AGCCCATTTC GCTCGAGACT
CCGATCGGGG ACGAGGAAGA CAGCCAGCTG GGCGATTTCA TCGAGGACAA GAATGCCGTG
CTGCCCCTGG ACAGCGCCAT TCAGGAAAAC CTCAAGGAAA CCACCACGCG GGTTCTGGCC
TCGCTCACCC CGCGCGAGGA GCGCGTGCTG CGGATGCGGT TTGGTATCGG CATGAACACC
GATCACACGC TCGAAGAGGT GGGCCAGCAG TTCAGCGTGA CCCGCGAACG GATCCGTCAG
ATCGAGGCCA AGGCGCTCAG GAAGCTCAAG CATCCGAGCC GCTCTCGCAA GCTGCGCAGC
TTCCTCGATC AGTAA
 
Protein sequence
MAAKDTDDRK PDDQDAEISL DMSQAQVKKM IAEAREKGYI TYDQLNQVLP PDQVSSEQIE 
DVMSMLSEMG INIIEDEEAE EEENKGTTEL VTTESNREVA VAGAATEKLD RTDDPVRMYL
REMGSVELLS REGEIAIAKR IEAGRNTMIL GLCESPLTFQ AITIWHDELL SEDILLRDVI
DLEATFGNQM DEDGDVAEPV VDPSAVSGAA KPEKDSGPEL DADGNPILNN DDDDDDDDDQ
ANMSLAAMEA ALKDRVLETL ERISSDFAML SEMQDLRISA TLNEDGSFSA DDEAKYQQLR
SEIVELVNGL HLHNNRIEAL IDQLYGINRR VMQIDSAMVK LADQARINRK EFVEAYRGRE
LDPNWLSEMS EKPGRGWQMF IERSTEKVEE LRADMAQVGQ YVGLDISEFR RIVQQVQKGE
KEARQAKKEM VEANLRLVIS IAKKYTNRGL QFLDLIQEGN IGLMKAVDKF EYRRGYKFST
YATWWIRQAI TRSIADQART IRIPVHMIET INKLVRTGRQ MLHEIGREPT PEELAEKLQM
PLEKVRKVMK IAKEPISLET PIGDEEDSQL GDFIEDKNAV LPLDSAIQEN LKETTTRVLA
SLTPREERVL RMRFGIGMNT DHTLEEVGQQ FSVTRERIRQ IEAKALRKLK HPSRSRKLRS
FLDQ