Gene TM1040_0563 details

Gene Information

Locus tag: TM1040_0563
Symbol:
ID: 4077914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 599575
End bp: 600474
Gene Length: 900 bp
Protein Length: 299 aa
Translation table: 11
GC content: 57%
IMG OID: 638005860
Product: RNA polymerase factor sigma-32
Protein accession: YP_612558
Protein GI: 99080404
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02392] alternative sigma factor RpoH
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
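
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 599575
end_bp = 600474

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein length.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 900 0 299
```

Both results match the Gene Length (900 bp) and Protein Length (299 aa) fields.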


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.939582
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0697509
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAATT ATGCAAACCT TCCCGCCCCG TCTCCCGAAG GCGGACTCAA TCGGTATCTG 
CAGGAAATTC GCAAGTTCCC GCTTCTGGAG CCGGAAGAGG AATACATGCT GGCCAAGAGA
TGGGTCGAAG AGCAGGACAC CGAAGCCGCG CACAAGATGG TAACATCGCA TCTGCGACTG
GCAGCAAAAA TTGCCATGGG ATACCGGGGT TACGGGTTGC CTCAGGCAGA AGTCATTTCT
GAAGCTAATG TTGGTCTGAT GCAGGCGGTA AAGCGGTTCG ATCCGGAAAA AGGCTTCCGT
CTGGCAACCT ATGCGATGTG GTGGATCCGC GCCTCCATTC AGGAGTATAT CCTGCGGTCC
TGGTCGCTGG TGAAGCTTGG GACCACATCT GCGCAGAAGA AGCTGTTTTT CAATCTGCGC
AAAGCCAAGG CCCGGATCGG TGCACTTGAG GATGGAGATC TGCGGCCCGA AGTGGTGAAG
AAGATCGCCA CAGATCTTGG CGTGACCGAG GATGAGGTGA TCTCCATGAA CCGACGTATG
TCGGGCGGCG ATGCGTCGCT CAATGCCATG GTGGGCAGCG ACGGTGACAG CACCATGCAG
TGGCAGGATT GGCTCGAGGA TGAGGACGCC GATCAGGCGG GAGATTACGA GGCCCGTGAC
GAGCTGCAAG CGCGCCGGGA GCTTCTCGCC GAGGCCATGA GCGTCCTCAA CGATCGCGAG
AAAGACATTT TGACCCAGCG TCGTCTGGCC GAGCAGGCCA AGACGCTTGA AGAGCTGAGT
GTCCAATATG ATGTGAGCCG GGAGCGCATT CGCCAAATCG AAGTGCGCGC CTTTGAAAAG
CTACAGAAGA AAATGCGCGA GCTCGCGGCT GGCAAGGGGA TGCTGCAGTC GAAGCTCTGA
 
Protein sequence
MANYANLPAP SPEGGLNRYL QEIRKFPLLE PEEEYMLAKR WVEEQDTEAA HKMVTSHLRL 
AAKIAMGYRG YGLPQAEVIS EANVGLMQAV KRFDPEKGFR LATYAMWWIR ASIQEYILRS
WSLVKLGTTS AQKKLFFNLR KAKARIGALE DGDLRPEVVK KIATDLGVTE DEVISMNRRM
SGGDASLNAM VGSDGDSTMQ WQDWLEDEDA DQAGDYEARD ELQARRELLA EAMSVLNDRE
KDILTQRRLA EQAKTLEELS VQYDVSRERI RQIEVRAFEK LQKKMRELAA GKGMLQSKL
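
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. The sketch below is an illustration only: the helper names (`clean`, `translate`, `gc_content`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted, which does not matter here because the gene starts with ATG).

```python
# Codon table in NCBI order: bases T, C, A, G for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = (BASES.index(codon[0]) * 16
                 + BASES.index(codon[1]) * 4
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence above.
first_blocks = "ATGGCAAATT ATGCAAACCT"
print(translate(first_blocks))  # MANYAN
```

The output matches the first six residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the Protein sequence and GC content fields.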