Gene TM1040_0279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0279 
Symbol 
ID4077414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp284956 
End bp285972 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content58% 
IMG OID638005573 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_612274 
Protein GI99080120 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.464101 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCATA AGAATTGGGC AGAATTGATC AAGCCCACGC AGCTTGAGGT GAAACCGGGC 
AATGATCCGG CACGTCAGGC AACGCTCGTT GCGGAACCGC TGGAGCGTGG CTTTGGTCTG
ACGCTCGGCA ACGCGCTGCG CCGCATCCTG ATGAGCTCGC TGCAAGGCGC GGCCATCACA
TCCGTCCAGA TCGACAACGT GCTGCACGAG TTTTCCTCCG TGGCCGGTGT TCGTGAAGAC
GTCACAGACA TCATCCTGAA CCTCAAGCAG GTCTCCCTGC GCATGGAAGT CGAAGGGCCC
AAGCGCCTGT CGATCAATGC CAAAGGTCCG GCCGTCGTCA CCGCAGGCGA CATTGCCGAA
ACCGCTGGCA TCGAAGTTCT GAACCGCGAG CACGTCATCT GCCACCTCGA CGATGGTGCG
GATCTGTTCA TGGAACTCAC TGTCAACACC GGCAAAGGCT ATGTCTCTGC CGAGAAGAAC
AAGCCCGAGG ACGCACCGAT TGGTCTTATT CCGATCGACG CGATCTATTC CCCGGTCAAG
AAGGTCTCTT ACGACGTTCA GCCGACCCGC GAAGGTCAGG TTCTGGACTA TGACAAGCTG
ACCCTCAAAG TTGACACCGA CGGCTCCATC ACCCCCGAAG ACGCGCTGGC TTTTGCGGCC
CGCATCCTTC AGGACCAGCT GTCGATCTTC GTGAACTTCG ACGAGCCGGA ATCCGCAGGT
CGTCAGGACG AGGACGATGG TCTCGAGTTC AACCCGCTTC TCCTCAAGAA AGTGGACGAG
CTGGAACTGT CCGTGCGTTC GGCAAACTGC CTCAAGAACG ACAACATCGT CTATATCGGC
GATCTGATCC AGAAAACCGA AGCCGAGATG CTCCGCACCC CGAACTTCGG CCGCAAGTCC
TTGAACGAAA TCAAGGAAGT GCTGTCTGGC ATGGGTCTGC ACCTCGGTAT GGACGTCGAG
GACTGGCCGC CGGACAACAT CGAAGAGCTG GCCAAGAAAT TCGAAGACAG CTTCTAA
 
Protein sequence
MIHKNWAELI KPTQLEVKPG NDPARQATLV AEPLERGFGL TLGNALRRIL MSSLQGAAIT 
SVQIDNVLHE FSSVAGVRED VTDIILNLKQ VSLRMEVEGP KRLSINAKGP AVVTAGDIAE
TAGIEVLNRE HVICHLDDGA DLFMELTVNT GKGYVSAEKN KPEDAPIGLI PIDAIYSPVK
KVSYDVQPTR EGQVLDYDKL TLKVDTDGSI TPEDALAFAA RILQDQLSIF VNFDEPESAG
RQDEDDGLEF NPLLLKKVDE LELSVRSANC LKNDNIVYIG DLIQKTEAEM LRTPNFGRKS
LNEIKEVLSG MGLHLGMDVE DWPPDNIEEL AKKFEDSF