Gene RoseRS_1158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1158 
Symbol 
ID5208109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1440837 
End bp1441808 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content54% 
IMG OID640594775 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_001275515 
Protein GI148655310 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00804357 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0422731 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGATA TCGCCATGCC AAAGATTGAA GTCGTTACCG CTGCTGAAAA TTACGGACGG 
TTCAAAATCG AGCCGCTCGA TCCAGGGTAC GGACATACGC TTGGAAATGC GCTGCGCCGC
GTTCTGCTCT CTTCCATCCC TGGTGCGGCG ATTACGAAGA TCAAGATCGA GGGAGTATTT
CATGAGTTCT CGACTATTCC GGGGGTCAAA GAAGACGTCA CTGAGATTGT CTTGAACATC
AAAGGTATTC GTTTGCGTTC CTATGCCGAA CGTCCTGTGA AAATATCGCT GTCGAAACGC
GGCGCCGGCG TTGTTCGCGC TGCGGATATT GATGCGCCAA GCAATGTCGA GATCGTTAAT
CCCAACCACT ATATCTGTAC GATCGATCGC GACGATGCCG CCATTGATAT GGAAATGACG
GTCGAACGCG GGCGCGGCTA CCTGCCGGCG GATCAGCGCG ATGCGCTGCC GATCGGCGAA
ATCCCGATTG ATGCGATCTT TACTCCTGTC CCCAAGGTCA ATTATGTGGT TGAACATATT
CGCGTGGGGC AGGCGACCGA TATCGACAGC CTGTTGATCG AAATCTGGAC TGATGGAACG
ATCAAGCCGG GGGATGCGCT CAGCCACGCG GCGCAGGTGC TGGTTCAGTA TTCGCAGACG
ATTGCCGACT TCAATCGCCT CTCGACAGAA GCGGAACCGA CTACAGCGCC CAACGGACTG
GCTATCCCGG CGGATATTTA TGATACGCCG ATCGAGGAGC TCGATCTCTC AACACGGACC
TACAACTGTC TCAAGCGCGC CGATATTACC AAAGTCGGTC AGGTGCTCGA AATGGACGAA
AAGGCGCTGC TGTCGGTGCG GAATCTGGGA CAAAAATCGA TGGAAGAAAT CCGCGATAAA
TTGATCGAAC GCGGCTATAT TCCACGGATC GGTCAAACAT CGCACGCGGC TCGTGCAGAG
ATCGAGGGTT GA
 
Protein sequence
MLDIAMPKIE VVTAAENYGR FKIEPLDPGY GHTLGNALRR VLLSSIPGAA ITKIKIEGVF 
HEFSTIPGVK EDVTEIVLNI KGIRLRSYAE RPVKISLSKR GAGVVRAADI DAPSNVEIVN
PNHYICTIDR DDAAIDMEMT VERGRGYLPA DQRDALPIGE IPIDAIFTPV PKVNYVVEHI
RVGQATDIDS LLIEIWTDGT IKPGDALSHA AQVLVQYSQT IADFNRLSTE AEPTTAPNGL
AIPADIYDTP IEELDLSTRT YNCLKRADIT KVGQVLEMDE KALLSVRNLG QKSMEEIRDK
LIERGYIPRI GQTSHAARAE IEG