Gene Rpal_0371 details

Gene Information

Locus tag: Rpal_0371
Symbol:
ID: 6408017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 394333
End bp: 395232
Gene Length: 900 bp
Protein Length: 299 aa
Translation table: 11
GC content: 65%
IMG OID: 642710281
Product: RNA polymerase factor sigma-32
Protein accession: YP_001989407
Protein GI: 192288802
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02392] alternative sigma factor RpoH
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
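
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the terminal stop codon is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 394333
end_bp = 395232
gene_length_bp = 900
protein_length_aa = 299

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the final stop
# codon is not counted in the protein length.
codons, remainder = divmod(span_bp, 3)
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)
print(remainder == 0)
print(expected_protein_length == protein_length_aa)
```

Under those assumptions, all three checks agree with the values listed in the record.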


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.532177
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGCCCGTG CTGCTACTCT CCCGATTCTC AACGGAGAAT CCGGCCTTGC TCGCTACCTT 
GCGGAAATCC GCAAGTTTCC GATGCTCGAG CCTCAGCAAG AGTACATGCT CGCCAAGCGC
TGGCGCGAAC ACGGTGATCG CGACGCGGCG CATCAACTCG TCACCAGCCA TCTTCGCCTC
GTCGCCAAAA TCGCGATGGG CTACCGCGGC TATGGCCTGC CGATCTCCGA GGTCGTCTCG
GAAGGCAACG TCGGCCTGAT GCAGGCGGTG AAGCGGTTCG AACCCGAGAA GGGCTTCCGC
CTCGCCACCT ACGCGATGTG GTGGATCAAG GCGTCGATTC AAGAGTACAT CCTGCGTTCG
TGGTCACTCG TGAAGATGGG TACCACCGCG AACCAGAAGA AGCTGTTCTT CAACCTGCGC
AAGGCGAAGA GCAAGATCTC GGCGCTGGAC GAGGGCGATC TGCATCCCGA CCAGGTCAAG
CTGATCGCCA CCCGGCTCGG GGTGACTGAG CAGGACGTGG TCGACATGAA TCGCCGCCTC
GGCGGCGACG CTTCGCTCAA CGCGCCGATC CGCGACGACG GCGAGCCGGG CGAATGGCAG
GACTGGCTGG TCGATCAGTC GCCGAGCCAG GAAGCGGTGA TGGCCGAGCA CGAAGAGCTC
GACCAGCGCC GGGCCGCGCT CAACGGCGCG ATCCAGGTGC TGAACCCGCG CGAACGGCGG
ATCTTCGAGG CCCGCCGCCT CGCCGACGAG CCGATGACGC TTGAAGACCT CGCCTCCGAA
TTCGGCGTGT CGCGCGAGCG CGTGCGCCAG ATCGAAGTGC GGGCGTTCGA GAAGGTGCAG
AGCGCCGTCA AGGGCACCAT CGCCCGCCAG GAACAGGCGG CGCTGGAAGC CGCGCACTGA
 
Protein sequence
MARAATLPIL NGESGLARYL AEIRKFPMLE PQQEYMLAKR WREHGDRDAA HQLVTSHLRL 
VAKIAMGYRG YGLPISEVVS EGNVGLMQAV KRFEPEKGFR LATYAMWWIK ASIQEYILRS
WSLVKMGTTA NQKKLFFNLR KAKSKISALD EGDLHPDQVK LIATRLGVTE QDVVDMNRRL
GGDASLNAPI RDDGEPGEWQ DWLVDQSPSQ EAVMAEHEEL DQRRAALNGA IQVLNPRERR
IFEARRLADE PMTLEDLASE FGVSRERVRQ IEVRAFEKVQ SAVKGTIARQ EQAALEAAH
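
The sequences above are displayed in blocks of ten residues separated by spaces, so they need cleaning before any length or composition check. The sketch below is one possible way to do that in Python; `clean_sequence` and `gc_percent` are hypothetical helper names, and the demo input is only the first display line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, used only as a short demo input.
first_line = "ATGGCCCGTG CTGCTACTCT CCCGATTCTC AACGGAGAAT CCGGCCTTGC TCGCTACCTT"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_percent(fragment), 1))
```

Passing the full gene sequence through the same two helpers would give a value to compare with the GC content field; the figure in the record may have been rounded or computed differently.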