Gene Rpal_0052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0052 
Symbol 
ID6407694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp55551 
End bp57197 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content64% 
IMG OID642709960 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_001989090 
Protein GI192288485 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.15874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTGT CGCAACGTCT CGAATTCCGC CAGACCCAGT CTCTGGTGAT GACCCCGCAA 
CTGATGCAGG CGATCAAGCT GCTTCAGCTG TCGAATCTCG ACCTCGCGGC TTTCGTCGAG
GACGAGATCG AGAAGAACCC GTTGCTCGAT CGCGCCGGCG ACAATGCGGA ACCGCCGGTG
GCCGGTGAGG CATCGGCGGA AGGCGCAGAG GGCGGCGGTG AATTCGGCGG GAGCGGCGGC
GAGGATCTCG GCGGCGAGGG CACGTCGGAT TTCGTCGATC CCGCAGGCGC CAGCGCATTC
GAGCCCGGCA CCGAAGAATG GATGCATCGC GATCTCGGCA GCCGCAGCGA TATCGAGCAG
ACCCTCGATA CCGGTATGGA AAACGTTTTC CCTGAAGAGC CAGCCGAAGC CGCGGCCCGC
GCCGCGCAGG ACGCCGCACC GGCCTCCTAT ACCGAATGGG GCGGCGGGGC TTCATCGGAT
GAGGGCTACA ATCTCGAAGC CTTCGTGGCC GCTGAATCGT CGCTGGCCGA TCATCTCGCC
GAACAGCTCG CGGTCGCGGT AACGACGCCA TCGCAGCGGC TGATCGGACA ATACCTGATA
GATCTGGTCG ATGATGCCGG CTATCTGCCG GCTGATCTCG GCGATGCCGC CGAACGGTTG
GGCGCGAGCC AGGCCGAAGT GGAAGCGCTG GTGCAGGTGC TGCAGACCTT CGATCCGCCC
GGTATCTGCG CCCGCAACCT GAGCGAATGC CTCGCCATCC AACTGCGCGA GCGCGACCGC
TACGATCCGG CGATGCAGGC GTTGGTCGAG CACCTCGACT TGCTGGCCAA ACGCGACGTC
GCGTCATTAC GCAAGATCTG CGGCGTCGAC GACGAAGACC TCGTCGACAT GATCGGCGAG
ATTCGTCATC TCGATCCGAA GCCCGGCCTG AAGTTCGGCT CGTCGCGGGT GCAGACAGTT
GTGCCCGACG TCTTCGTCCG TCCCGGCCCC GACGGCGGTT GGCTGGTCGA GCTCAACAGC
GACACACTGC CGAAGGTGCT GGTCAACCAG TCATATTATT CCGAGCTGTC GAAGACGATC
CGCAAGGACG GCGACAAGTC GTATTTCTCC GACTGCCTGC AGAACGCTAC CTGGCTGGTG
CGCGCGCTCG ATCAGCGCGC CCGCACCATC CTGAAGGTGG CGACCGAGAT CGTGCGCCAG
CAGGACGGCT TCTTCACCCA CGGCGTCAAG CATCTGCGGC CGCTGAATCT CAAGGCCGTG
GCTGACGCGA TTCAGATGCA CGAATCGACG GTGTCGCGCG TCACCGCCAA CAAATACATG
GCGACCAATC GCGGCACTTT CGAACTTAAG TATTTCTTCA CCGCGTCGAT CGCGTCCGCC
GACGGCGGCG AGGCGCACTC GGCCGAAGCA GTACGGCACC AGATCCGCCA ACTGATCGAC
AGCGAAGATC CGTCAGCGAT CCTGTCGGAT GATACGATCG TCGAACGGCT TCGCGAGGCC
GGCATCGACA TCGCGCGCCG CACCGTCGCG AAATATCGCG AAGCGATGCG CATTCCCTCT
TCAGTGCAGC GCCGTCGCGA CAAGCAGAAC ATGCTGGGCA CACAGGCCGG GAGCGCAAGC
CGCTCCCGCG ACACAGCCCC AGCTTGA
 
Protein sequence
MALSQRLEFR QTQSLVMTPQ LMQAIKLLQL SNLDLAAFVE DEIEKNPLLD RAGDNAEPPV 
AGEASAEGAE GGGEFGGSGG EDLGGEGTSD FVDPAGASAF EPGTEEWMHR DLGSRSDIEQ
TLDTGMENVF PEEPAEAAAR AAQDAAPASY TEWGGGASSD EGYNLEAFVA AESSLADHLA
EQLAVAVTTP SQRLIGQYLI DLVDDAGYLP ADLGDAAERL GASQAEVEAL VQVLQTFDPP
GICARNLSEC LAIQLRERDR YDPAMQALVE HLDLLAKRDV ASLRKICGVD DEDLVDMIGE
IRHLDPKPGL KFGSSRVQTV VPDVFVRPGP DGGWLVELNS DTLPKVLVNQ SYYSELSKTI
RKDGDKSYFS DCLQNATWLV RALDQRARTI LKVATEIVRQ QDGFFTHGVK HLRPLNLKAV
ADAIQMHEST VSRVTANKYM ATNRGTFELK YFFTASIASA DGGEAHSAEA VRHQIRQLID
SEDPSAILSD DTIVERLREA GIDIARRTVA KYREAMRIPS SVQRRRDKQN MLGTQAGSAS
RSRDTAPA