Gene RSP_3606 details

Gene Information

Locus tag: RSP_3606
Symbol:
ID: 3721764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007494
Strand:
Start bp: 701300
End bp: 702598
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 73%
IMG OID: 640073273
Product: Sigma54-2 (RNA polymerase sigma-54 factor)
Protein accession: YP_355111
Protein GI: 77465608
COG category: [K] Transcription
COG ID: [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
TIGRFAM ID: [TIGR02395] RNA polymerase sigma-54 factor
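
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, not part of the original record. It assumes the coordinates are 1-based and inclusive (the listed values are consistent with this) and that the CDS ends in a single untranslated stop codon; the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 701300
end_bp = 702598

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1299
print(protein_length_aa)  # 432
```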


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTCCC GCCAGCGCAT CAGTATCGCC CAGACCCAGA GGCTGCAGCT CAATCTCGGC 
CTCACCGCCT CGATCCGCGT CCTGAATTCC GATGCCGAGG GCCTCACGCG CTACCTGCAG
GAGCAGGCGG CGGAGAACCC CCATATCCAG CTCGAACCGG CAACCTCGAC CGACTGGCTG
CCGCGCTGGA CGAGCGTTCT GTCGCGCCTC GCGCAGGGCG AGGGGTCGGC GGGCGGAGAG
ACGGTGGCGG CGGCGGGGCC GAGCCTCATG GCGCATGTGA TGGCGCGCAT CGACACGCTT
TATCCGCGCG GGCCCGAGCG GCGGATCGCC ATCCTTCTGG CCGAAGCGCT GGAGCCCACC
GGCTGGCTCG GGACGGGACC GGACGAGATC GCCCGGCAGG CCCGCGTCCC CTCCGCCGAG
GTCGAGGCCG TGCTGGCCGG GCTGCAGAAG ATCGAGCCCG CCGGCCTCTT CGCCCGAACC
CTCGCCGAGT GCCTGCGGCT TCAGGCCATC GAGGCCGAGC GGCTCGATTC CACCCTGAGC
TGCCTTCTCG ACCATCTCGA CCTGGTGGCA GAGGGGGCCC TCGGGCGGCT CGCGCGGCTC
TGCAACACGG ACGAGGCCGG GGTGACCGCG CGCCTGCGGC TCCTGCGGAC CTTCGACCCG
AAGCCCGGCG CGCAGTTCGA TCCGGGCGCG GCGCCGGTGC GCGAGCCCGA CCTGATCGCG
ACGAAGGGCG AGGCCGGGTG GGAGGTGTCG CTGAACCGCT CGGCCATGCC CACGGTGCAG
ATCCGCAAGC CGGACAAGCG CCCGACGACG CCGGCCGCCC GCGCGGCCTG GACCCAGGCG
CAGGCGGTGG GCCGGATGAT CGAGAACCGC AATGCCACGC TGCTGAGGGT CGCGCGCGAG
ATCCTCGCCC GGCAGGAGGC GGCGCTCGAC GAAGGTCCCT CGGCGCTCGT GGCCCTGACC
ATGACCGAGG TGGCCGAGGC GCTCGGCATC CACGAGAGCA CGGTGAGCCG CGTGGTCGCG
GGCACCTGCG TGGACACGCC GCGCGGCACC TGGTGGCTGC GGCGCATGTT CAGCGGCCGC
CTTGCCGAGG GCGGTCCCTC GGCCGCGGCC ATCCGCGCCG CCATCGCCCG CCTCGTCGCG
CAGGAAGATC CGGCCGCGCC TTTGTCCGAC GGCGCTCTGG TCGAGGCGCT GGCGGCCGAG
GACATGCAGC TGGCGCGCCG CACCGTCGCC AAATATCGCG AGATGCTGAA CATCCCCCCC
GGACACCGCC GCCGCCGCAG GCCCTCGCGC TCGGCCTGA
 
Protein sequence
MKSRQRISIA QTQRLQLNLG LTASIRVLNS DAEGLTRYLQ EQAAENPHIQ LEPATSTDWL 
PRWTSVLSRL AQGEGSAGGE TVAAAGPSLM AHVMARIDTL YPRGPERRIA ILLAEALEPT
GWLGTGPDEI ARQARVPSAE VEAVLAGLQK IEPAGLFART LAECLRLQAI EAERLDSTLS
CLLDHLDLVA EGALGRLARL CNTDEAGVTA RLRLLRTFDP KPGAQFDPGA APVREPDLIA
TKGEAGWEVS LNRSAMPTVQ IRKPDKRPTT PAARAAWTQA QAVGRMIENR NATLLRVARE
ILARQEAALD EGPSALVALT MTEVAEALGI HESTVSRVVA GTCVDTPRGT WWLRRMFSGR
LAEGGPSAAA IRAAIARLVA QEDPAAPLSD GALVEALAAE DMQLARRTVA KYREMLNIPP
GHRRRRRPSR SA
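
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a rough Python sketch of one way to do that, not the method used by the database. The function names `clean`, `gc_fraction` and `translate` are hypothetical. It uses the standard amino-acid assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

# Standard codon assignments in TCAG order. Translation table 11 uses the
# same assignments for internal codons and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAAGTCCC GCCAGCGCAT CAGTATCGCC CAGACCCAGA GGCTGCAGCT CAATCTCGGC"
print(translate(first_line))  # MKSRQRISIAQTQRLQLNLG
```

Pasting the full gene sequence into `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content field.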