Gene RSP_3543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3543 
Symbol 
ID3721957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007494 
Strand
Start bp633384 
End bp634484 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content62% 
IMG OID640073207 
Producttype I restriction-modification system restriction subunit 
Protein accessionYP_355045 
Protein GI77465542 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCGTC GTCACGGGCC TTTCGGGAAG CGGCAAGACG AAGGAAATTC GCGATCTGCT 
GCGCCGCTTC CAGGAAAGCG CGGCGGTCCT CCCGTCGGGC CAGCCGGCAC GGATCGTCGA
GTGCATTCTC GATGCAAAGG GCGGCTGGAA GGACCTCAAC AGGAAGACCC TCAAGGCAAT
GGGGTATCCG ATTTCGGACA GTTCACGACT GCGCCCAACC TCATGCGCGA GGGTACACGG
GGTCAGGTAT CGCCGGTGGA ACGCGACGGC ATCTTCGGCG GCGATATAAA GCTAGAGGTC
TCCGACCCGC ACCAGGTCTA TGCACTGCTT GCCCGGATGC AGGAGATGCA CATCCTCGAC
CAAGGCGAGA TCGACCGGTT CGTGTCACGC TTCCTGCAAG CCAATCAGCG GGCCGATGAG
CGGCCGGTGC TGGAAGGCAT CGTCCGGCAG ACAGTGGAGC GCTTCCGGAC GGCCCTGACC
GAGGAGCAGC AGGAAGAGTT CCGGCAGCTG CTGGCTTCCT TCCTGCGGTT TTATGCCTTC
ATCTCGCAGG TCATCGCCCT CGAGGACAGC GACCTCGAGA AGATGTACCT CTTCGGCAGC
TGGCTGAAAC GCCTGCTTCC GTCGCGCGAG GCGCCGCAAG GCGGCGATGT CACCGACGAC
ATGCTGGAGT TGCAGGCCTT CCGGCTCTCG GAAGGCGAGG TTGTCGATGC GTCGCTCGAA
GCAACAGAGG CGAAGCCGCT GTCCCCGATC GACCGTTTCG GGGCGAACCC TTTTACTGAA
GAAGAGCGGC GCACGCTTTC GGAAATCATC AAGGCGTTCA ACGACCGGCA CGCCACGAAC
TTCACCGAAG AGGATTACAT CCGCTTCGAG GCAGTGAACG AGGCCATCCT CGACGACGAG
GCTTGGGCCG AAATGCTGCG GAACAACCCG CCCGAGGTCG TGCGGCCCAG GTTCGGCGAG
GAGTTCATGC GTAGGGCCAT TCTGGCGTTC CAACGCGACC GCCAGATGCA GAGCGCCTTC
CTCCAAGATC GGGAAGGCCG GGAGATGATC ATGGGGCTGA TGTTCGGGCG AGCCGTGCGC
GGAGCAAGAA AGTCAGCATA G
 
Protein sequence
MPRRHGPFGK RQDEGNSRSA APLPGKRGGP PVGPAGTDRR VHSRCKGRLE GPQQEDPQGN 
GVSDFGQFTT APNLMREGTR GQVSPVERDG IFGGDIKLEV SDPHQVYALL ARMQEMHILD
QGEIDRFVSR FLQANQRADE RPVLEGIVRQ TVERFRTALT EEQQEEFRQL LASFLRFYAF
ISQVIALEDS DLEKMYLFGS WLKRLLPSRE APQGGDVTDD MLELQAFRLS EGEVVDASLE
ATEAKPLSPI DRFGANPFTE EERRTLSEII KAFNDRHATN FTEEDYIRFE AVNEAILDDE
AWAEMLRNNP PEVVRPRFGE EFMRRAILAF QRDRQMQSAF LQDREGREMI MGLMFGRAVR
GARKSA