Gene RSP_4107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_4107 
Symbol 
ID3711856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007489 
Strand
Start bp62031 
End bp63725 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content67% 
IMG OID640069450 
Productsite-specific recombinase 
Protein accessionYP_345317 
Protein GI77404744 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCTGC CGTCCCACGG CTTCACCATC CTGCGCAACG CGCCCGACAG CGCCCGCGGA 
GCCACCGCCC GTTCGGCCCG TCGCGCGGCC GGCGCTGCCG AACCAATGCC CACCGGCTCC
GGCCGGCAAC TCGCGGTGCT CTACGCGCGC TACTCCAGCG CGAAGCAGAA CCCCATGTCG
TGCGAGGATC AGCTGGCGCT CTGCCGCGAG ACGGCGGCGA ACCTGAACTT CGAGATCGCC
GCCGAATTTT TCGACGCGGC GGCCAGCGGT CGTACCCTGC TGCGCAACCG TCCCGGCGTG
TGCGAAATGA AGGCCCGCGT TGCGAAGGGC GACGTTGCCG TCCTCATCGT CGAAGGCATC
GAGCGGATCG GCCGCCGCGC CCGCGACATC GCCGAGGTGT CCGAGTGGTT CGAGAGCCAG
AACGTCGACC TGGTTGCCGC GAATGGCGGC CGGATCCCGT GGAAACTCGT GCCCTTCCAC
GGCGCTATCG CGGAATTTCA AAGCCGCGAG ACCGCTGACA AGACCCGCCG CGGCCAGGTC
GGCACCACGC GGCGCGGTCG CGTCTCGGCC GGCCTCGCTT ACGGTTACCG CGTCGCGCCC
GGCGCGGCAG AGTTCAACCG CGTCATCGAT CCGGCTCAGG CCGAAGTGGT GCGCCGAATC
TTCGAAGACT ACGCCGCCGG CCTCTCGCCA CGGCAGATCG TGTCCACGCT GAATGCCGAG
GGCATTCCCT CACCCACCGG CATGGCGTGG AACGACAGCA CCCTGCGCGG CAATGCGCAG
ACGCGCGACG GCATTCTGCG CAACGAAGCC TACGTCGGCA CGCTCGTCTA CGGCCGCAAC
CGTTTCACCC GCGATCCCGA CAGCGGCAAC CGGCTGTCGC GACCCGGCGA GGCAAATTCG
ATCGTCTATG TCGACCGGCC GGAGTTGCAG ATCATCCCCG AGGATCTCTG GAACCGTGTG
CAGGAGCGGC TTGAGAAAGC GTACAAACTG CGCGTGGCGC AGAAGCGGCA GCTGAACGAG
ACCCACCGCG CGCGGCACCT GCTGACCGGC ATCCTGCGCT GCGGATGCTG CGGCGGCTCC
ATCACGATCG TGAACGGCGA GCGCTACGGG TGCTACAATC GAAAGAGCAA GGGGCTCTCG
GTCTGCGGCA ACCGCCGGAC GATCCTTCGG CCGAAACTGG AAGAGGCCGT TCTCGCGCGC
ATCCGGGCGG GCCTGCTGAC GCCGGACCTC GCGCGTCACT TCGCGGCGGA AGTGCGGCGA
CTCTGGGAGG AGCAATCGGC GGGCCAGACG AATGCGCCCG CGCGTCTCAA GACGGATCTT
GCGCGGGTCA AGCGGTCGAT CGAAAACCTC ATCAACCGTC TGGAAGGCGA TGATCCTGGC
CCGCACATTC TCGAGAGACT GCGCGATCGC GAAGCGGAAG CTGCGCGGTT GACAACGGAA
CTTGCGGCGC TCGAAGCGCC CGCAAAAAAC CGTCAGCCGC CGTCTGCCGA GGAACTCGTG
GCCGCTTATC GGGGCCACGT CGATCGCATG GAGTCACTGC TGCGGGATCC GGCGATGATC
GTCGAAGCGA ACGACCTGTT GCGACACATG TTGGGACACG TCTCCGTGCA TCCGGACGCT
GACAGACCGC GAGGATTCCG CATCGAGATC ACGGGCGACC TGCTCACCTT CCTTCTGCCG
TCCGCGTCAA AGTAG
 
Protein sequence
MSLPSHGFTI LRNAPDSARG ATARSARRAA GAAEPMPTGS GRQLAVLYAR YSSAKQNPMS 
CEDQLALCRE TAANLNFEIA AEFFDAAASG RTLLRNRPGV CEMKARVAKG DVAVLIVEGI
ERIGRRARDI AEVSEWFESQ NVDLVAANGG RIPWKLVPFH GAIAEFQSRE TADKTRRGQV
GTTRRGRVSA GLAYGYRVAP GAAEFNRVID PAQAEVVRRI FEDYAAGLSP RQIVSTLNAE
GIPSPTGMAW NDSTLRGNAQ TRDGILRNEA YVGTLVYGRN RFTRDPDSGN RLSRPGEANS
IVYVDRPELQ IIPEDLWNRV QERLEKAYKL RVAQKRQLNE THRARHLLTG ILRCGCCGGS
ITIVNGERYG CYNRKSKGLS VCGNRRTILR PKLEEAVLAR IRAGLLTPDL ARHFAAEVRR
LWEEQSAGQT NAPARLKTDL ARVKRSIENL INRLEGDDPG PHILERLRDR EAEAARLTTE
LAALEAPAKN RQPPSAEELV AAYRGHVDRM ESLLRDPAMI VEANDLLRHM LGHVSVHPDA
DRPRGFRIEI TGDLLTFLLP SASK