Gene Rsph17029_3386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3386 
Symbol 
ID4898492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp440422 
End bp441657 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content64% 
IMG OID640113985 
Producttype II restriction endonuclease, putative 
Protein accessionYP_001045254 
Protein GI126464141 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.174346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTACG GCTCTCTCTC GGATCATTTC ACCGGGATCG TCGCCAAGCG CCTTTCAACG 
GTCGAGGCGG ATACGGCGCG GTCGAATCAG CACGAGTTCA ATGGTACGAA CGAGCTGCGC
CGGCTGCTGG GCGGCGAGCG CATCGAGCGC AGGCCGTCGC GCTTCATCTG GCTCGGCGGC
GAGAACGAGG GGATTACCGA CGACGCGCCA GTCACATGGT ACGACGCTCG GGAGCGCCAT
CCGACGCGAT CGGAATGGCG GCTCTATTTC CAAGCAAACG CCGTGACCGA GGTCGCGCAG
GCCGGTGACC TTCTGGTCGT GGCCCGCCGC CCGAGCGGCG ATCTGATGTT CATTGTCGCA
CCGCAGAGTT CCACTCTCGA GAACCAGATC GCCTGGCTCT TCGGGCTGGA TCACGGACTT
GGCGCCGGCT TCCGCTACGA GGGTTTCGAG GGAGCCGGCG ATCGCGGCCT CGACTTCGTC
AGCAATTATG TTCTTGAAGA GATCGGCATC GAGCCCGAGG TGCCGGAGGC CGATCGGCTG
GATGAAATCG TCGGCCGGTT CGGGGCGCAG TTCCCCAGCT CGCGGACATT CTCCGCGCTG
GCCCGGCAGA ACCTGCCCGA AGTCGATCCA CGCGACGATG CCGATGCAGC TCTGCTCGCA
TGGATCGAGT TCGAGGAGGC CCTTTTCCGG CGCTTGGAGC GCCATATCGT TGCCGCGCGC
TTGGAGGTTG GCTTCCTCAC GGATGGCACG GCCGATGTCG ACGGCTTTCT GCAGTTCTCA
CTGTCGGTAC AGAACCGCAG AAAAAGCCGC ATGGGCCTGT CGCTCGAGAA CCATGTCGAG
GAGATGTTGA CGGTACTGGG CCTCAGATAC GCACGGGGAG CGCGGACCGA GGGGAATTCG
AAGCCGGATT TCCTGTTCCC CGGCGTCGCC GAGTACGCCG ATCCGGGCTA CAGCGCTGAC
CGCCTGTCCA TGCTGGGAGT GAAGTCCACG CTCAAGGATC GCTGGCGCCA GGTGCTTGCA
GAAGCCGCAC GCATCGACCG CAAGCACCTG CTGACGCTGG AGCCCGGCAT CTCCACCCAC
CAGACGAACG AGATGATCCG TCACTCCCTG CAGCTCGTCG TGCCACGGGG CCTGCATACG
ACCTACACAC CGGAACAGGC TGGGTGGCTC ATGACCGTTC GCGGTTTCCT CAATCTGGTG
GCAGCACGGG AAGCTGCACG ACCCTATCGC GAATGA
 
Protein sequence
MRYGSLSDHF TGIVAKRLST VEADTARSNQ HEFNGTNELR RLLGGERIER RPSRFIWLGG 
ENEGITDDAP VTWYDARERH PTRSEWRLYF QANAVTEVAQ AGDLLVVARR PSGDLMFIVA
PQSSTLENQI AWLFGLDHGL GAGFRYEGFE GAGDRGLDFV SNYVLEEIGI EPEVPEADRL
DEIVGRFGAQ FPSSRTFSAL ARQNLPEVDP RDDADAALLA WIEFEEALFR RLERHIVAAR
LEVGFLTDGT ADVDGFLQFS LSVQNRRKSR MGLSLENHVE EMLTVLGLRY ARGARTEGNS
KPDFLFPGVA EYADPGYSAD RLSMLGVKST LKDRWRQVLA EAARIDRKHL LTLEPGISTH
QTNEMIRHSL QLVVPRGLHT TYTPEQAGWL MTVRGFLNLV AAREAARPYR E