Gene Rsph17029_3315 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3315 
Symbol 
ID4898268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp372668 
End bp374377 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content66% 
IMG OID640113914 
Productphage terminase 
Protein accessionYP_001045183 
Protein GI126464070 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.172145 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCACG CCAACGACTG GTCGACCGCC TGCCCGGACT GGGCGGAGCG GCTCGCGGCA 
GGCCGTCCGC TGATCCCCGA CCTGCCGCTC TTCCGGCCGG TTGCGGACAA GGCGCTGCGG
ATCTTCAAGA GCCTGCGCGT CCCGGACATG ATCGGGACGC CGACGCTGGG CGAGGTCTGC
GAGGAATGGA TCTTCGATCT CGTGCGCGCC ATCTTCGGCG CCTACGACCC GGAGACCCGC
CGCCGGATGA TCCGGCAGTT CTTCGTGATG ATCCCGAAGA AGAACGGGAA GTCCTCGATC
GCAGCCGCGA TCATCGTGAC GGCGGTGATC CTCAACGAGC GCCCGCTGGC AGAGGCGATC
CTGATCGCCG AGACGCAGAA GATCGCGGAC ATCGCCTTCC GCCAGGCGGC CGGGATCATC
CGGCTCGACG CGCGGCTCGA CAAGGAAAAG GGCGGCATCT TCGACGTCAA GGATCACTCC
AAGACGATCG TGCACATGAA CACGGGCGCG GTGATCCGCA TCCTCTCGGC CGATGGCGAT
GTCATCACCG GGTCGAAGGC GGCCTACATC CTCGTGGATG AGACGCATGT GCTCGGCCAC
AAGTCGAAGG CGGATGCGAT CTACCTTGAG CTGGAAGGCG GGCTCGCCGC CCGGCCCGAG
GGCTTCCTGC TGGAGATCAC GACCCAGTCG AAGGTCCAGC CGCATGGCGA GTTCAAGCGG
CGGCTGAAGC TGGCCCGCGA CGTGCGCGAT GGCAAGGTGA GCCTGCCGAT CCTGCCGGTG
CTCTACGAGC TGCCGGCAAA GATGCAGGCC GCCAAGGCAT GGATGGACGA CAGCACCTGG
GGGCTGGTGA ACCCGAACCT CGAGCGGTCG GTCTCGATCG ACTTCCTGCG CGAGAAGTTC
GTGGAGGCGC AGCAGGGCGG GGACGACAAG CTCGCGCTCT TCGCCTCGCA GCATCTCAAC
GTGGAGATGG GCATCGGCCT GCATTCCGAC CGTTGGGTCG GGGCGGACTA CTGGCTGAAG
AATGCAGAGC CGGGGCTGAC CTATGCGCAG CTGCTCGACC AGTGCGAGGT TGTCATCTTC
GGCGGCGATG TCGGCGGCGC GGACGATCTC TTCGGCCTCA CGGCCATCGG CCGGCACCGC
CAGACCAAGA TCTGGCTGAC CTGCAGCTGG GCGTGGTGCG TCAGAGACGT GCTGAAGAAC
CGCAAGGAGA TCGCGCCCCG GCTCGAGGAG CTGGAGAGGG CCGGAGATCT TCGGATCACC
GATGGCGCGG CCGAGCATGT CGAGGAGGCG GTCGCGATCA TCTGCGAGGC GCGGGACGCG
GGCAGGCTTC CGGACGGCGT CTGCATCGGC CTCGATCCCT ATGGCGTGGC GGCTCTCGTC
GATGCGCTGG AGGCCGAGGG CTTCGATCCG TCCACGCGGA TCGCGCCCAT CGGGCAGGGC
TACAAGCTGA ACGGCGCGGT GAAGGGACTG GAGCGGCGGC TGCTCGACGG CCGCATCCGG
CATGCCGGCC AGCCGATGAT GACGTGGTGC GTCGGCAACG CGAAGGCCGA GCAGCGCGGA
AACAATGTCT ACATCACGAA GGAGGCTGCA GGTGTGGCTA AGATCGATCC CCTGATCGCC
CTCTTCACGG GGGCGGTGCT GATGGACACC AATCCTCAGG CCCCGGCAAG CCTCGACGAC
TTCCTGTCCG ACCCGGTGAT GGTGATCTGA
 
Protein sequence
MIHANDWSTA CPDWAERLAA GRPLIPDLPL FRPVADKALR IFKSLRVPDM IGTPTLGEVC 
EEWIFDLVRA IFGAYDPETR RRMIRQFFVM IPKKNGKSSI AAAIIVTAVI LNERPLAEAI
LIAETQKIAD IAFRQAAGII RLDARLDKEK GGIFDVKDHS KTIVHMNTGA VIRILSADGD
VITGSKAAYI LVDETHVLGH KSKADAIYLE LEGGLAARPE GFLLEITTQS KVQPHGEFKR
RLKLARDVRD GKVSLPILPV LYELPAKMQA AKAWMDDSTW GLVNPNLERS VSIDFLREKF
VEAQQGGDDK LALFASQHLN VEMGIGLHSD RWVGADYWLK NAEPGLTYAQ LLDQCEVVIF
GGDVGGADDL FGLTAIGRHR QTKIWLTCSW AWCVRDVLKN RKEIAPRLEE LERAGDLRIT
DGAAEHVEEA VAIICEARDA GRLPDGVCIG LDPYGVAALV DALEAEGFDP STRIAPIGQG
YKLNGAVKGL ERRLLDGRIR HAGQPMMTWC VGNAKAEQRG NNVYITKEAA GVAKIDPLIA
LFTGAVLMDT NPQAPASLDD FLSDPVMVI