Gene RSP_2998 details

Gene Information

Locus tag: RSP_2998
Symbol:
ID: 3720249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 1694748
End bp: 1696334
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 49%
IMG OID: 640071192
Product: putative terminase large subunit
Protein accession: YP_353065
Protein GI: 77463561
COG category: [R] General function prediction only
COG ID: [COG4626] Phage terminase-like protein, large subunit
TIGRFAM ID:
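
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 1694748
end_bp = 1696334

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1587
print(protein_length)  # 528
```

Both results agree with the reported Gene Length (1587 bp) and Protein Length (528 aa).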


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.353099
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGATGA GTGATCAATC TTTACCCGTT CTCTCGTCCG CTCAACCCGT CATTCCGCCT 
AAGCCGAAAC TGTCTAGGGC AGAAAAGAAT ATCAAGTGGA TTGAGTCGAA CCTGTTTATT
CCAGAAGGTA AGGACGTTGG TAAGCCATTC AGGCTTGTGG ACTTTCAGAA GGATATCATT
CGTTCGATCT ATGACAATCC GGCTGGAACT CGTCGGGCTA TTATTTCGAT GCCCCGTAAG
GCTGCGAAGA CAACGCTCTG TGCCGCTCTG ATGCTGTTGC ATCTGGTAGG TCGAGAAGCC
TTGCCCAACT CGCAACTATA CAGTGCAGCC CGAAGCAGGG ATCAAGCTGC GGAGTTGTTC
AAACTGGCAG TCAAGATGAT CAGGATGAGT CCGCGTATAT CGCGTTTTGT TCGGATTGTG
GAGACTAGTA AGCGGCTGAA GGTGCCCGAG CTAGGGACAG AGTATAGGGC TCTCTCCAAG
GATGCTGGAA CGGCTCAAGG GCTGAGTCCG TGCCTCGTGA TCCACGATGA GCTAGGGCAG
GTGAGAGGGC CAGTAGATCC GCTCTACGAA GCCTTGGAGC TTGCCACTGC GGCTCAGGCA
AACCCGCTTA CCCTCGTGAT CTCTACGCAG GCTCCGACTG ACAATGACCT TCTTAGTCAG
TTGATCGATG ACGCCGCAAC CGGGGCAGAT CCGACCAAGG TTCTTAAGCT CTATTCGTGT
CCGATGAATA TCGATCCGTT CTCAGAAGAA GCCCTAGCTG TCTCGCATCC TGCATGGAAT
TCCTTTGTGA ACCGCAAGGA ACTCAAGCAA ATGCAGGCCG AGGCCGCACG GATGCCTGCC
CGTGCTGCGG ATTTCCGCAA CTACACCCTC AACCAGAGGA TCGAAGTCAA CGCTCCATTC
ATTTCAAAAG ATGTTTGGGA TGAAGGCAAG GATAATCCCG AAGAATGGCA TGGAAAGGAT
GTTTGGCTTG GCCTTGATCT ATCTGAAACC CGAGATCTCA CTTCTCTTAC TTTAGCACAT
AAAGACGAGA ATGGTTTGCT TCACGTTCAT CCATTCTTTT GGCTTCCCGA TGAGGGAATA
GAGGATAAAT CGAGAAGTGA TAAGGTTCCT TATGACATTT GGGCCAAGGG TGGACTAATC
CATTTAAGCC AAGGAAGAAC CATCCAATAT AAGGATGTCG CTGCCAAGCT TAAGGAGATT
GCGGATAACG CCAATGTCCA GAAGGTAGCC TTTGACCGTT ACAAAATAAA ATACTTCAAG
CGCGACATGA TTGATTGTGG TTTTGATGAG CGATGGATTG ACGAGCACAT GGTTTCTTAT
GGGCAGGGCT TCGTTTCTAT GGGCATCGGA ATTAACGAGT TGGAGCGTTT AATTCTGGAT
GGCAAAATTC GCCATGGGAA CAACCCCGTC ATGAATATGT GCATGGCAAA CGTGAAAGTT
GTTTCGGACA CTTCAAACAA CCGCAAATTC ATCAAGCATA CTTCGACAAG ACGAATTGAC
GGCGCTGTTA CGTTAGCGAT GCTCGCCGGA ATGCTTGCTG ATCCAGATAA CAAGCCAAAG
CCCAAGCGAA AAGCTCTATT TGCTTAA
 
Protein sequence
MTMSDQSLPV LSSAQPVIPP KPKLSRAEKN IKWIESNLFI PEGKDVGKPF RLVDFQKDII 
RSIYDNPAGT RRAIISMPRK AAKTTLCAAL MLLHLVGREA LPNSQLYSAA RSRDQAAELF
KLAVKMIRMS PRISRFVRIV ETSKRLKVPE LGTEYRALSK DAGTAQGLSP CLVIHDELGQ
VRGPVDPLYE ALELATAAQA NPLTLVISTQ APTDNDLLSQ LIDDAATGAD PTKVLKLYSC
PMNIDPFSEE ALAVSHPAWN SFVNRKELKQ MQAEAARMPA RAADFRNYTL NQRIEVNAPF
ISKDVWDEGK DNPEEWHGKD VWLGLDLSET RDLTSLTLAH KDENGLLHVH PFFWLPDEGI
EDKSRSDKVP YDIWAKGGLI HLSQGRTIQY KDVAAKLKEI ADNANVQKVA FDRYKIKYFK
RDMIDCGFDE RWIDEHMVSY GQGFVSMGIG INELERLILD GKIRHGNNPV MNMCMANVKV
VSDTSNNRKF IKHTSTRRID GAVTLAMLAG MLADPDNKPK PKRKALFA
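
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is an illustration, not the database's own method: it builds the codon table from the standard NCBI ordering (translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it allows), and `translate` is a hypothetical helper name. It translates the first 30 bases and the final line of the gene sequence, which begins in frame at base 1561.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: sufficient for table 11 here because the gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the last line of the gene sequence above.
first_blocks = "ATGACGATGA GTGATCAATC TTTACCCGTT"
last_line = "CCCAAGCGAA AAGCTCTATT TGCTTAA"

print(translate(first_blocks))  # MTMSDQSLPV
print(translate(last_line))     # PKRKALFA*
```

The two outputs match the first ten and last eight residues of the protein sequence, with the trailing `*` marking the TAA stop codon.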