Gene RSP_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_2471 
Symbol 
ID3720086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp1110244 
End bp1111440 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content66% 
IMG OID640070650 
Productphage phi-C31 gp36 major capsid-like protein 
Protein accessionYP_352531 
Protein GI77463027 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAGA CCTGGGCTCG GGCCGGGACA GGCATGTCCG CAGGCCCCGA TCCGGCCGTG 
GAGGCGAAAG CCGCAATGGC CGGTTTCCTG AAGGAGATCA ATCGCTTTCA GGAGGAGGTG
AAGAATGTGC TGCAACAACA GGAAGAGCGT TTGACCATGC TGGACCGCAA AACCATGATC
TACGGGCGCC CGGCGCTGGC GGCCGCGGCC GACCAGGAGG CGCCGCATCG CAAGGCGTTC
GGGGCCTATC TCCGCTCGGG CGACGACGAC GGTCTGCGCG GCCTCGTCCT CGAGGGCAAG
GCGATGACGG CGAGCGTCGC CTCGGACGGC GGCTATCTGG TCGATCCGCA GACCTCGGAC
GCCATCCGCT CGATGTTGCT GTCCACGGCC TCGATCCGTC AGATCGCCGG TGTGGTCCAT
GTGGAAGCCA CGAGCTTCGA CGTGCTGATC GACCGCACTG AGGTGGGGTC GGGCTGGGCC
ACGGAGGCCG CCACGATCAG CGAAAGCGCC TCGCCCACCA TCGAGCGGAT CTCGATCAAG
CTGCACGAAC TGTCGGCGAT GCCGAAGGCG AGCCAGCGGC TTCTGGACGA CTCGGCCTTC
GACGTCGAGA GCTGGCTGGC GGGCAAGATC GCGACGCGCT TCATGCGGGC CGAGAGCGCG
GCCTTCGTCA GCGGCGACGG GATCGACAAG CCGCGGGGCT TTCTGGCGCC GGCGAAGGTT
GCGAACGCGA GCTGGAGCTG GGGCTCGATC GGCTATGTCC CCTCGGGTGC GGCGAGCGAT
TTCCTCGCCA CGAACCCGGC CGATTGTATC ATCACCCTGA TCTATTCGCT CGGCGCCGAT
TACCGCGCGA ATGCGACCTT CGTGATGAAT TCGAAGACCG CGGGCGCGGT GCGGAAGATG
AAGGACTCGG ACGGCCGCTT CCTGTGGTCG GACGGGCTGG CGGCGGCGGA GCCTGCGCGG
CTGATGGGAT ATCCGGTTCT GCTGTGCGAG GACATGCCGG ACATTGCCGC GGGCGCCTTT
GCCATCGCTT TCGGGGATTT CGCCGCCGGC TACACGATCG CCGAGCGGCC CGAGGTGCGG
GTCCTGCGCG ATCCGTTCTC GGCCAAGCCC CATGTCCTCT TCTATGCGAC GAAGCGCGTG
GGAGGCGATG TCAGCGACTA TGCGGCGATC AAGCTCCTGA AGATCGCGGT GTCCTGA
 
Protein sequence
MTETWARAGT GMSAGPDPAV EAKAAMAGFL KEINRFQEEV KNVLQQQEER LTMLDRKTMI 
YGRPALAAAA DQEAPHRKAF GAYLRSGDDD GLRGLVLEGK AMTASVASDG GYLVDPQTSD
AIRSMLLSTA SIRQIAGVVH VEATSFDVLI DRTEVGSGWA TEAATISESA SPTIERISIK
LHELSAMPKA SQRLLDDSAF DVESWLAGKI ATRFMRAESA AFVSGDGIDK PRGFLAPAKV
ANASWSWGSI GYVPSGAASD FLATNPADCI ITLIYSLGAD YRANATFVMN SKTAGAVRKM
KDSDGRFLWS DGLAAAEPAR LMGYPVLLCE DMPDIAAGAF AIAFGDFAAG YTIAERPEVR
VLRDPFSAKP HVLFYATKRV GGDVSDYAAI KLLKIAVS