Gene RSP_2358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_2358 
Symbol 
ID3719895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp984222 
End bp985484 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content67% 
IMG OID640070537 
Productphage phi-C31 gp36-like protein /HK97 family major capsid protein 
Protein accessionYP_352418 
Protein GI77462914 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.172606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCGCG CAGGCGCCGT CTCGGGCCTC TCCATCGGCT TCGAGACCGA GGCCGCCAAG 
CCCCGCGCCC GTGGCCGCTC CATCTCGAAG CTGAGGCTTC TCGAAGTCTC CGTTGTCGCC
GTCCCGTGTC ATCCGGGCGC GCAGATCCAT TCCATCAAGG CCGCAGATGA CACGGCAGAA
CCATGCACCG AAGGAAAGAC CCCCGTGGAG AACGAAGACC AGACCACCCC GGCCAACGCG
CCGGAGATCG ACACCAAGGC GTTCGACGCG CTGAAGCAGC GCCTCGACCA GCTCGAAGCA
AAGGCCAACC GCCCCGGCGT CACGACGACC GGCCCGGCCC CGAGCGCCGA AGCGAAGGCC
TTCGGCGGCT ATGTCCGGCG CGGCGTGGAG CGGATGGACC CCGCTGACAC CAAGTCGCTG
ACCGTCTCGA CCGCCGCGAA CGGCGGCTAC CTCGCGCCGA AGGAGTTCGG CGACGAGCTG
TTCAAGAACC TGATCGAGTT CAGCCCGATC CGCAAGTATG CCCGCGTCGT CCAGATCAGC
GCGCCCGAGA TCACCTATCC CAAGCGCGTC ACCGGCACCT CGGCGACCTG GGTCTCGGAA
GTCGGCGACC GCACCGGATC GGAACCGAGC TTCGATCAGG TCACGCTGAC CCCGCACGAG
CTGGCGACCT TCACCGACAT CTCGAACGCA CTTCTGGAAG ACAACGCCTA CAATCTCGAA
GGCGAGCTGA TGGCCGACTT CGCCGAGAGC TTCGGGCGCG CCGAGAGCGC GGCCTTCGTC
AACGGCGACG GTGTGGGCAA GCCGAAGGGT ATCATGGCGG CGGCGGGCAT CGCGACCCTG
AGCGGCGGTG CGGGCACGAT CACCGTTGCA TCGCTGATCG AAGCCTATCA CGCGATCCCT
ACCGTCTATG CACAGAATGC TGTCTGGGTG ATGAACCGCA CCACGCTGGC CAAGCTGCGC
ACCTACTTCA ACGGCATGGG CGAGCCGCTT CTCCTGGACA GCATCTCGGA GAAGGCCCCG
ACCACGCTTC TCGGCCGCCC CGTGGTCGAA GCGCCGGATA TGCCGAACAT GACGGCGGGC
GCCACCCCGA TCCTGTTCGG CGATCTGTCC GGCTACCGCA TCGTGGATCG CGTGGGCCTC
GCGATCATGC GCGACCCGTT CAGCCTCGCG ACCAAGGGGC AGGTCCGCTT CCACGCCCGC
AAGCGTGTGG GTGCCGACCT GACGCACCCC GACCGCTTCG TGAAGCTGAA GGTCGCGGCC
TGA
 
Protein sequence
MIRAGAVSGL SIGFETEAAK PRARGRSISK LRLLEVSVVA VPCHPGAQIH SIKAADDTAE 
PCTEGKTPVE NEDQTTPANA PEIDTKAFDA LKQRLDQLEA KANRPGVTTT GPAPSAEAKA
FGGYVRRGVE RMDPADTKSL TVSTAANGGY LAPKEFGDEL FKNLIEFSPI RKYARVVQIS
APEITYPKRV TGTSATWVSE VGDRTGSEPS FDQVTLTPHE LATFTDISNA LLEDNAYNLE
GELMADFAES FGRAESAAFV NGDGVGKPKG IMAAAGIATL SGGAGTITVA SLIEAYHAIP
TVYAQNAVWV MNRTTLAKLR TYFNGMGEPL LLDSISEKAP TTLLGRPVVE APDMPNMTAG
ATPILFGDLS GYRIVDRVGL AIMRDPFSLA TKGQVRFHAR KRVGADLTHP DRFVKLKVAA