Gene Information |
Locus tag | RSP_2995 |
Symbol | |
ID | 3720321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 1691779 |
End bp | 1692888 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640071189 |
Product | phage major capsid protein, gp36 |
Protein accession | YP_353062 |
Protein GI | 77463558 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
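The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based, inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of the record or of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based coordinates with both ends included."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends with a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length_bp = gene_length(1691779, 1692888)
print(length_bp)                  # 1110
print(protein_length(length_bp))  # 369
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.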
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.100977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACTATTG AACTTAAAAA TGCCATTGAA GCTTCGAACC AGTTGATTCA AGCTATCCGT TCGGAAGTCG AAGGCGTTAA ATCTGCCGAC GCTCTGTTTG AAGGAAAAAT GGCTCGCATG GAAGCCGAAC TTGCGGCGTC TCTGTCGGCC AAGTCGGCTC TTGAAGCTCG CCTGAACGCT CTGGAAACCG CCGCCGCGCG TCCTGTGTCG GGTAAAGCTG CCGAAGCTGC CGATGAATGC AAGTCCGCGT TCCGCAACTG GCTTGCTAAC CCGGAAAGCT TCGAAGCAAA ACAAGCGTTC GAACAGAAGG CTCTTGCCAC CACTGGCCTA GCGAACGTCA TTCCGCGCAC TGTTTCGGAT GAAGTTATCG CTGCTGCTCG GGGTTATTCG GCTCTTGCTG GTCTTGCTAA GTATGTCGTT ACCGGCACTT CGGAATTTGG CATCATGGTT TCGGGTGGTT CGGCTGTTAC TCGCGGTGGG GAAACCACGG TTCGCGGCGA AAACACTACT TCGCTCGTTT CTAAAAAGCC GATCTGGACC GATGTGGGTT CGAACGTCGC CGTTACCAAG CATTCCGCTA TGGACCTTCA AACGGACGTG ATCTCGTTCA TCGCTGATCA GTTTGCGGAA GACTTCGCTG CCGACATGGC CGATGGGTTT ATCAACGGCA CTGGCCTAAA TGACGACCCG CAAGGCATCC TTACCGCTGG CATCGAACTG AGCGGCGCTT CGGTTACGCC GGAACTGATC ATCGATCTTG CTTACAAAGT GAAGACGGTT GATCGCAACA ACGGCGCTTA CCTGATGGCT GGCACGACTG CTGCGGCTCT TTCGAAAGCC AAGGCTAACT CGCAATTCGT TCTCGAAATC AAAGAGGGCG TTACGATGAT CAACGGTCGT CCTGTGCATG TTGACGATTA CCTCCCGGAA GAGACGCCGG TTGTGTTCGG TAACTACAAG CGCGCCTTCC TGACTGCGGT TCGCGCCGAA GGTGTAACCG TCCAGATCAA CCCGTATAAG CAATCGAACG TGATCTTCGT TGAAGGCAAT CTTCGTTACG GTTCTGTTGT GCTTAACGGT GACGCTTACG CGAAGCTCGT TATCGCCTAA
Protein sequence | MTIELKNAIE ASNQLIQAIR SEVEGVKSAD ALFEGKMARM EAELAASLSA KSALEARLNA LETAAARPVS GKAAEAADEC KSAFRNWLAN PESFEAKQAF EQKALATTGL ANVIPRTVSD EVIAAARGYS ALAGLAKYVV TGTSEFGIMV SGGSAVTRGG ETTVRGENTT SLVSKKPIWT DVGSNVAVTK HSAMDLQTDV ISFIADQFAE DFAADMADGF INGTGLNDDP QGILTAGIEL SGASVTPELI IDLAYKVKTV DRNNGAYLMA GTTAAALSKA KANSQFVLEI KEGVTMINGR PVHVDDYLPE ETPVVFGNYK RAFLTAVRAE GVTVQINPYK QSNVIFVEGN LRYGSVVLNG DAYAKLVIA
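The sequence fields above are printed in space-separated blocks of ten characters. The sketch below shows one way to rejoin such a field and compute a GC percentage from it. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the database may compute or round its reported GC content differently, so a result from this code should be treated as a rough cross-check only.

```python
def clean_sequence(field: str) -> str:
    """Remove the whitespace that splits a sequence field into blocks."""
    return "".join(field.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an uppercase DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First two blocks of the gene sequence, used as a short example.
example = clean_sequence("ATGACTATTG AACTTAAAAA")
print(example)              # ATGACTATTGAACTTAAAAA
print(len(example))         # 20
print(gc_percent(example))  # 20.0
```

Passing the full Gene sequence field through `clean_sequence` should yield a string whose length can be compared with the Gene Length field.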