Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_2471 |
Symbol | |
ID | 3720086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 1110244 |
End bp | 1111440 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640070650 |
Product | phage phi-C31 gp36 major capsid-like protein |
Protein accession | YP_352531 |
Protein GI | 77463027 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAGA CCTGGGCTCG GGCCGGGACA GGCATGTCCG CAGGCCCCGA TCCGGCCGTG GAGGCGAAAG CCGCAATGGC CGGTTTCCTG AAGGAGATCA ATCGCTTTCA GGAGGAGGTG AAGAATGTGC TGCAACAACA GGAAGAGCGT TTGACCATGC TGGACCGCAA AACCATGATC TACGGGCGCC CGGCGCTGGC GGCCGCGGCC GACCAGGAGG CGCCGCATCG CAAGGCGTTC GGGGCCTATC TCCGCTCGGG CGACGACGAC GGTCTGCGCG GCCTCGTCCT CGAGGGCAAG GCGATGACGG CGAGCGTCGC CTCGGACGGC GGCTATCTGG TCGATCCGCA GACCTCGGAC GCCATCCGCT CGATGTTGCT GTCCACGGCC TCGATCCGTC AGATCGCCGG TGTGGTCCAT GTGGAAGCCA CGAGCTTCGA CGTGCTGATC GACCGCACTG AGGTGGGGTC GGGCTGGGCC ACGGAGGCCG CCACGATCAG CGAAAGCGCC TCGCCCACCA TCGAGCGGAT CTCGATCAAG CTGCACGAAC TGTCGGCGAT GCCGAAGGCG AGCCAGCGGC TTCTGGACGA CTCGGCCTTC GACGTCGAGA GCTGGCTGGC GGGCAAGATC GCGACGCGCT TCATGCGGGC CGAGAGCGCG GCCTTCGTCA GCGGCGACGG GATCGACAAG CCGCGGGGCT TTCTGGCGCC GGCGAAGGTT GCGAACGCGA GCTGGAGCTG GGGCTCGATC GGCTATGTCC CCTCGGGTGC GGCGAGCGAT TTCCTCGCCA CGAACCCGGC CGATTGTATC ATCACCCTGA TCTATTCGCT CGGCGCCGAT TACCGCGCGA ATGCGACCTT CGTGATGAAT TCGAAGACCG CGGGCGCGGT GCGGAAGATG AAGGACTCGG ACGGCCGCTT CCTGTGGTCG GACGGGCTGG CGGCGGCGGA GCCTGCGCGG CTGATGGGAT ATCCGGTTCT GCTGTGCGAG GACATGCCGG ACATTGCCGC GGGCGCCTTT GCCATCGCTT TCGGGGATTT CGCCGCCGGC TACACGATCG CCGAGCGGCC CGAGGTGCGG GTCCTGCGCG ATCCGTTCTC GGCCAAGCCC CATGTCCTCT TCTATGCGAC GAAGCGCGTG GGAGGCGATG TCAGCGACTA TGCGGCGATC AAGCTCCTGA AGATCGCGGT GTCCTGA
|
Protein sequence | MTETWARAGT GMSAGPDPAV EAKAAMAGFL KEINRFQEEV KNVLQQQEER LTMLDRKTMI YGRPALAAAA DQEAPHRKAF GAYLRSGDDD GLRGLVLEGK AMTASVASDG GYLVDPQTSD AIRSMLLSTA SIRQIAGVVH VEATSFDVLI DRTEVGSGWA TEAATISESA SPTIERISIK LHELSAMPKA SQRLLDDSAF DVESWLAGKI ATRFMRAESA AFVSGDGIDK PRGFLAPAKV ANASWSWGSI GYVPSGAASD FLATNPADCI ITLIYSLGAD YRANATFVMN SKTAGAVRKM KDSDGRFLWS DGLAAAEPAR LMGYPVLLCE DMPDIAAGAF AIAFGDFAAG YTIAERPEVR VLRDPFSAKP HVLFYATKRV GGDVSDYAAI KLLKIAVS
|
| |