Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_2358 |
Symbol | |
ID | 3719895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 984222 |
End bp | 985484 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640070537 |
Product | phage phi-C31 gp36-like protein /HK97 family major capsid protein |
Protein accession | YP_352418 |
Protein GI | 77462914 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.172606 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCGCG CAGGCGCCGT CTCGGGCCTC TCCATCGGCT TCGAGACCGA GGCCGCCAAG CCCCGCGCCC GTGGCCGCTC CATCTCGAAG CTGAGGCTTC TCGAAGTCTC CGTTGTCGCC GTCCCGTGTC ATCCGGGCGC GCAGATCCAT TCCATCAAGG CCGCAGATGA CACGGCAGAA CCATGCACCG AAGGAAAGAC CCCCGTGGAG AACGAAGACC AGACCACCCC GGCCAACGCG CCGGAGATCG ACACCAAGGC GTTCGACGCG CTGAAGCAGC GCCTCGACCA GCTCGAAGCA AAGGCCAACC GCCCCGGCGT CACGACGACC GGCCCGGCCC CGAGCGCCGA AGCGAAGGCC TTCGGCGGCT ATGTCCGGCG CGGCGTGGAG CGGATGGACC CCGCTGACAC CAAGTCGCTG ACCGTCTCGA CCGCCGCGAA CGGCGGCTAC CTCGCGCCGA AGGAGTTCGG CGACGAGCTG TTCAAGAACC TGATCGAGTT CAGCCCGATC CGCAAGTATG CCCGCGTCGT CCAGATCAGC GCGCCCGAGA TCACCTATCC CAAGCGCGTC ACCGGCACCT CGGCGACCTG GGTCTCGGAA GTCGGCGACC GCACCGGATC GGAACCGAGC TTCGATCAGG TCACGCTGAC CCCGCACGAG CTGGCGACCT TCACCGACAT CTCGAACGCA CTTCTGGAAG ACAACGCCTA CAATCTCGAA GGCGAGCTGA TGGCCGACTT CGCCGAGAGC TTCGGGCGCG CCGAGAGCGC GGCCTTCGTC AACGGCGACG GTGTGGGCAA GCCGAAGGGT ATCATGGCGG CGGCGGGCAT CGCGACCCTG AGCGGCGGTG CGGGCACGAT CACCGTTGCA TCGCTGATCG AAGCCTATCA CGCGATCCCT ACCGTCTATG CACAGAATGC TGTCTGGGTG ATGAACCGCA CCACGCTGGC CAAGCTGCGC ACCTACTTCA ACGGCATGGG CGAGCCGCTT CTCCTGGACA GCATCTCGGA GAAGGCCCCG ACCACGCTTC TCGGCCGCCC CGTGGTCGAA GCGCCGGATA TGCCGAACAT GACGGCGGGC GCCACCCCGA TCCTGTTCGG CGATCTGTCC GGCTACCGCA TCGTGGATCG CGTGGGCCTC GCGATCATGC GCGACCCGTT CAGCCTCGCG ACCAAGGGGC AGGTCCGCTT CCACGCCCGC AAGCGTGTGG GTGCCGACCT GACGCACCCC GACCGCTTCG TGAAGCTGAA GGTCGCGGCC TGA
|
Protein sequence | MIRAGAVSGL SIGFETEAAK PRARGRSISK LRLLEVSVVA VPCHPGAQIH SIKAADDTAE PCTEGKTPVE NEDQTTPANA PEIDTKAFDA LKQRLDQLEA KANRPGVTTT GPAPSAEAKA FGGYVRRGVE RMDPADTKSL TVSTAANGGY LAPKEFGDEL FKNLIEFSPI RKYARVVQIS APEITYPKRV TGTSATWVSE VGDRTGSEPS FDQVTLTPHE LATFTDISNA LLEDNAYNLE GELMADFAES FGRAESAAFV NGDGVGKPKG IMAAAGIATL SGGAGTITVA SLIEAYHAIP TVYAQNAVWV MNRTTLAKLR TYFNGMGEPL LLDSISEKAP TTLLGRPVVE APDMPNMTAG ATPILFGDLS GYRIVDRVGL AIMRDPFSLA TKGQVRFHAR KRVGADLTHP DRFVKLKVAA
|
| |