Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_2352 |
Symbol | |
ID | 3719887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 978176 |
End bp | 979708 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640070529 |
Product | putative terminase large subunit / phage terminase |
Protein accession | YP_352410 |
Protein GI | 77462906 |
COG category | [R] General function prediction only |
COG ID | [COG4626] Phage terminase-like protein, large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.769747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGTTCC TGCGCCTGCT GAAGCACCCG GCGAGCACCG CGCCGGGCCG GGCCTTCCAG CTCTACCCGT GGCAAGAGCG CATCGTCCGG GCGATCTACG GACCCCGGCA CCCGGACGGC AGCCGGATCG TGAAGACGGT CTTCCTGCTG GTGCCGAGGG GCAACCGGAA GACCAGCCTC GCCGCCGCCC TGGCCCTTCT CCATACCCTC GGCCCCGAGC GCGTGCCCGC CGGGCAGGTG ATCTTCGCCG CCTGCGACCG CGAGCAGGCG GGCATCGGCT TCCGCGAGGC CGCGAACATC ATCCGCGCCG ACAAGCGCCT CGTGGCGGCC ACCCGGATCT ATGACGCGCA CAACTCGGCA AAGAAGATCG TCTTCCGCGC CGAGGATGTG ACCCTCTCGG CGATCTCGTC GGATGGTGGC GCGCAGCATG GCCGGACCCC GGCCTTCACG TTGATTGACG AGCTGCACGC ATGGAAGGGC CGCGACCTCT GGGAAGCCCT GAAGTCCGGC ACCGCCAAGG TGGCCGACAG CCTGACCGTG ATCGCCACCA CGGCCGGGCG CGGCGCCGAC ACGGTCGCGG CGGAACAATA TGCCTATGCC TTGGCCGTCG CGCGGGGCGA GATCGTGAAC CCGGCCTTCT TGCCGATCTT GTTCGCGCCC GAGCCAGGCG ACGACTGGAC CGACGAAGCG ACGTGGCACA AGGTCAACCC CGGCCTGCGC CACGGCTTCC CGAGCCTTGA GGGCCTGCGC ACGCTGGCGA AGGAAGCCGA GGGCCGCCCG AGCGATGCCT ACGCCTTCCG GCAGTTCAAC TTGAACGAGT GGATGGCGAA CAGCCGCGAT CCGCTCTTCG ACTTCGACAC CTACGACGCC CGGCGCTTCG CCGAGGATCT GGACGAGCTT GAGGGCCTGC CCTGCTGGAT CGGCGTTGAC CTGTCCATCT CGGGCGACCT CTCGGCCGTG GTGGCGGCGT GGAAGCACCC GGACGGGCAG ATCAGTGTCA AACCGTGGTT CTGGGTGCCC GGCGACGACC TCAAGGTCCG CGCCGACCGC GACGGCGTGC CTTACGAGCG ATGGCGCGAT GAAGGGCACC TGATCGCCAC GCCCGGCGCG ATCATCGACC AGAGCGCCGT GGCCGACCAG ATCCGCGAGC TATGCGCCCG CTTCGACGCG CAGGAAGTCG CCTTCGACCC GCACCTTGCC CGGCCGATGA TGCAGAGCCT CTATGACGAG GGCCTGCCGG TGGTGGAGAT GCGACAGGCG CCGCTGACTA TGGGCGTTGC CGCTGGCGAT CTTGAGCGCG TGGTGAACGG TCGCCTGATC CGCCACGACG GGCACCCGGT CCTGCGCAAC CATCTTCAGA ACGTGTGCGC CTCACGCAGC GAAAGCGGCC TGATCCGAAT GCACAAGGTC CAGCGCACCA GCCGGATTGA CGGCGCCGTT GCCGCTGCGA TGGCGGTTTC GCGCGCCGTC GCCGCCGACA GCCGCAAGTC GGCTTACTCC GATCCCGAGG CCGAGGGCCT GTTTGTTTTC TGA
|
Protein sequence | MKFLRLLKHP ASTAPGRAFQ LYPWQERIVR AIYGPRHPDG SRIVKTVFLL VPRGNRKTSL AAALALLHTL GPERVPAGQV IFAACDREQA GIGFREAANI IRADKRLVAA TRIYDAHNSA KKIVFRAEDV TLSAISSDGG AQHGRTPAFT LIDELHAWKG RDLWEALKSG TAKVADSLTV IATTAGRGAD TVAAEQYAYA LAVARGEIVN PAFLPILFAP EPGDDWTDEA TWHKVNPGLR HGFPSLEGLR TLAKEAEGRP SDAYAFRQFN LNEWMANSRD PLFDFDTYDA RRFAEDLDEL EGLPCWIGVD LSISGDLSAV VAAWKHPDGQ ISVKPWFWVP GDDLKVRADR DGVPYERWRD EGHLIATPGA IIDQSAVADQ IRELCARFDA QEVAFDPHLA RPMMQSLYDE GLPVVEMRQA PLTMGVAAGD LERVVNGRLI RHDGHPVLRN HLQNVCASRS ESGLIRMHKV QRTSRIDGAV AAAMAVSRAV AADSRKSAYS DPEAEGLFVF
|
| |