Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3357 |
Symbol | |
ID | 4898453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 407432 |
End bp | 409594 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640113956 |
Product | phage terminase GpA |
Protein accession | YP_001045225 |
Protein GI | 126464112 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.27922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGAGA TGCTCGACCG CGGCATCGGG CGGCTCACCC GCATTCCGCC CCTGCCGCCC TTCACCGCCC CCGAGGAGAT CCTGGCCGAC GCCCTGCCGC TCCTCGATCC GCCGAGCCGG GTCACGGTGA CCGAGGCGGC CCAGCGGCAC ATGCGCGTGC CGGTGCAGGG CAACTGGGTG CCCTTCGACC GGGCGGTGAC GCCCTATACC GTCGAGCCCG CGGACATGAC CCAGTCGCGC CGCTTCAAGG CCGTGGTCTT TCTCGGGCCG TCGCAGAGCG GCAAGAGCCA GATGATGCAG TCGGTCTCGG CCCATGCCGT CACCTGCGCG CCGGGCCCGG TGCAGGTCAT CCACATGACC AAGACCGATG CCGATGCGTG GGTGGAGGAG AAGCTCGACC CCACGATCCT GAACAGCCCG GCGCTCCGCG AGCGGCTGGG GACCGGGCGC GACGACAGCA CCTTCAGCCG CAAGCGCTTC AAGGGCATGC GGCTCACCAT CGGCTATCCG GTGCCGAACC AGCTCTCGAG CCGGTCGCAG CGCCTCGTGA TGCTGACCGA TTACGATCAC ATGCCCCAGA AGCTGGGGCC GAAGGACAGC CCGGAAGGTT CGCCCTTCGG CATGGCGCTG CAGCGGATCC GCACCTTCAT GAGCCGGGGC TGCGTCCTGG CGGAATCCTC GCCCGCCTTC CCGGTGGACC CGAATGCGGA CTGGGCGCCG CATGCGGGCC ATCCGCACAT GCTGCCGCCG GCCACGGCCG GGCTCGTGCC GATCTACAAC GAGGGCACGC GCGGGCGCTG GTACTGGGAA TGTCCGGACT GCGGCGATCT CTTCGAGCCG CGCTTCGACC GGCTGCATTA CGACGCGGAG CTCGATCCGG GCGCCGCCGG CGAGCAGGCG ATGATGGAAT GCCCGCACTG CGGAACGCTC ATCGCCCACC GTCACAAGGT CGGCCTCAAC CGCGCCGCGC TCGAGGGTCG TGGCGGCTGG CTGCACGAGG GCCGCCACAT CGAGGCGAAC GGGCGCCGGG CGCTGGTCCG GATCGACGAT CCCGACATCC GGCGCACGCC CATCGCGAGC TACAGTCTGA ACGGGGCCGC CGCGGCCTTC GCCTCGTGGG AAGAGCTGGT CCAGCGCTAC GAGACCGAGC GGCGGCGGTT CGAGGCGCTC GGCGACGACA CCGACTTCGC CCGGGTGCAT TACACCGACA TCGGCGTGCC CTACCGGCGC CCGGAGGCCG AAGAGGAGGG CGCCCTCACC GCGGCGCAGA TCCGTGAGCA CATGCGCGAG CAGGAGAGGC GCCTCGCCCC GGCCTGGACG CGCTTCGTCA CGGTCTCGAT CGACGTGCAG GGCAACCGCT TCGAGGTGCT GGTCATGGCC TGGGGCGCGC AGGGCGAGCG GATGCCGATC GACCGGTTCG CCGTGGCGCA GCCTCCCGAC CATGCCCCGC GCGCGAAGGG CTGTGACGAC CGCTACCGGG CGCTCGACCC CGGGCGCTAT GTCGAGGATG CCGATGCGCT CCTCGATCTG CCCGAGCGTC TCTACCCGGT GGAGGGGGCG AGCTGGAGCC TGAAGCCCTG CGCGCTGGTG ATCGACTTCA ATGGCCCTGC CGGCTGGTCG GACAATGCCG AGAAGTTCTG GCGCGCGCGC AGGCGCGACG GTCAGGGCGG GCTCTGGTGG CTCTCGATCG GCCGCGGCGG CTTCCAGCAG CGCGACCGGG TCTGGCACGA GGCGCCGGAG CGGGGCTCGA AGGGCAGGCG GGCGCGCGGC ATCAAGCTGC TGAACATGGC GACCGACCGG ATGAAGGAGA GCGTCCTCGC GGCCGTCGGC CGGTTCGAGG GCGGTCAGGG CGCCCAGCAT GTGCCCTCTT GGCTCGAGGC GGAGCATCTC GACGAGCTCC TCGCCGAGCG CCGGGGCGCC AAAGGCTACG AGAAGCGCAC GCCCGCCGCC CGCAACGAGA CGCTCGACCT CTCGGTGCAG GCGTTGGCCG TGGCGGAGTT CAAGGGGCTG AACCGGATCG ACTGGGAGGC GCCGCCCGCC TGGGCCGAGG CGGGGCCCGC CAACCCGTTC GCCGTGGCCG TGTCCGCGGC TGCGGCAGAG GCCGCACCGG CCCCGCGCCG GCGCGCGCGG ACCTCGCGCT CGCGATACAT GGAGGGATCA TGA
|
Protein sequence | MVEMLDRGIG RLTRIPPLPP FTAPEEILAD ALPLLDPPSR VTVTEAAQRH MRVPVQGNWV PFDRAVTPYT VEPADMTQSR RFKAVVFLGP SQSGKSQMMQ SVSAHAVTCA PGPVQVIHMT KTDADAWVEE KLDPTILNSP ALRERLGTGR DDSTFSRKRF KGMRLTIGYP VPNQLSSRSQ RLVMLTDYDH MPQKLGPKDS PEGSPFGMAL QRIRTFMSRG CVLAESSPAF PVDPNADWAP HAGHPHMLPP ATAGLVPIYN EGTRGRWYWE CPDCGDLFEP RFDRLHYDAE LDPGAAGEQA MMECPHCGTL IAHRHKVGLN RAALEGRGGW LHEGRHIEAN GRRALVRIDD PDIRRTPIAS YSLNGAAAAF ASWEELVQRY ETERRRFEAL GDDTDFARVH YTDIGVPYRR PEAEEEGALT AAQIREHMRE QERRLAPAWT RFVTVSIDVQ GNRFEVLVMA WGAQGERMPI DRFAVAQPPD HAPRAKGCDD RYRALDPGRY VEDADALLDL PERLYPVEGA SWSLKPCALV IDFNGPAGWS DNAEKFWRAR RRDGQGGLWW LSIGRGGFQQ RDRVWHEAPE RGSKGRRARG IKLLNMATDR MKESVLAAVG RFEGGQGAQH VPSWLEAEHL DELLAERRGA KGYEKRTPAA RNETLDLSVQ ALAVAEFKGL NRIDWEAPPA WAEAGPANPF AVAVSAAAAE AAPAPRRRAR TSRSRYMEGS
|
| |