Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_3781 |
Symbol | |
ID | 3721541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007494 |
Strand | - |
Start bp | 907304 |
End bp | 909466 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640073452 |
Product | putative phage terminase large subunit |
Protein accession | YP_355289 |
Protein GI | 77465786 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGAGA TGCTCGACCG CGGCATCGGG CGGCTCACCC GCATTCCGCC CCTGCCGCCC TTCACCGCCC CCGAGGAGAT CCTGGCAGAC GCGCTGCCGC TCCTCGATCC GCCGAGCCGG GTCACGGTGA CCGAGGCGGC CGAGCGGCAC ATGCGCGTGC CGGTGCAGGG CAACTGGGTG CCGTTCGACC GGGCGGTGAC GCCCTATACC GTCGAGCCCG CCGACATGAC CCAGTCGCGC CGCTTCAAGG CCGTGGTCTT CCTCGGGCCG TCGCAGAGCG GCAAGAGCCA GATGATGCAG TCGGTCTCGG CCCATGCCGT CACCTGCGCG CCGGGCCCGG TGCAGGTCAT CCACATGACC AAGACCGATG CCGACGCCTG GGTCGAGGAG AAGCTCGACC CCACGATCCT GAACAGCCCG GCGCTGCGCG AGCGCCTGGG CACCGGGCGC GACGACAGCA CCTTCAGCCG CAAGCGCTTC AAGGGCATGC GGCTCACCAT CGGCTATCCG GTGCCGAACC AGCTCTCGAG CCGGTCTCAG CGCCTCGTGA TGCTCACCGA TTACGATCAC ATGCCCCAGA AGCTCGGGCC GAAGGACAGC CCGGAGGGCT CGCCCTTCGG CATGGCGCTG CAGCGGATCC GCACCTTCAT GAGCCGGGGC TGCGTCCTGG CCGAGACCTC GCCCGCCTTC CCGGTGGACC CGAATGCGGA CTGGGCGCCG CATGCCGGCC ATCCGCACAT GCTGCCGCCG GCCACGGCCG GGCTCGTGCC GATCTACAAC GAGGGCACGC GCGGGCGCTG GTACTGGGAA TGCCCGGACT GCGGCGATCT CTTCGAGCCG CGCTTCGACC GGCTGCATTA CGATGCGGAT CTCGATCCGG GCGCGGCGGG CGAGCAGGCG ATGATGGAAT GCCCGCACTG CGGAACGCTC ATCGCCCACC GTCACAAGGT CGGCCTCAAC CGCGCCGCGC TCGAGGGTCG CGGTGGCTGG CTGCACGAGG GCCGCCACAT CGAGGCGAAC GGGCGCCGGG CGCTGGTCCG GATCGACGAT CCCGACATCC GACGCACGCC CATCGCGAGC TACAGTCTGA ACGGGGCCGC CGCGGCCTTC GCCTCGTGGG AGGAGCTGGT CCAGCGCTAC GAGACCGAGC GGCGGCGGTT CGAAGCCTTG GGCGACGACA CCGACTTCGC CCGGGTGCAT TACACCGACA TCGGCGTGCC TTACCGGCGC CCCGAGGCCG AAGAGGAGGG CGCCCTCACG GCGGCGCAGA TCCGCGAGCA CATGCGCAGT CAGGAACGGC GCGTGGCCCC GGCCTGGACG CGCTTCGTCA CGGTCTCGAT CGACGTGCAG GGCAACCGCT TCGAGGTGCT GGTCATGGCC TGGGGCGCGC AGGGCGAGCG GATGCCGATC GACCGGTTCG CGGTGGCGCA GCCTCCCGAC CATGCCCCGC GCGCGAAGGG TGACGACGAG CGATACCGGG CGCTCGACCC CGGCCGCTAT GTCGAGGATG CCGATGCGCT CCTCGATCTG CCCGAGCGTC TCTATCCGGT GGAGGGGGCG AGCTGGAGCC TGAAGCCCTG CGCGCTGGTG ATCGACTTCA ACGGCCCGGC CGGCTGGTCG GACAATGCCG AGAAGTTCTG GCGCGCGCGC CGGCGCAACG GTCAGGGCGG GCTCTGGTGG CTCTCGATCG GCCGCGGGGG CTTTCAGCAG CGCGACCGGG TCTGGCACGA GGCGCCCGAG CGGGGCTCGA AGGGCAGGCG CGCGCGCGGC ATCAAGCTGC TGAACATGGC GACCGACCGG ATGAAGGAGA GCGTCCTCGC GGCCGTCGGC CGGTTCGAGG GCGGTCAGGG CGCCCAGCAT GTGCCCTCCT GGCTCGAGGC GGAGCATCTC GACGAGCTCC TCGCCGAGCG CCGGGGCGCC AAGGGCTACG AGAAGCGCCA GGGCGCTGTC CGCAACGAGA CGCTCGATCT CTCGGTGCAG GCGCTGGCCG TAGCGGAGTT CAAGGGGCTG AACCGGATCG ACTGGCAGGC GCCGCCCGCC TGGGCCGAGG CGGGGCCCGC CAACCCGTTC GCCGTGGCCG TGTCCGCAGC TGCGGCAGAG GCCGCACCGG CCCCGCGCCG GCGCGCGCGG ACCTCGCGCT CGCGATACAT GGAGGGATCA TGA
|
Protein sequence | MVEMLDRGIG RLTRIPPLPP FTAPEEILAD ALPLLDPPSR VTVTEAAERH MRVPVQGNWV PFDRAVTPYT VEPADMTQSR RFKAVVFLGP SQSGKSQMMQ SVSAHAVTCA PGPVQVIHMT KTDADAWVEE KLDPTILNSP ALRERLGTGR DDSTFSRKRF KGMRLTIGYP VPNQLSSRSQ RLVMLTDYDH MPQKLGPKDS PEGSPFGMAL QRIRTFMSRG CVLAETSPAF PVDPNADWAP HAGHPHMLPP ATAGLVPIYN EGTRGRWYWE CPDCGDLFEP RFDRLHYDAD LDPGAAGEQA MMECPHCGTL IAHRHKVGLN RAALEGRGGW LHEGRHIEAN GRRALVRIDD PDIRRTPIAS YSLNGAAAAF ASWEELVQRY ETERRRFEAL GDDTDFARVH YTDIGVPYRR PEAEEEGALT AAQIREHMRS QERRVAPAWT RFVTVSIDVQ GNRFEVLVMA WGAQGERMPI DRFAVAQPPD HAPRAKGDDE RYRALDPGRY VEDADALLDL PERLYPVEGA SWSLKPCALV IDFNGPAGWS DNAEKFWRAR RRNGQGGLWW LSIGRGGFQQ RDRVWHEAPE RGSKGRRARG IKLLNMATDR MKESVLAAVG RFEGGQGAQH VPSWLEAEHL DELLAERRGA KGYEKRQGAV RNETLDLSVQ ALAVAEFKGL NRIDWQAPPA WAEAGPANPF AVAVSAAAAE AAPAPRRRAR TSRSRYMEGS
|
| |