Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_1761 |
Symbol | |
ID | 5083667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1795011 |
End bp | 1797776 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640483321 |
Product | phage tape measure protein |
Protein accession | YP_001167959 |
Protein GI | 146277800 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family [TIGR02675] tape measure domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.337369 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAACTG ACGTCGAAAA GCTGGTCGTC CAGCTCTCCG CGGACATCAA GCAGTATCAG CGCGAGATGA ACAAGGCAGT GGGCGTTTCA AACAAGCAGG CGAGGGCTGT CGAAAATCGC TTTCGTCAGA TGAACCGGAA CCTCGACAGC ATCGGCCGCA CCGCGGCCCG GTCGCTGGTC ACGCCCCTGC TCGGCATCGG GGCTGCCCTG TCGGTGCGCG AGGTGGGGCG CTACGCGGAC GCCTGGACCA ATGCCAAGAA CAGCCTGGCC GTGGCTGGCG TGACCGGCAG GGCGCAGGCC GATGTGCTCG ACAGGCTCTA TCAGTCGGCG CAGAGCAACG CGGCTCCGGT GGGCGCGCTG GCCGATCTCT ATGGCAAGGC CTCGCAGGCC TCGGACGTTC TGGGCGCCTC GCAGGCTGAC CTGATCAAGT TCTCCGACGG CGTCGCGACG GCGCTGCGCG TTCAGGGCGG GTCGGCGGCC CAGGCGTCCG GCGCGCTGAC GCAGCTCGGC CAGCTGCTCG GGTCGGCTCG CGTGCAGGCC GAAGAGTTCA ACTCGGTGAA CGAAGGCGCC CGGCCGATCC TGATGGCGGT CGCCGCTGGT CTTGATGAGG CGGGCGGCTC CGTCTCGAAG CTCAAGACGC TGGTGAATGA CGGCAAGGTC AGCGGACAGC AGTTCTTTCA GGCTTTCCTG CGCGGTCTGC CCACCATCCA GGGGATGGCG GCGAACGCCA CCACCACGAT CGAGCAGGGC ATCACGAAGG TCGAGAACGC GTTCACGCGC TACATTGGTC AGACCGATGA GAGTCTCGGT GCCTCTCAGC GTCTAGTGCA GGGCCTCTCT GCCCTTGCAG ACGATTTCGA GAGCATCGCC GACATCACGC TCAAGGTGGC CGCGCTGATC GGGGCCGGGC TGCTTGGCCG CTCCATCGCG GGCATGGTCA AAAATCTCGG CATCGCCGGA GTGGCCGCCA CGGAGTTCGT CGCCGCGATC CGGGTGGCAT CGTCCGTCTC CGGCGTCGCG AAGGCCATCG GCGGCTTGAG CGCGGCGGCG GGGCCGATCG GGGCCGTGGT GGGCGTGGCC GCGGTGGGCG CGCTGATGCT GTACTCGGCC AGCACGGCCG ACGCGCGCGA GGCTTCGGAG GCCTTCGAGC AGCGGCTCGA CCGCATTCGC GCCGCGGCGC CTCAGATGGC CACGGCCGTC GAGGACGGCG CTCGCCGCGC GAAGGATGCG ATGGAGAGCC TGAAAAACCC GGCGGTGATC AAGCTGGAGA TGGAGAGCGC AGAGGCTGAC GCGAACCTCG ATGCCATCCG CACAAACCTA CAGCGGGCAA TGGAACAGCT CGATCACCTG GCGGGCATGA AGGTCTTCAC CGAGGCTCAG CAGGAAGCGA TCAGGGACTT TAACGACAAG GTGCTCCAAG GCACGGCCAG CGCCGAAGAA ATGGCTGAGG TGCTGAAGGA TCTCGGTAAG ATCGACGGGG TGGGCCAGAG CCTGATCGAT TCCGTCACGA GCCTCTGGCG CGCGCTCAAC CTGACCTCGG CCGCGGCGCG GCAGGTCCAG TCGGACATTC AGGCCGCGCT CGGGGCCTAT GCGCCTGACC TGGGCCGGTT CGGTCAGGTC CAGGATGGGA TACTTTCCCG GCTCTACGCC GATGAAGCCG CCAAGCGCGC GGGCCGCGCC TATCTCGACG AACAGCAGCG CCTGCAAGGC CTCACGCGGG AGCAGCTCGC GCTTGAGACG GAAATCTCGC GCCTGCGGAA GAGCATGCCC GACGGCGCGA CCGTCAGCGA CGAGGATCTT GCAGAGACGG CGCGCGGCAA CCTCGCCGCG GCCGCTCGTC GGGCTGAGGA AGGCCGGAAG GGCGGCGGTG GCGGCGGTCG CGAGGCCAAG GCCGTCGAGC GCTTCGATGA CCGGATCCTG AAGGAGACCG AGGGGCTCAA GGCCGAGACC GCCGCGCTCA ATCAGCTGGA GCTCGGACAG GACCAATACG GCACCGCAGT GGCCCGCGCC CGCAAGGAAG CGGAGATGTT GCAGGACCTG CACAACAAGG GGCTCACGAT CACGCCGGCC CTGCGCGCGC AGGTGCAGGG GCTGGCCGCG GACTGGCAGC ATGCAGCCGA GGCGAACGCC ATGGCAACCG AGCGGCATGA GGAGTTCCAG AGCAACCTGC AGGACACGAA GGCCACCATG AGCCAAGCGT TCAGCGGGCT GATCAAGGGG GCTCACGGCT TCTCCGACGC GCTGGGGATG GTGATCGACA AGCTCGCCGA CATCGCCCTG AACGCGGCCT TCAACGGCCT CTGGAACGGC ATTCTCGGGG GAGCTGCGTC GGGCCTCGTG TCCGGGCTTG GGGGCTTCTC CAAGGGGGGC TACACCGGGC CGGGCGGCAA GCACGAGCCC GCCGGCGTCG TCCACAAGGG CGAGGTGGTC TGGAGCCAGG CCGATGTGGC GAAGGCCGGC GGCGTGGCCG TGGTCGAGGC GATGCGGCGC GGCGCCAAGG GCTATGCGTC GGGCGGGGTC GTGATGCCCG AGATGCCGCG CGTCCCGGCG CCGAAGGCGC TCGCGGCCAT GGCGGCCAAG GCGCAGGTCC CGCGCGTGGT CTATGTCCCG CAGCCTTACG TCGCCCAGGT CTCGGCCGAC GATGATGGCC GGATCATCGG CACCATGCGG CGGGTCGCCC TGGCCACGTC TGCGGCCACA GCGGCCGGCC AGCAGCGGCA ACTCGGATCG TCCATCAATA GCTACAGCGC GCGGGGCACG ACCTGA
|
Protein sequence | MATDVEKLVV QLSADIKQYQ REMNKAVGVS NKQARAVENR FRQMNRNLDS IGRTAARSLV TPLLGIGAAL SVREVGRYAD AWTNAKNSLA VAGVTGRAQA DVLDRLYQSA QSNAAPVGAL ADLYGKASQA SDVLGASQAD LIKFSDGVAT ALRVQGGSAA QASGALTQLG QLLGSARVQA EEFNSVNEGA RPILMAVAAG LDEAGGSVSK LKTLVNDGKV SGQQFFQAFL RGLPTIQGMA ANATTTIEQG ITKVENAFTR YIGQTDESLG ASQRLVQGLS ALADDFESIA DITLKVAALI GAGLLGRSIA GMVKNLGIAG VAATEFVAAI RVASSVSGVA KAIGGLSAAA GPIGAVVGVA AVGALMLYSA STADAREASE AFEQRLDRIR AAAPQMATAV EDGARRAKDA MESLKNPAVI KLEMESAEAD ANLDAIRTNL QRAMEQLDHL AGMKVFTEAQ QEAIRDFNDK VLQGTASAEE MAEVLKDLGK IDGVGQSLID SVTSLWRALN LTSAAARQVQ SDIQAALGAY APDLGRFGQV QDGILSRLYA DEAAKRAGRA YLDEQQRLQG LTREQLALET EISRLRKSMP DGATVSDEDL AETARGNLAA AARRAEEGRK GGGGGGREAK AVERFDDRIL KETEGLKAET AALNQLELGQ DQYGTAVARA RKEAEMLQDL HNKGLTITPA LRAQVQGLAA DWQHAAEANA MATERHEEFQ SNLQDTKATM SQAFSGLIKG AHGFSDALGM VIDKLADIAL NAAFNGLWNG ILGGAASGLV SGLGGFSKGG YTGPGGKHEP AGVVHKGEVV WSQADVAKAG GVAVVEAMRR GAKGYASGGV VMPEMPRVPA PKALAAMAAK AQVPRVVYVP QPYVAQVSAD DDGRIIGTMR RVALATSAAT AAGQQRQLGS SINSYSARGT T
|
| |