Gene Information |
Locus tag | TM1040_1670 |
Symbol | |
ID | 4075773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1769678 |
End bp | 1770901 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638006983 |
Product | Phage portal protein, HK97 |
Protein accession | YP_613665 |
Protein GI | 99081511 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
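
The coordinate and length fields above are consistent with each other. The sketch below shows the arithmetic, assuming that Start bp and End bp are 1-based inclusive positions (an assumption that matches the reported values but is not stated in the record) and that the reported gene length includes the stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1769678, 1770901)
print(length_bp)                  # 1224, matching "Gene Length"
print(protein_length(length_bp))  # 407, matching "Protein Length"
```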
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCTGC GAGATCGCTT GAAATCCATG ATCATTCGCC GCTTTGGGCT GACTGATGCT CACCAGATGG GACTTCATCG TGCCAGTGAT GCGGGGGAGA TCGTCACCGG GCATAGCGCG CTCGGAATTT CGACGGTTTG GGCCTGCACC AACTTGATTG CGGGAACAAT CGGCTCGCTT CCGTTGATGG TGTATCGCCG GAACGGCCAA ACGCGCACCG TTGACCGCTC GCACGTCGTT TACCGACTGC TGCATGACAG CCCGAACTAT GATCAGACAG CGCTGGACTT CTGGGAATTT ATGGCGGCGT CTCTGGAGCT GTGGGGCAAC GCCTACGCGC ATATCTTGCG GGAAAATGGC AAGATCGTCG GCCTAGTGCC TGTCGCGCCA GATCTGATGA GGGTTCGCCG TCTGCCGACC GGTGAAATCG AATATCGTTG GTCGGAAGAT GGCAAAACCC ATCGGGAGCT TGACGGCGCT GTGCTGCACG TTCGCGGTTT TGGAGGTTCG CCGCTTGGCG GCATGTCTAC CCTACAATTC GCTCGCAATG CTTTCGGCTT GGCACGTGCG GTTGACCGAG CAGCGGGTGA GACGTTCAAG AACGGGATGC GCCCCTCTGG CGCGTTGAAA TTTGACAATT GGCTGACTGA TGAGCAGCGG GCCCGGGCCA AATCCACTTT GGTTGACGAC ATGGTGGGGG CGCAGAACTC CGGGCGCCCA ATTGTTCTGG AGGGTGGCAC CAATTGGGTG CCATTCACGA TCAACCCCGA TGATGCGCAA ATGCTGGAAA GCCGACGCTT CTCGGTGGAG GAAATCTGCC GGTTTTTCGG CGTGCCGCCG CATATGGTTG GGCACACAGA GAAAAGCACG AGTTGGGGAA CTGGCTTGGA GCAGCAAACC CTTGCGTTTC AGAAGTTCAC CCTTCGCCGC CGCATCAAGC GGATTGAGCA GGCGCTGATG AAACAGCTCC TGACCCCTGC TGAACGGGCG CGCGGGCTGA TGATCGAATT CAACCTGGAA GGGCTGCTTC GCGGAGACAG CAAGTCGCGC GCCGATTTCT ACCAGTCTGG CCTGCAGAAC GGCTGGCTGA CCATCAATGA GGTGCGCGCG CTGGAGAACA AGCCCCCGGT GGCGGGCGGC GAGGTGCCGC GAATGCAGAT GCAGAACGTG CCGATCACCG AGGTAGGCAA GCAATTGGAG GCTGGAAATG ACGATGATGC ATAA
|
Protein sequence | MGLRDRLKSM IIRRFGLTDA HQMGLHRASD AGEIVTGHSA LGISTVWACT NLIAGTIGSL PLMVYRRNGQ TRTVDRSHVV YRLLHDSPNY DQTALDFWEF MAASLELWGN AYAHILRENG KIVGLVPVAP DLMRVRRLPT GEIEYRWSED GKTHRELDGA VLHVRGFGGS PLGGMSTLQF ARNAFGLARA VDRAAGETFK NGMRPSGALK FDNWLTDEQR ARAKSTLVDD MVGAQNSGRP IVLEGGTNWV PFTINPDDAQ MLESRRFSVE EICRFFGVPP HMVGHTEKST SWGTGLEQQT LAFQKFTLRR RIKRIEQALM KQLLTPAERA RGLMIEFNLE GLLRGDSKSR ADFYQSGLQN GWLTINEVRA LENKPPVAGG EVPRMQMQNV PITEVGKQLE AGNDDDA
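
The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch for working with them, assuming the text has been copied into a Python string: it removes the grouping whitespace, computes GC content as a percentage, and checks that a coding sequence has a length divisible by 3 and ends in one of the three stop codons of translation table 11 (TAA, TAG, TGA). The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two display blocks of the gene sequence in this record.
fragment = clean_sequence("ATGGGTCTGC GAGATCGCTT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 55.0
```

Applying `gc_percent` to the full cleaned gene sequence would allow the value to be compared against the reported GC content field.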