Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1624 |
Symbol | |
ID | 4077726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1731086 |
End bp | 1732372 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638006937 |
Product | Phage portal protein, HK97 |
Protein accession | YP_613619 |
Protein GI | 99081465 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0210145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATTT TTGATTTCCT TCGGCCAGCG GCGACGCTGG CTGAACCAGC CGTGCGCGCT GAACCACCGG TGTCTGCAGC TTCAGACACC TCGGTTTCGA GCGAACGTCA ATGGAAGGGT TTTGTCGTTG CGGGCGGTCG ATCAAAGGCC GGCGTGCCGG TGAGCGAAAC CACCGCCTTG ACCATTCCAG CCACCTTGCA GGCGCTTCGG ATCTTGACCG GCGTCTTTGC GATGACGCCG CTGCATTTTT ACCAGAAGAC CGACAGGGGG CGGATGTCGG CCGAAGGCAA CCCGGCGGCG CAGTTGTTTC GGCTGGGTCC AAACAGTCAC CAGACTGCCT ATGCGTTCTT TGAGCTCTTG TTGGCGGACA TCCTGTTGAC CGGGAATTTC TACGGCTATG TCAGCCGGGA TTTTCGGGGC GAGGTGAAGG CGGTGACCCG GCTGAAGCCC GGCCAGTGTC AGCCGGTCGA ATATTTCGAT CGCGCCGAAG GATCGATCCT TTTTTTCGAT GCCACCTTGC CGGATGGAAG CCATGAGCGT TTCCCGGCGC GTGATATCTT TCACGTTGGC GGGTTCTCTC GGGATGGGGT TCAGGGTCTC AACCCCGTGC AATACGCACG TGATGCCTTG GGCGGTGCAA TTGCAACAGC GGATCACGCG GCTCGGTTCT GGAACAAGGG CGGGCGCCCT TCGACGGTGC TGACCAGCGA GAAAGCCATT GGGCCTACCG ATAAGGATCG GATCAAGACC GACTGGACCC AAATGTATTC GGGTCCAGAC GCCGATATGG TTGCTGTCCT CGATCAAGAT CTAAAGGCCG AGTTCTTGGC TCATGATTTG AAATCCAGCC AATATCTGGA AACGCGACAG TTTCAGGTGG TCGATCTCGC CCGAATCTGG GGTGTGCCGC CGCACCTGAT CTTTGATTTG TCGCGCGCCA CCTTCGGTAA CATCGAGCAA CAAAGCCTCG AGTTCGTCAT CTATCACTTG GGCCCCCATT ACACGCGGGT TGCCCAGGCC GCGACTAAGG CCTTTGCCAA GATCGGGTTC TACTTTGAGC ATGTCACTGC CGAGCTAGTC AAAGGCGACC TGAAGAGCAG GATGGAGGCT TATTGGCTGC AACGCCAGAT GGGCATGGTG AACGGCAACG AACTGCGCAG CTACGAGAAC CTTCCCAACA TCGAAGGCAA TGCGGGGACG GACTACTGGA TGCCGGCGAA CATGCAAGTA GCCGGAGCAT CTGCGGCGCC GGGCAGCACC GAAGACACCA GCGGAGACCA ATCATGA
|
Protein sequence | MGIFDFLRPA ATLAEPAVRA EPPVSAASDT SVSSERQWKG FVVAGGRSKA GVPVSETTAL TIPATLQALR ILTGVFAMTP LHFYQKTDRG RMSAEGNPAA QLFRLGPNSH QTAYAFFELL LADILLTGNF YGYVSRDFRG EVKAVTRLKP GQCQPVEYFD RAEGSILFFD ATLPDGSHER FPARDIFHVG GFSRDGVQGL NPVQYARDAL GGAIATADHA ARFWNKGGRP STVLTSEKAI GPTDKDRIKT DWTQMYSGPD ADMVAVLDQD LKAEFLAHDL KSSQYLETRQ FQVVDLARIW GVPPHLIFDL SRATFGNIEQ QSLEFVIYHL GPHYTRVAQA ATKAFAKIGF YFEHVTAELV KGDLKSRMEA YWLQRQMGMV NGNELRSYEN LPNIEGNAGT DYWMPANMQV AGASAAPGST EDTSGDQS
|
| |