Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1058 |
Symbol | |
ID | 4077198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1137683 |
End bp | 1138888 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638006362 |
Product | Phage portal protein, HK97 |
Protein accession | YP_613053 |
Protein GI | 99080899 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0489201 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCTTTG ACCTGCTGCG ACGCAAGACG GGGCGAGTGG ATCAGACGGC GGCTGCGGCC CCGTCGCTCA AGGCCAGCGC GGCTGCGCGG GTGCTGCCCA TGGGCAATGG CACACGCGCT GCCTGGGGGC CGCGCGATAC GGTCTCTTTG ACCCGGGCCG GGTTTATGGG CAACCCGGTT GGACATCGCG CGGTGAAGCT CATCGCCGAG GCCGCAGCCG CGCTGCCCCT GGTGCTGCAA AGCACGGAGG CCCGCTACGA GAGCCATCCG CTGCTGGCGC TGCTCTCCCG ACCCAACGCC ACCCAGGCGC GGGCCGAGAT GCTGGAGGCG CTCTATGCCA ACCTTTTGTT GTCAGGCAAC GCCTATCTGG AAGCGGTCGC GGCAGAGGAG GGCTGGCCCG TCGAGCTGCA TGTGCTGCGC CCGGATCGCA TGCGGGTGGT GCCGGGGGAA GATGGCTGGC CGGTGGGCTA TGACTATGCG GTGGGCGGCA AGACGCATCG CTTTGCCATT GACCCCGCGC GCCCGGCGAT CTGCCACCTC AAGAGCTTTC ACCCGCTGGA TGATCACTAC GGGCTCGCGC CGCTCCAGGC GGCTGCCACC GCGGTCGAGG TCCATGGCGC CGCCGCGCGC TGGTCAAAAT CCCTGCTGGA CAATGCGGCG CAACCCTCCG GCGCGCTGGT CTGGACCGGC TCGGATGGGC TGGGGCAGAT GGGCGACGAC CAGTTCCGCC GCCTCACCGA AGAGATCGAG GCCAATTTCC AGGGTGCGCG CAATGCCGGG CGGCCCATGG TGCTGGAGGG CGGGCTCGAC TGGAAACAGA TGGGCTTCAG CCCCTCAGAT ATGGAGTTTC ACCGCACCAA GGACAGCGCT GCGCGCGAGA TCGCGCAGGC CTTTGGGGTG CCGCCGATGC TCTTGGGCAT TCCGGGCGAT GCCACCTATG CCAACTATCA GGAGGCCAAC CGGGCGTTTT ATCGCCTGAC GGTGCTGCCC TTGGCGATGC GGGTGGCGGC GAAACTCTCC GATTGGCTGA TGCGCTTTGG CACCGAGGCG GTCGAGCTCA AACCCGATCT TGACCAAGTG CAGGCCCTCA GCAGCGAGCG CGAGGCCCAG TGGCGGCGCA TCACTGCGGC TGATTTCCTG AGCGAGGCCG AGAAACGCCA GATGCTTGGG CTGCCCCCAC GCACCGCGGA GGTGGCGGAT GACTGA
|
Protein sequence | MVFDLLRRKT GRVDQTAAAA PSLKASAAAR VLPMGNGTRA AWGPRDTVSL TRAGFMGNPV GHRAVKLIAE AAAALPLVLQ STEARYESHP LLALLSRPNA TQARAEMLEA LYANLLLSGN AYLEAVAAEE GWPVELHVLR PDRMRVVPGE DGWPVGYDYA VGGKTHRFAI DPARPAICHL KSFHPLDDHY GLAPLQAAAT AVEVHGAAAR WSKSLLDNAA QPSGALVWTG SDGLGQMGDD QFRRLTEEIE ANFQGARNAG RPMVLEGGLD WKQMGFSPSD MEFHRTKDSA AREIAQAFGV PPMLLGIPGD ATYANYQEAN RAFYRLTVLP LAMRVAAKLS DWLMRFGTEA VELKPDLDQV QALSSEREAQ WRRITAADFL SEAEKRQMLG LPPRTAEVAD D
|
| |