Gene Information |
Locus tag | TM1040_1061 |
Symbol | |
ID | 4077201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1139733 |
End bp | 1140929 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638006365 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_613056 |
Protein GI | 99080902 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0887541 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGACC ATCCCTTCAC GGGCCACGCG CCCGAGGATG CGGCGACCCC GCCGCAGAAT GTGGCCACGG AAGTGAAACA GGCTGTTTCG CAATTCGTGC AGCATTTCAA GGGGTTCCAA GACGACGTGA CCCAAAAACT CAAACAGACG GAAGAGCGTA TGACCATGTT GGATCGTAAA ACCCAAACCG CGGCGCGGCC GCATCTCGCA GCGGCCGAGG TCGATGGCGC ACCGCATCAA AAGGCGATGC AAGCCTATCT GCGCCAGGGC GACGAGGAGG GCTTTCGCGG CCTCGACCTG GAGGGCAAGG CCATGTCCAC GGCGGTGAAT TCCGATGGCG GTTTCCTGGT CGATCCGCAG ACGGCGGATG TGGTGAGATC GGTGCTGCAC TCCACTGCGT CGATCCGTGC GGTGGCCTCG GTGGTCAATG TGGAGGCGAC CTCCTTTGAC GTGCTGATCG ACCATTCCGA CGTGGGCGCG GGCTGGGCCA CCGAGACCGG CTCGGTTACA GAGACCGGCA CGCCGTCCAT TGACCGTATC GTGATCCCGC TGCACGAGCT CTCGGCCTTG CCCAAGGCTT CGCAACGGCT GCTGGATGAC AGTGCCTTTG ACATCGAGGG CTGGCTTGCG GGCCGGATCG CCGACAAATT CGCCCGTGCC GAAGCGCAGA GTTTTATCTC GGGGGATGGC GTGGACAAAC CCACCGGCAT CCTGACCCAC CCCACGGTGG ACAATGGCAG CTGGAGCTGG GGGAACATCG GCTATGTGGC CACCGGCAGC GATGGCGGCA TAGGCTCGGC AGATGCGATC GTGGATCTGG TCTATGCGCT GGATGCGCGC TACCGCGCTG GGGCGAGCTT TGTGATGAAC TCCAAAACGG CGGGTCTTAT CCGCAAACTC AAGGACGCCG ATGGTCGTTT CCTGTGGTCC GACGGTCTTC AGGCGGGCGA ACCTGCGCGG CTGATGGGCT ATCCGGTGCT GGTGGCCGAG GATATGCCGG ATGTGGCCTC GGATAGTCTC TCGATTGCCT TTGGGGATTT TGCCGCGGGC TACACGATCG CCGAGCGTCC CGATCTGCGC GTTCTGCGCG ATCCCTTCTC TGCAAAACCG CATGTGCTGT TTTATGCCTC CAAGCGCGTG GGCGGCGACG TGAGCGATTT TGCCGCGATC AAGCTGATGA AATTCGGGCT GAGCTGA
|
Protein sequence | MTDHPFTGHA PEDAATPPQN VATEVKQAVS QFVQHFKGFQ DDVTQKLKQT EERMTMLDRK TQTAARPHLA AAEVDGAPHQ KAMQAYLRQG DEEGFRGLDL EGKAMSTAVN SDGGFLVDPQ TADVVRSVLH STASIRAVAS VVNVEATSFD VLIDHSDVGA GWATETGSVT ETGTPSIDRI VIPLHELSAL PKASQRLLDD SAFDIEGWLA GRIADKFARA EAQSFISGDG VDKPTGILTH PTVDNGSWSW GNIGYVATGS DGGIGSADAI VDLVYALDAR YRAGASFVMN SKTAGLIRKL KDADGRFLWS DGLQAGEPAR LMGYPVLVAE DMPDVASDSL SIAFGDFAAG YTIAERPDLR VLRDPFSAKP HVLFYASKRV GGDVSDFAAI KLMKFGLS
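The coordinate, length, and GC fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is a minimal illustration and not part of the original record. The function names are hypothetical. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon, which is consistent with the values listed above (1197 bp, 398 aa).

```python
def clean_sequence(raw: str) -> str:
    """Strip the whitespace used to display the sequence in blocks of 10."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values copied from the record above.
length_bp = gene_length(1139733, 1140929)
print(length_bp)                           # 1197
print(expected_protein_length(length_bp))  # 398
```

To check the GC content field, pass the full "Gene sequence" string to `gc_percent`; the record reports the result rounded to a whole percent.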