Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1302 |
Symbol | |
ID | 4078501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1394279 |
End bp | 1396231 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638006610 |
Product | phage terminase GpA |
Protein accession | YP_613297 |
Protein GI | 99081143 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.656027 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGCCC GGGCTGATTT CTCGAATGCG CGGGCTGTTG TCCGCATTGC TCGGCGCGCC CGGGCATTTC TGCGCCCGCC GCCGAACCTA AAGCCGTCGG AGTGGGCCGA GGAAAATATC AAGATCCCCG TCGGAAACGC TGTGCCGGGC AAAATGCGGT TCGACAACGC GCCCTATCAG CGCGAAGTGA TCGATATGAC CGCAGATCCG CGGTGCAACC GGATCTCGCT CATGTGGGGC GCGCAGGTCG GCAAGACGCA GACCGCGCTT GCGGCGCAGG CCTATCGCAT CGGCTTCAAT CCTGTGTCTC AGATGATGAT GCAGCCGAGC CAAGGCGATT TGACGACGTG GCTCGAGACC AAATTCAACC CGCTTGTTGA AGAGAACGAA GACCTCGCCG AGCTGATCGC GAAGCCGCGC GGGCGCCAAG GCGTCAATAA TCAGCGCATG AAGAGTTACC CGGGCGGGTT TCTCATGTTC AGTTGGTCGG GTTCTCCGAA GACCATGCGC GGACGTTCGG CGCCGTTCAT TGTTTGCGAT GAAACCGACG GCTACGACCG GACAAACGAA GGCCACCCGG TCGGCCTGCT GTGGCAACGG GCGGCGACCT TTGGTGATCA GCGGCTCTTG TTGGAAATCA GCACGCCGAC GATCAAGGGC GGCAGCTGGA TTGAAAAGTC CTTTGAGGCA GGAGACCAGC GCTATTTTTA CGTGCGGTGT CCGCATTGCG GCCACCTGCA ACGGCTGAAC TGGTCGCAGG TGACCTGGTC CAAAGACGCC GACGGGCTGC ACCTCGCCGA AACAGCAGGC TATCTGTGCG TGGGTGAGGG ATGCGGAACC GTTTGGAGCG ATGGCGAGCG AGTGGCAGCG ATCCGCAACG CCGAACGCGA CGGCGGCGGC TGGATCGCCA GCAAGCCGTT CCGAGGTCAC GCGTCCTATC ATTTGTCGGA GCTGTATTCC TGCTTTCGGC GGCTCGAGGA TATCGTGCAG TCCTTCCTCG ACAAAAAGGC CGCGGGGGAT TTGCAAACCT TTGTGAACGT GTCGCTCGCT GAGACATGGG AAGAGGAAGG CGACAAGCTC GAGGCGTCCG TGCTGATGGC ACGGGCTGCT AAGTTCGCAG CACCGGTCCC GTTGGGGGCA GGCGTCCTGA CGGCTGGTAT TGATATGCAA AACGACCGCC TCGAGGTCGA AATAGTTGGC TGGGGCTTGG GGGAGGAATC CTGGTCTGTC GATTATCGGG TTTTGTGGGG CGATCCGCTG CAACAGGACG TCTGGGACGA ACTAGACGCC TTGCTGTCGG AAACATGGGA GCACGAGAGC GGAGCGGAGT TGCGGGTCTC TGCCGCCTGC CTCGATACCG GGGGTGAAGG TGGGCGCACG CAAGCAGCCT ATGACTATGC CCGCAAGCGT TTGGGCCGCA AGGTCTGGGC GATCAAGGGC GTCGGTGGCT GGGGCAGACC CATCGTTACG CAGCCCTCGA AGGTCAAACA AAAGGGCGTG CGCCCCGTCT ACTTGCACTC CATCGGCGTC GATGAGGCGA AAGCCGTAGT CGCCCAGCGG GCGCGGATTA GCGACGCGGG GCCGGGCCAT TGCCATTTCC CGGCAGATCG GGATCCGGCG TGGTTCGATA TGTTCACCGC CGAGGCGCTG CGCACCCGGT ATGTGAAGGG GTTCGCAGTC CGAGAATGGC ACAATGTACG CCCACGCAAC GAAGCATTTG ACTGCCGCGT CTATGCCTAC GCAGCGCTGC GCATCCTGCG CCCGAATATC AAACGCCTGG TCGCGTCACT GGATGTTCAG GGGCAGGAGA CCGAGGACGA GGCCGTCGAG ATTTCACCAG ACGCTGAGCG AGCTTTGGTA AAACCGCCAG AGGAACCGGA AGCCCCAAAC AGCCAAGCTC CGAAAAAGAC TGGCTGGGGG GCGAAGAAGC GCCGACGCCG TCGCAGGTAT TGA
|
Protein sequence | MNARADFSNA RAVVRIARRA RAFLRPPPNL KPSEWAEENI KIPVGNAVPG KMRFDNAPYQ REVIDMTADP RCNRISLMWG AQVGKTQTAL AAQAYRIGFN PVSQMMMQPS QGDLTTWLET KFNPLVEENE DLAELIAKPR GRQGVNNQRM KSYPGGFLMF SWSGSPKTMR GRSAPFIVCD ETDGYDRTNE GHPVGLLWQR AATFGDQRLL LEISTPTIKG GSWIEKSFEA GDQRYFYVRC PHCGHLQRLN WSQVTWSKDA DGLHLAETAG YLCVGEGCGT VWSDGERVAA IRNAERDGGG WIASKPFRGH ASYHLSELYS CFRRLEDIVQ SFLDKKAAGD LQTFVNVSLA ETWEEEGDKL EASVLMARAA KFAAPVPLGA GVLTAGIDMQ NDRLEVEIVG WGLGEESWSV DYRVLWGDPL QQDVWDELDA LLSETWEHES GAELRVSAAC LDTGGEGGRT QAAYDYARKR LGRKVWAIKG VGGWGRPIVT QPSKVKQKGV RPVYLHSIGV DEAKAVVAQR ARISDAGPGH CHFPADRDPA WFDMFTAEAL RTRYVKGFAV REWHNVRPRN EAFDCRVYAY AALRILRPNI KRLVASLDVQ GQETEDEAVE ISPDAERALV KPPEEPEAPN SQAPKKTGWG AKKRRRRRRY
|
| |