Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1671 |
Symbol | |
ID | 4075774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1770902 |
End bp | 1772683 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638006984 |
Product | phage terminase |
Protein accession | YP_613666 |
Protein GI | 99081512 |
COG category | [R] General function prediction only |
COG ID | [COG4626] Phage terminase-like protein, large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCCT CGGAGCGCAT TGACCAGACC ACGCGATACG CGCGGGATGT GGTTTCGGGA AAGATCGTAG CGGGCGCATT CGTGCGGGCG CAATGCCAGC GGCACCTAGA TGATCTGAAG CATGGCCCGG CGCGCGGGTT GCAGTGGGAT ATTGAGCAGG CCGAGCGGGC CATTCGGTTC TTTCCGGCCA TGTTGTCGAT CACCGAAGGG GCCAAGGAGG GCGAGCCATT CAAGCTGCTG CCCTGGCACT TGTTCGTGGT AGGGTCGATT TTCGGCTGGC GAACAGCGGA AGGTTTCATC CGGTTCCGGT TCGTGTGGCT GGAAACCGGC AAGGGGCAAG CGAAATCGCC GCTCATGGCG GCTGTAGGTA TCTATCTCAG CGGGTTCTAT GGCCGGAAGC GGGCCGAGGT CTACTGCATC GGGGAAACGA AGGACACGGC GCGGGTTATG TTCCGCGATG CGGTGGCGAT GCTTCGGGCG CCGATCCCAG GCAAAGGCGG CATGACGCTG GAAGACGCGG CGTTTGTGAT CCGCGGCACC GGCGACCTTG CCTACAGCGT CGAGCATCCC GACAGCGGAT CATTCATGCG GCCCATCGCG AACAATGACA GCGTTTCGGG GCCAAAGCCG ATCCTTGTGG CTGGCGATGA GATCCACGAG ATGAAAAGCG GCAAGGCGAT TGAGATGTGG CGGGCCGCAG TCACCAAGAA ACACGGTGAC AGCATACTGA TGCTTGGAAC GAACACCCCA AGCGCAGACC AGCAGGTGGG GACGGACTAC AGCGAGTTTT GCCAGAAGGT GGTGACCGGC GACTTCACTG ACGACAGCGT GTTTGCATAT ATCGCGCGGG TGGACGAGGG CGACGACCCG CTGGAAGACG AAAGCTGCTG GGTTAAGGCG TTGCCCGCCC TGGGCATCAC CTATCCGGTG GACAACGTGC GCAAGCTGGT AGTGACGGCG AAACAGCAGA TCAGCACGCA GCTGACCACA AAGCGCCTGT ATTTCGGAAT CCCGGTAGGC TCTGCAGGGT TCTGGACCTC TGAGCAGGCA TGGAAGGAAG TGCAGGGCAA GGTCGATGAT GCGAAGATGA TCGGCCGCCG GGCGCATTTG GCGCTCGACC TCTCCGAGAA GAACGACCTC ACCGCGCTCG CGGTGGCGTG GGAGGGCGAA AGGATCGATG TCAAATCATG GTATTGGACT CGTGAATTTG AGATCGAAGA GCGATCAACG GCAGATGCGA TCCCTTATCG CGAGTTGGAA GCGGCGGACC TGATAGAAGT CACGCCGGGG CGCGTAATCG ACTACACGTT CATTGCGGCG AAGATTATCG ACTTTTGTGG ACGCCACTCA GTGGTGCAGA TGGCTATCGA CAGCGCCCAC ATGGAAAAGC TCTGCGAGGC CTTCGATAAG GCGGGTTTCG CGTATTGGAT CGAAGAAGGC GACGACAAGC CGGGCAGCGG CCTGAAAATT GTCAGGCATA AGCAAGGCAC GAACGTCAGC TTCGACGGAA AGTTCCTCTG TATGCCGACT TCGATCACAC AGCTTGAGGA CCACATGCTG AAAGGCACGG TCAAGATCGA TCGCAACAAG CTCACCAGCA TCTGCGCGCG CAACGCGATC ATCCGCGAGG ATGGTTTCGG CAACCGGATG TTCGACAAAT CACGTTCGCG GGGCCGGATC GATGGGGTTG TCACCCTAGC GATGGCCGTA GGGTCAGCCA CCGCGGCGAT GAAAACGAAG TCCGGATTGG ACGACTATTT TGCTTCACTT GGGGGCGGCT AA
|
Protein sequence | MSSSERIDQT TRYARDVVSG KIVAGAFVRA QCQRHLDDLK HGPARGLQWD IEQAERAIRF FPAMLSITEG AKEGEPFKLL PWHLFVVGSI FGWRTAEGFI RFRFVWLETG KGQAKSPLMA AVGIYLSGFY GRKRAEVYCI GETKDTARVM FRDAVAMLRA PIPGKGGMTL EDAAFVIRGT GDLAYSVEHP DSGSFMRPIA NNDSVSGPKP ILVAGDEIHE MKSGKAIEMW RAAVTKKHGD SILMLGTNTP SADQQVGTDY SEFCQKVVTG DFTDDSVFAY IARVDEGDDP LEDESCWVKA LPALGITYPV DNVRKLVVTA KQQISTQLTT KRLYFGIPVG SAGFWTSEQA WKEVQGKVDD AKMIGRRAHL ALDLSEKNDL TALAVAWEGE RIDVKSWYWT REFEIEERST ADAIPYRELE AADLIEVTPG RVIDYTFIAA KIIDFCGRHS VVQMAIDSAH MEKLCEAFDK AGFAYWIEEG DDKPGSGLKI VRHKQGTNVS FDGKFLCMPT SITQLEDHML KGTVKIDRNK LTSICARNAI IREDGFGNRM FDKSRSRGRI DGVVTLAMAV GSATAAMKTK SGLDDYFASL GGG
|
| |