Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1947 |
Symbol | |
ID | 4244371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3018030 |
End bp | 3019724 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638107067 |
Product | tetratricopeptide TPR_2 |
Protein accession | YP_721674 |
Protein GI | 113475613 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0106408 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAAA CTCATGCTAA ATCATCGGAA AAATTGACAT TTTTAACTAC ATTAAATACT AAAAATAATC AAACTATTTC CTCTTCTAAG GGAATTTTTA GCATCTTTTC TGCGTTCCCA GAATATGACC GTGGTAACAG ACTGTATGAA ATGGGTAGAT ACGAGTCAGC TATTCCCTAC TACGAAAATG CAGTGAAAAT AAAGCCAGAC TGGGCTATAG GTTGGTTAAA ACTAGCTGAG GCCTTATCTA AATTGCAAAA GTATGAACAA GCAGTAGAGG CTTATAAAAG ATCCCTATCT CTCAAACAAA ACGCTCATCA AGCTTGGCAT AGTTATGGAG TTGTATTATC TAATTTAAAG CAGTATGAGC AAGCGATCGC TTGCTTTGAC AAAGCAATTA AAATTAATCC AAATGATTAT CAATCATGGT TTAATAAAGC AATTATTTTA AGCGAATTAA AACAAGATTT ACCTGCGATA TACTGCTACA AAGAAGCACT AAAAATACAA CCTATGAAGG GAGAAATTTG GTATGGTCAA GGTCAAGCAT TATTAAATGT GCAAAAATAT GCTGAAGCAT TAGCAGCTTA TGATTGTGCT GCGAAGCTGC AACCTGATAA TTATGATATT TGGTTTAAGA GAGGATTAGC TTTATTTCAA ACTCAACGTT ATGCAGAAGC AGTTATCAGT TATGGCCACG CTATAGAATT ACAACCAGAG AATTATCTAG GTTGGTTTAA CTTAGGTATT GCTCAAAGTA AACTACATAA ATATCACGAT GCAGTCTCTT CTTTTAATAA GGCAATTAAA TTAAATCCTG ATGATTATGA AGCTTGGTAT TATAAAGGAT TAGCTTTAAA AAATCATTGG AAAGAAGGAG GAGTTGCTTG TTTAGATAAG GCAATTAATT TTAACCCTAA TTTACCAGAA ATTTGGATTA GTCGTGGTTA TATTTTATTA GATTTATTTA AATATCGTGA GGCATTAGAG TCTTTTAATA AGGCAATTAC AATTAACTCT AATTATCCCG AATCTTGGTT AGGTAGAGGT AAAGCATGGA TGGCTCTAGG TAAATATAAT GAAGCTCTTA TTGCTTATGG TAATGCTGTT AGTATTGAGC CATATTTTTT AGAGGCTTGG AATTGTCGAG GTGAAGCATT AGAAAGAGTC CAAAATTATG ATCAAGCATT GGCAGCTTAT GACAAAGTGA TAAAAATGAG TTTTGAGCAA GGAGTTTCTG TTGCTAAAGT AGGTTTACAG AGAGGAGCAG CTTTAGAAAA GTTAGAGCGA TATCCTGAAG CAATAGAAGC TTATAATTTG GTAATTGAAA AACAACCAAA TAATTTTGAT GGTTGGTTAA ACCGGGGATT AAACTTAGAA AAAATGGCAA ATTATGAAGA AGCTGTTTTG AGTTATAGTC GAGCTATTAG TATATGGCCT AGTAATTATC AAGCTTGGTT ACAATTAGCT TTAATGCTGG AAAAATTAGA GAGGTTAGAT GAAGCGATCG TTGCCTATAA CAAAATCATT TCTTTAAGGC CTGGTAATCA TGAAACTTGG TTGAAAAGAG GATTAATTCT GGAAAGATTA GGATATGTTC AAGAAGCTGT TAGTTCTTAC AAAATTGTAT TAGAAATTAA ACCTGACTAT CACGAAGCAA TTGAAAGAAA AAAACGATTA GAATTAACTG TTTAA
|
Protein sequence | MSKTHAKSSE KLTFLTTLNT KNNQTISSSK GIFSIFSAFP EYDRGNRLYE MGRYESAIPY YENAVKIKPD WAIGWLKLAE ALSKLQKYEQ AVEAYKRSLS LKQNAHQAWH SYGVVLSNLK QYEQAIACFD KAIKINPNDY QSWFNKAIIL SELKQDLPAI YCYKEALKIQ PMKGEIWYGQ GQALLNVQKY AEALAAYDCA AKLQPDNYDI WFKRGLALFQ TQRYAEAVIS YGHAIELQPE NYLGWFNLGI AQSKLHKYHD AVSSFNKAIK LNPDDYEAWY YKGLALKNHW KEGGVACLDK AINFNPNLPE IWISRGYILL DLFKYREALE SFNKAITINS NYPESWLGRG KAWMALGKYN EALIAYGNAV SIEPYFLEAW NCRGEALERV QNYDQALAAY DKVIKMSFEQ GVSVAKVGLQ RGAALEKLER YPEAIEAYNL VIEKQPNNFD GWLNRGLNLE KMANYEEAVL SYSRAISIWP SNYQAWLQLA LMLEKLERLD EAIVAYNKII SLRPGNHETW LKRGLILERL GYVQEAVSSY KIVLEIKPDY HEAIERKKRL ELTV
|
| |