Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2809 |
Symbol | |
ID | 4245343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4358134 |
End bp | 4361091 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638107861 |
Product | tetratricopeptide TPR_2 |
Protein accession | YP_722458 |
Protein GI | 113476397 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00683217 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAAA AATATCAAAT ACTTGAAAAA ATACAGGGAT TATCTATTTA TCAATTAAGT AATACTTACT ATGAAAAATC TGGTAAATTA ATCACTGTTA AACCTCAAGA AAAGAAAAAT ATTTTTCTGA AAAATAGTAT AGAAATCATC AAAAAAATTA CTAGGTTGAT GAAAATTAAC TCTCCCCAAA ATAAACTAAA TTTATTAGAA AATAACTACT CGCCTCAACC AGAAAAAAAA TTAACGGTGA GAGAAAAAGC TAACCTCCGC CTCGCCTATT ATTGGTTAAA AATATATCAA CCAGAACCAG GTGCATCTAA CTTGGAAAAA GTCAAAGGAT ATCTGCAAGC CTTCCGTCAT TTTGCCAAAA TGGGAGAGTG GGAAAAAGCT AGTCAAATTC TTTCTCTTAA ATTACCGCTA GTTGATGAAA ATTTAGACAC TCAACTAAAT AGATGGGGTT ACTATGAGGA ATGTATGGAA CTCTACAATA AATTATTAGG TAAATTAGAC CAAAGCTGGG ATGGTATTTG TTTAAATGGT TTGGGAAATG CTTACTTATC TCTCGGAAAA TATCACAAAG CTATTGAGCA TTATCAGCAG CACTTACAAA TAGCTAAGGA AATAGGTGAT CTTGGGGGAC AGGGTATTGC TTTAGGGAAT TTGGGAGATG CTTACCATTC TCTAGAAGAA TATAACAAAG CTATTGAGTA TCATCAGCAG CATTTACAAA TAGCGAAGGA AACAGGGGAA CTCGGGGGAG AAGGTATTGC TTTAGGGAAT TTGGGAAGTG CTTACTATTC CCTGGGAAAA TATCACAAAG CTATTGAGTG TCATCAGCAG CATTTACAAA TAACAAAGGA AATAGGGAAT ATGAAACAGC AGGGCATTGC TTTAGGCAAT TTAGGGAATG CTTACCATTC CCTAGGAGCA TATCACAAAG CTATGGAATA TCATCAACAG GACTTAGAAA TAGTTAGGAA AATAGGCGAT CTTGGGGGTG AGGGCATTGC TTTAGGGAAT TTGGGAAGTG CTTACTATTC CTTGGGAGAA TATCACAAAG CTATTGAGTC TCATCAGCAG CATTTACAAA TAGCTAGAAA AATAGGCAAT GTTAAACAGG AGGGTATTGC TTTGGGGAAT TTGGGTAATG CTTACCATTC CCTAGGAGCA TATCACAAAG CCATGGAATA TCATCAACAG GACTTAGAAA TAGTTAGGAA AATAGGGGAT CCTGGGGGTG AGGGTAATGC TTTGGGAAAT TTGGGTAATG CTTACTATTT TTTGGGGGAA TATCACAAAG CTATTGAACA TCATCAGCAG CATTTACAGA TAGCCAGAGA AATAGGTAGC CAGAAAGGAG AGGGTAATGC TTTGGGAAAT TTGGGTAATG CTTACTATTC TTTGGGGGAA TATCACAAAG CTATTGAGCA TCATCAGCAG CATTTACAAA TAGCCAGAGA AATAGGCAAC CAGAATAGAG AGGGTAATGC TTTGGGGAAT TTGGGTAATG CTTACTATTC TTTGGGAGAA TATCACAAAG CTATTGAGCA TCATCAGCAG CATTTACAAA TAGCCAAAGA AATAGGCGAT CGCTCTGGAG AAGGAAATGC TTTGGAGAAT TTGGGTAATG CTTACTATTC CAGCAGAAAA TACGACAAAG CTATTGAACA TCATCAGCAG TATTTACAAA TAATTAGAGA AATAGGCGAT CGCTCTGGAG AAGGAAATGC TTTGAGGAAT TTGGGTAATG CTTACTATTC TAGCAGAAAA TACGACAAAG CTATTGAAGA TCATCAGCAA TATTTACAAA TAGCCCAAGA AATTGGCGAT CGCTCTGGAG AAGGAAATGC TTTGGAGAAT TTGGGAAATG CTTACTATTC CAGCAGAAAA TACGACAAAG CTATTGAACA TCATCAGCAG TATTTACAAA TAGCCCAAGA AATTGGCGAT CGCTCTGGAG AAGGAAATGC TTTGGGGAAT TTGGGTAATG CTTACTGTTC CCTAGGAGAA TATTACAAGG CTATTGAACA TCATCAGCAG CATTTACAAA TAGCCCAAGA AGTTGGCGAT CGCTCTGGAG AAGGAAATGC TTTGGGGAAT TTGGGTAATG TTTACTATTC TCTAGGAGAA TATCACAAAG CTATTGAACA TCATCAGCAG CATTTACAAA TAGCCCAAGA AATTGGCGAT CGCTCTGGAG AAGGAAATGC TTTGGGAAAT TTGGGAAATG CTTACTGTTC CCTAGGAGAA TATCACAAAG CTATTGAGCA TCATCAGCAG CATTTACAAA TAGCCCAAGA AATAGGCGAT CGCTATGAGG AGGGTAGTGC TTTGGGGAAT TTGGGTAATA CTTATGATTT TTTGGGAGAA TACGACAAAG CTATGGAGTA TCATCAGCAG CATTTACAAA TAGCTAGGGA AATTGGCGAT CGCTCTGGAG AGGGTAATGC TTTGGGGAAT TTGGGAAATG CTTACTATTC CCTGGAAGCA TATCACGAAG CTATTGACCA TCATCAGCGG CATTTACAAA TAGCAAAGGA AACAGGCAAC TTTAGAGGGG AGGGTGTTGC TTTGGGGAAT TTAGGAAATG TTTACTTATC CCTAGGAGAA TATTACAAGG CGATCGAGTC TTATCAGCAG AGCTTAGAAA TAGCTAGAGA AATAGGTAAC CGTTCTGGGG AAGGTGGTGC TTTGGGAAAC TGGGGAAAAA CTCTCCTCAA ATTAGAAAAG TATGCCGACT CTTTGGAATA TTCACAAGCA GCATTAGAGA TATTTGAGCA GATAGGAAAC CCTCATCACC AAGCAATAGT TCTGAAGAAT ATAGCAGAAA CCTATCAAAA TTTGGGGTTT TGGGATATGG CACGGCGGCA TTGCGAGGAG GCGTTGGCTA TTTTCACAGA ATTAGGAGTG CCAGAGTTGA GAGAATGTCA GGAGTTGTTT GAGGTTCTTG AAAAATAA
|
Protein sequence | MDKKYQILEK IQGLSIYQLS NTYYEKSGKL ITVKPQEKKN IFLKNSIEII KKITRLMKIN SPQNKLNLLE NNYSPQPEKK LTVREKANLR LAYYWLKIYQ PEPGASNLEK VKGYLQAFRH FAKMGEWEKA SQILSLKLPL VDENLDTQLN RWGYYEECME LYNKLLGKLD QSWDGICLNG LGNAYLSLGK YHKAIEHYQQ HLQIAKEIGD LGGQGIALGN LGDAYHSLEE YNKAIEYHQQ HLQIAKETGE LGGEGIALGN LGSAYYSLGK YHKAIECHQQ HLQITKEIGN MKQQGIALGN LGNAYHSLGA YHKAMEYHQQ DLEIVRKIGD LGGEGIALGN LGSAYYSLGE YHKAIESHQQ HLQIARKIGN VKQEGIALGN LGNAYHSLGA YHKAMEYHQQ DLEIVRKIGD PGGEGNALGN LGNAYYFLGE YHKAIEHHQQ HLQIAREIGS QKGEGNALGN LGNAYYSLGE YHKAIEHHQQ HLQIAREIGN QNREGNALGN LGNAYYSLGE YHKAIEHHQQ HLQIAKEIGD RSGEGNALEN LGNAYYSSRK YDKAIEHHQQ YLQIIREIGD RSGEGNALRN LGNAYYSSRK YDKAIEDHQQ YLQIAQEIGD RSGEGNALEN LGNAYYSSRK YDKAIEHHQQ YLQIAQEIGD RSGEGNALGN LGNAYCSLGE YYKAIEHHQQ HLQIAQEVGD RSGEGNALGN LGNVYYSLGE YHKAIEHHQQ HLQIAQEIGD RSGEGNALGN LGNAYCSLGE YHKAIEHHQQ HLQIAQEIGD RYEEGSALGN LGNTYDFLGE YDKAMEYHQQ HLQIAREIGD RSGEGNALGN LGNAYYSLEA YHEAIDHHQR HLQIAKETGN FRGEGVALGN LGNVYLSLGE YYKAIESYQQ SLEIAREIGN RSGEGGALGN WGKTLLKLEK YADSLEYSQA ALEIFEQIGN PHHQAIVLKN IAETYQNLGF WDMARRHCEE ALAIFTELGV PELRECQELF EVLEK
|
| |