Gene Information |
Locus tag | Tery_0424 |
Symbol | |
ID | 4241929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 666287 |
End bp | 669340 |
Gene Length | 3054 bp |
Protein Length | 1017 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638105743 |
Product | hemolysin-type calcium-binding region |
Protein accession | YP_720357 |
Protein GI | 113474296 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
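The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 666287
end_bp = 669340
gene_length_bp = 3054
protein_length_aa = 1017

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final
# codon is a stop codon that does not appear in the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks should agree with the record; a mismatch in another record could point to a partial CDS or a different coordinate convention.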
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.316008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.221114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTCAA TCTTAAAAAA AGCTACAGAA TCAATAGAAA TCAACCAAAA TTTGCAGCAA GTGCGACAGC AGAAAAGATT AATGGTAATT GACACGAGAG TGGAAAATTA TCGCCAGTTA ACTACAGGAG TTACTCCAGA AACAAAAGTA GCTTTAATTG ACCCTAAGCG TGATGGTATA GAACAAATTA CCGAACTTTT ATCTAATTAC CCAACTAATA GTTTACATAT TGTTTGTCAC GGCGACCCTG GCATTTTATA TTTGGGAAAA ACACCAATAA GCGAGCAGAA TCTTCCCAAA TATACCGGAT ATCTCCAAGA ATGGGGAATT GTCGATATAT TACTCTATAG TTGTAATATT GCCAGTCAAT CAGATAACTT GCTCCAACTT TTACATAAAT TCACTGGAGC CAATATTGCT GCTTCAAAAC ATCAGGTGGG CAACCCATTA AAAGGGGGAA CTTGGGATTT AGAAAGCCGC ATTGGTTCAG TTGCTTCAGA GATAGCTTTT TTACCACAAG TTATTCAAGA TTATCCTGGA GTATTTCACG GGTTACAATT TTATGTCGGT CAAGATGAAA CAGGTAAAGT CAGAGCCTAT CTTGGCAAAG ATGATAGGGT GAGTAACGCC TACACCAATA TATTTTCTGA TGACTATCTC ACTTTAAGTC CAGTTGGGAT TTGGGATGAC CGGGAAATAG AAGAAAAAGA TACTGCCTTT GACTATCGTT ATACTAATAA CGCATCCTAT TTTCCAGGCT TTACACCACA GATAGATATG TTCTATGAAA CTCCTGTAGA CGGGAGTCCT GCTGATGATT CTTTCGATCG CGAACGCAAT ACCTATTTCC AAGTCAACTT TCTAGATGAG CTGAAGGTAT GGAATGGTGA AGATTTTGTG GCTACAGGTG GAGAGGTGAT GACCTGGTCT CAAGAAAACT GGGATGTTAA TGATAAAGGT GAATGGTATG CAGAATTTGC CAATGAAGTC ACCACAGGGG AAGGTGTTGT AAAGGGAAAA CCTTACTATT ATGCAGTATA TGACGAACCA GGTAATGACC ACTGGCATTA CGTCATGGAG TTACAGGAAG GAACTTCACC TACAGGTATT GATGACGGGC TTTACTTGTT GCCAATTGAG GTTGAGACTA ATATTCCTGA TAGCATTCCT TCCGATCCTG TTTATATTCT CTACAATCAA AACCTCTCTG CTGCCCCCGA TGTAGACCAA GATACTATTC TGGCCGCTCA AAATTATCTG CTAGAAGAGT ACGGAATGAC TGAACCTGCT CAAGAATTTA AAGCTTTCTT AGATAGCAGT CAAAATGGAG TAGATTCAGA TGCTACAGGA CTTGCAACTT TTAGCCTGAA TGCAGAACAG ACAGAAATAG AATATACCAT TGAATTAAAT GGACTTGACC TAATTGAAGA TCCTGTAAAG CGTACAGCTG AGAACGCCAT TACCGAAATT CACTTTCACC ACCGCGGCTA CGGCACTGAT GGCAGTCACG TTTTGAACGT CTTTGGGACA CCATCGGAAG ATGATGATAA CATTGAGATT GATTACGAAA ATGAAAGGAT TATAGGTAAG TGGGATTGGT CTGATGCTGC TGCCAATTTT GCTCGCCATT CTGAGCCTAA ATCTGATGGT GTTTATGGTG ACTGGAAAGC TCCTAAATTA GAAATTGAAG TTGTTGAAGC CTACAATAAT GGTGCTTTAG GCATCAAATT GCGAGATGAA GAACCTGAAG ATAAACTCAA CATTTTTGAG GGAACTCAGC TCAACTTTGA CAACGGTGCC 
AGGGTTAAAA TAACAGAAAA CATTACCATA TCTAAGGCAA AAAGTAGTAT AGTTGAAGCA AGCATTCTGG AAGGGGGCGA TATTCCAGCT GGGGAAATAG CCATTCTATC TCCTGCTCAA ATGCCGGGAA CTACTCCCGT TACGACTTCT CTATTCAACT TATTTCATGG CAACCTCTAC GTTCAGGTGC ATACCAACCA AAATCCTGAA CCTGGTGATA TTCGGGGCCA AGTATTACCT GTGAGAACTT TAGCAGATGC CGAAGATGAT GTACTTTCTG GCGGTACAGG TAATGAATTA TTTGAATTGG ATGCCGCTGA TGTAGGAGGT ATTGAGATTG AAGAATTGGC TGGTCGCGAA ACTTTGACTC TTGAAGGTAT GGAAACTTCT ATGTCAAATT TGCATAGAGA AGATAACGAT CTAATTATAG ATGTTAACCA AGATGGCAGT TTCCAAACAG CGGAAGACCT GACCCTCAAA AACTTTTTTG CAGACGAAGT AGGAGTAGTT GGCAAAGGCT ATATTGAGTT AGTTGATGAG ATTTCTGGTA ATGAAATTTT AAGAGGTGTT TCTTCTAGTG GGAAGGATTT AGTCCATGGA ACTTCAGAAG ACGATATTCG GCAAGGTAAG GGGGGTAATG ATGTTATTTC TGGCTTTGGT GGTAATGACG AACTCTATGG TAACCGGGGT GATGATCTTC TCAAAGGTGG TGATGGTAAC GATATTCTCA AAGGTGGTTA TGAAAACGAT ACTTTAAAGG GGAATAGTGG TAATGATACC TTAATTGGTT GGCAAGGTTT CGATATTTTG CTTGGTGGCA GTGGAGAAGA TAATCTGAAG GGTGGTATCG GTCGCGATCG CCTGAATGGA GGAAAGGATG ACGATCTGCT CAGTGGTGGA GCGAGTCAGG ATAAATTTAT CTTTGCTGCC AATAAAACAT TTAGTGAAGC AGATTTAGGT GTGGATGAAA TTACTGATTT TGTATCTGGC CCAGATAAAA TTATCTTGGA TGTAACAGTG TTCACTGCCA TTAAGACTAC TCCTGGAGAA AGTCTAGATG AAGGCGAATT TGCTGTGGTG GATAATGAAA CTGATGTATT TAGCGCTGAT GCTACGATCG TTTATAACTC AGCCAATAGT ACATTATATT ATAATCCTAA TGGAGTTGAA AATGGCTTGG GAGATGGGGG TAGCTTTGCC CTGTTATCTA ATGAAGCATC TTTGAGTGCA GATGATTTCC TAGTCAGAGC TTGA
|
Protein sequence | MNSILKKATE SIEINQNLQQ VRQQKRLMVI DTRVENYRQL TTGVTPETKV ALIDPKRDGI EQITELLSNY PTNSLHIVCH GDPGILYLGK TPISEQNLPK YTGYLQEWGI VDILLYSCNI ASQSDNLLQL LHKFTGANIA ASKHQVGNPL KGGTWDLESR IGSVASEIAF LPQVIQDYPG VFHGLQFYVG QDETGKVRAY LGKDDRVSNA YTNIFSDDYL TLSPVGIWDD REIEEKDTAF DYRYTNNASY FPGFTPQIDM FYETPVDGSP ADDSFDRERN TYFQVNFLDE LKVWNGEDFV ATGGEVMTWS QENWDVNDKG EWYAEFANEV TTGEGVVKGK PYYYAVYDEP GNDHWHYVME LQEGTSPTGI DDGLYLLPIE VETNIPDSIP SDPVYILYNQ NLSAAPDVDQ DTILAAQNYL LEEYGMTEPA QEFKAFLDSS QNGVDSDATG LATFSLNAEQ TEIEYTIELN GLDLIEDPVK RTAENAITEI HFHHRGYGTD GSHVLNVFGT PSEDDDNIEI DYENERIIGK WDWSDAAANF ARHSEPKSDG VYGDWKAPKL EIEVVEAYNN GALGIKLRDE EPEDKLNIFE GTQLNFDNGA RVKITENITI SKAKSSIVEA SILEGGDIPA GEIAILSPAQ MPGTTPVTTS LFNLFHGNLY VQVHTNQNPE PGDIRGQVLP VRTLADAEDD VLSGGTGNEL FELDAADVGG IEIEELAGRE TLTLEGMETS MSNLHREDND LIIDVNQDGS FQTAEDLTLK NFFADEVGVV GKGYIELVDE ISGNEILRGV SSSGKDLVHG TSEDDIRQGK GGNDVISGFG GNDELYGNRG DDLLKGGDGN DILKGGYEND TLKGNSGNDT LIGWQGFDIL LGGSGEDNLK GGIGRDRLNG GKDDDLLSGG ASQDKFIFAA NKTFSEADLG VDEITDFVSG PDKIILDVTV FTAIKTTPGE SLDEGEFAVV DNETDVFSAD ATIVYNSANS TLYYNPNGVE NGLGDGGSFA LLSNEASLSA DDFLVRA
|
| |