Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0419 |
Symbol | |
ID | 4241924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 659042 |
End bp | 662110 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638105740 |
Product | hemolysin-type calcium-binding region |
Protein accession | YP_720354 |
Protein GI | 113474293 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTCAA TCTTAAAAAA AGCTACAGAA TCAATAGAAA TCAACCAAAA TTTGCAGCAA GTGCGACAGC AGAAAAGATT AATGGTAATT GACACGAGAG TGGAAAATTA TCGCCAGTTA ACTACAGGAG TTACTCCAGA AACAAAAGTA GCTTTAATTG ACCCTAAGCG TGATGGTATA GAACAAATTA CCGAACTTTT ATCTAATTAC CCAACTAATA GTTTACATAT TGTTTGTCAC GGCGACCCTG GCATTTTATA TTTGGGAAAA ACACCAATAA GCGAGCAGAA TCTTCCCAAA TATACCGGAT ATCTCCAAGA ATGGGGAATT GTCGATATAT TACTCTATAG TTGTAATATT GCCAGTCAAT CAGATAACTT GCTCCAACTT TTACATAAAT TCACTGGAGC CAATATTGCT GCTTCAAAAC ATCAGGTAGG CAACCCATTA AAAGGGGGAA CTTGGGATTT AGAAAGTCGC ATTGGTTCAG TTGCTTCAGA GATAGCTTTT TTACCACAAG TTATTCAGGA TTATCCTGGA GTATTTCACG GCTTACAATT CTATCTTGGC CAAGATGAAA CAGGTAAAGT CAGAGCCTAT CTTGGCAAAG ATGATATGGT GAGTAACGGC TACACAAACA TATTTTCTAA TGACTATCTC ACTTTAGGTT CAGTTGGGAT TTCGGATCAG CGATTAGTAC CAGAATCAGA TACTGCCTTT GAGTATAGCT ATACTGATAA TGCATCCTCT TTTCCAGGGT TCACACCACA GGCATATATG TTCTATGAGA CTCCTTTAGA CGGGAGTCCC GCCGATGATT CTTTCGTTCC CGAACGCAAT ACCTATTTCC AAGTCAACTT TCTAGATGAA CTGAAGGTAT GGAATGGTGA AGATTTTGTG GCTACAGGTG GAGAGGTAAT GACCTGGTCT CAACAAAAAA AGTCAGTCAT CAACGAAGAT CAATTCAGGG AATTGGTAAA TGAAGTCACC ACAGGGGAAG GTGTTGTAAA GGGAGAACCT TACTATTATT CAGAAATTCG CGAGGCAGGT AGAGACCACT GGCATTACAT CATGGAATTA CAGGAAGGGA CTTCAACTAC AGGTATTGAT GACGGACTTT ACCTCTTGGA AGTCGAAGTT GAGACTAACG TTCCTGATAG TATTCCTTCC GATCCTATTT ATATTCTCTA CAATCAAAAC CTCTCTCCCG CCCCAGATGC AGACCAAAAT ACTATTCTGG AGGCTCAAAA CTATTTGCTG GAAAACTTCG GGATAACTGA ACCTGCTCAA GAATTTAAAG CTTTCTTAGA TAGCAGTCAA AACGATGGAG TAGAGTCAGA TGCTACAGGA ATTGCAACTT TTAGGCTGAA TGCAGACCAG ACAGAAATAG AATATACGAT TGAATTAAAT GGAATCGATC TAATTGAAGA CCCTGCAGAA CGTACAGCTG ATAATGCAGT CACCAAAATC AACCTTCACC ACCGCGACTA TGGTATTAAT GGGGATCACG TTTTTAATAT CTTTGGGGTG GCGTCCGAAG ATGACAATGA TATTGAGATC GACTACGAAA ATGAAACGAT TACAGGTAAA TGGGATTGGT CTGATGCTGC TACCAATTTT ACTCGCCATT CTGAGCCGAA ATCTGGTGGC ACTTATGGTG ATAGAAAAAG CCCTGAATTA AAAATTGAAG TTATTGAGGC TTACGATAAT AGTAATGGTA CTTTAGGTAT CAAGCTGCAA GATGGGGAAC CTCAAGATGA ACTTGACATT TTTGAGGGAA CTAAACTTAA ATTTGAGAAC GGTACTACAG TTGAGTTCAC AGAAAATATC ACTATATTTC AGGAATTTCA GGAAGAAAGC AGCACTGCTC AAGTAAGTCT CCTAGAAGGA GATGATATTT TGGATGGGGA AATAGCTATT TTACCTCCTG CTCAAAGGTC GGCAACTACT CCCGTGACTA CTTCTCTAGT TCACCTATTT CATGGGAATC TCTACGTTCA GGTACAAACA AAGCAAAATT TTGACCCTGG TGATATTCGG GGCCAAATAT TACCTGTGAG AACTTTAGCA GATGCCGAAG ATGATGTACT TTCTGGCGGT ACAGGTAATG AATTATTTGA ATTGGATGCC GCTGATGTAG GAGGTGTTGA GATTAAAGAA TTGGCTGGTC GCGAAACTTT GACTCTTGAA GGTATGGAAA CTTCTATGTC AAATTTGCAT AGAGAAGATA ACGATCTAAT TATAGATGTT AACCAAGATG GCAGTTTCCA AACAGCGGAG GACCTGACCC TCAAAAACTT TTTTGCAGAC GAAGTAGGAG TAGTTGGCAA AGGCTATATT GAGTTAGTTG ATGAGATTTC TGGTAATGAA ATTTTAAGAG GTGTTTCTTC TAGTGGGAAG GATTTAGTCC ATGGAACTTC AGAAGACGAT ATTCGGCAAG GTAAGGGGGG TAATGATGTT ATTTCTGGCT TTGGTGGTAA TGACGAACTC TATGGTAACC GGGGTGATGA TCTTCTCAAA GGTGGTGATG GTAACGATAT TCTCAAAGGT GGTTATGAAA ACGATACTTT AAAGGGGAAT AGTGGTAATG ATACCTTAAT TGGTTGGCAA GGTTTCGATA TTTTGCTTGG TGGCAGTGGA GAAGATAATC TGAAGGGTGG TATCGGTCGC GATCGCCTGA ATGGAGGAAA GGATGACGAT CTGCTCAGTG GTGGAGCGAG TCAGGATAAA TTTATCTTTG CTGCCAATAA AACATTTAGT GAAGCAGATT TAGGTGTGGA TGAAATTACT GATTTTGTAT CTGGCCCAGA TAAAATTATC TTGGATGTAA CAGTGTTCAC TGCCATTAAG ACTACTCCTG GAGAAAGTCT AGATGAAGGC GAATTTGCTG TGGTGGATAA TGAAACTGAT GTATTTAGCG CTGATGCTAC GATCGTTTAT AACTCAGCCA ATAGTACATT ATATTATAAT CCTAATGGAG TTGAAAATGG CTTGGGAGAT GGGGGTAGCT TTGCCCTGTT ATCTAATGAA GCATCTTTGA GTGCAGATGA TTTCCTAGTC AGAGCTTGA
|
Protein sequence | MNSILKKATE SIEINQNLQQ VRQQKRLMVI DTRVENYRQL TTGVTPETKV ALIDPKRDGI EQITELLSNY PTNSLHIVCH GDPGILYLGK TPISEQNLPK YTGYLQEWGI VDILLYSCNI ASQSDNLLQL LHKFTGANIA ASKHQVGNPL KGGTWDLESR IGSVASEIAF LPQVIQDYPG VFHGLQFYLG QDETGKVRAY LGKDDMVSNG YTNIFSNDYL TLGSVGISDQ RLVPESDTAF EYSYTDNASS FPGFTPQAYM FYETPLDGSP ADDSFVPERN TYFQVNFLDE LKVWNGEDFV ATGGEVMTWS QQKKSVINED QFRELVNEVT TGEGVVKGEP YYYSEIREAG RDHWHYIMEL QEGTSTTGID DGLYLLEVEV ETNVPDSIPS DPIYILYNQN LSPAPDADQN TILEAQNYLL ENFGITEPAQ EFKAFLDSSQ NDGVESDATG IATFRLNADQ TEIEYTIELN GIDLIEDPAE RTADNAVTKI NLHHRDYGIN GDHVFNIFGV ASEDDNDIEI DYENETITGK WDWSDAATNF TRHSEPKSGG TYGDRKSPEL KIEVIEAYDN SNGTLGIKLQ DGEPQDELDI FEGTKLKFEN GTTVEFTENI TIFQEFQEES STAQVSLLEG DDILDGEIAI LPPAQRSATT PVTTSLVHLF HGNLYVQVQT KQNFDPGDIR GQILPVRTLA DAEDDVLSGG TGNELFELDA ADVGGVEIKE LAGRETLTLE GMETSMSNLH REDNDLIIDV NQDGSFQTAE DLTLKNFFAD EVGVVGKGYI ELVDEISGNE ILRGVSSSGK DLVHGTSEDD IRQGKGGNDV ISGFGGNDEL YGNRGDDLLK GGDGNDILKG GYENDTLKGN SGNDTLIGWQ GFDILLGGSG EDNLKGGIGR DRLNGGKDDD LLSGGASQDK FIFAANKTFS EADLGVDEIT DFVSGPDKII LDVTVFTAIK TTPGESLDEG EFAVVDNETD VFSADATIVY NSANSTLYYN PNGVENGLGD GGSFALLSNE ASLSADDFLV RA
|
| |