Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4191 |
Symbol | |
ID | 4245843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6461220 |
End bp | 6464168 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638109090 |
Product | hemolysin-type calcium-binding region |
Protein accession | YP_723668 |
Protein GI | 113477607 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.735923 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAAA CAGTTAAACA ACAAGTCGAA CTAGCACCAA CAATTTTTAT TGAAAGTTTG GGAGAGGGAA CTCCTCTGAG CGACTCCATA ATTGGAAGTT CAAGAAGTGA TCTAATTAGA GCATTGTCAG GAGATGATAC AGTTTTTGCT CTAGATGAAA ATGACTCCCT CCTTGGGGGT GAAGGTCAAG ACTACATGAA CGGCAATGCC GGAAATGATT TTCTCGACGG AGGTCTGGGA AATGATTGTT TACGGGGGGG GAAAGGTTTA GATATTTTCC TAGGAAATAG TGGTAGTGAT ATATTATTTG GCGATCGCGA AAATGATACC ATATTGGGAA ATGAAGGTGA CGATAGTATT TTTGGAGGTA AAGATAATGA TAGTATTATA GGTGGTGGAG AGGATGACTT ATTATTTGGC GATCGAGGCA ATGACGTAGT TAATGGGGAT ATTGGCAATG ATACAATAGT TGGCGGTGTA GGTGACGATC GCTTATTAGG AAATGATGGT GGAGATACTT TATTTGGAGA AACAGGTAGA GATATTCTAG AAGGGAACGC AGGAGATGAT GTACTTTTTG GTGGCAAAGA TGCAGACTCT CTAGAAGGAG GAGTTGGTGC AGACTTTCTC TCAGGAGACA TTGGTAAAGA TACACTTACA GGTGGTATTG GACAAGATAC TTTTAATCTT GAACCCAACT CAGAAAGTCC AGGAGTTAGT GGTGCAGACT TTATAAGTGA CTTCAGCAAT GAAGATTTAA TTAATCTTAG TAACGGTTTG ACATTTGAAG ATCTAAGTAT TTTTCCATCC GAAGAAAACC CCGATAATAC AATTATTAGT GTAGGTGGTA CTAATGGTGA CTTCCTAGCA GTATTAGCAG GAGTTGAAAG TAGCACTATT GACTCTAGTA GTTTTGTAAC AGTCTCACTA CCCAGAGCAA CACCTCCTAG AGTTACACCA ACACCCACAC CAATTGTAAC ACCAGCACCC ACGCCAATCG TAACACCAGC ACCCACACCA AATATTGTAG AACCCATAAA CAACCCCGGT AACACAATTG TTAGTGTAGG TGGTACTAAC AGTGACTTAT TAACAGTAAC AGTATCAATA CCCCTAGGAA TAAGTGCTGG GGTCACACCA GCACCTGCGC CAGCACCAAT ACCTGCGCCA GCACCAGCAC CAACACCAAC ACCAGCACCA ACACCAACAC CAACACCAAC ACCTTCTAAT TTGCCACCCA AAGCAGAGGA TGATAACTTC ATAACTAATA AAGATGAACC ATTAACTATT ACTGCTGAAA AACTCCTAGA AAACGACTCT GATCCAGAAG GAGATCCTTT CAGTATTTCA GCTATAGATA GACCACAAAA CGGATCGTTA CTAAAAGAAA ATAGTATAAC TTATGTATAC ACTCCTAATC CTGCATTTAA GGGAAATGAT AGTTTCAGCT ATAGTATTAG TGACGGGAAT AATAATTCTG ATACTGCCCA AGTTAATATC AAAGTCAACG ATCCACCACA GTTAGCAAAC AACCGAGGTA TTATTCTTCT CAAAAATGGA GCCAGACAAA ATATATTAAA TACTAATCTT TTAGTTACAG ATACCGACAA CACAATAGAA GAAATTACCT ACAACCTTAT TAGAACACCA ATACAAGGTA ATCTTAGATT GGTTAATAAA CGTCTTAATC AGAATGATAC TTTCTCTCAG GCTGATATTA ATAACAATTT AATTCTGTAT ACTCCTGGTA ATACAGCAGG TAATTTTCCT TTTTTCTTTT CTGTCTCTGA TGGTGATGGC GGAAGTATTG CCAGTACATC TTTTAGCATT AGAGTGGTTG ATAATATTAT TGAAAGAGGA AATGGTAATA ATAACATTAA TGGTACTCAA GAAAGTGATT ACTTAATAGG AAATTCTGGC AACGATACTC TCAGTGGTGG TAATGGAGAT GATATTTTAG ATGGCGGAGC TGGAGAAGAT ATCTTATTAG GCGAAGCTGG TAACGACTCT TTATTTGGAG GAGAAGAAAC TGACACATTA TTTGGAGGAA TAGGGGACGA TACTCTTGAT GGTGGTGCTG ATAATGACTC TTTGTTAGGA CAAGGTGGCA ACGATCTATT ATTAGGAGCA GAACAACAAG ATACTCTTGA AGGTGGGCTA GGTGATGATA CCCTTGACGG CGGTACAGAT AATGATTCCC TATCTGGAGG AGATGACAAC GACTCCCTAT TGGGTAATCT AGGAAATGAT ACTCTTAGAG GTGATTCCGG TAACGATACT ATTAATGGGG GTGCTGATAG TGACAGTATT CTAGGTGGCA TAGGTGATGA CTCTTTGTTT GGTGGCTCAG GATCTGACAC TCTTGACGGT AATGAAGGAA ACGATTTCTT AAGTGGAACA GAAGGTAACG ATTCTATTAA TGGAAGTTTA GGAGATGATA TATTAAATGG TGGATTTGGT TCTGACAGAC TAACTGGTAA TCAAGGAGCT GATTCTTTCT TCTTTGAAGT ACCTAACGAA GGAGTTGATG TGATTACTGA TTTTAATGGG GATAATGTAG ATGCATTTTT GTTTAGATCG AAGAACTTTG GTAATTTAAC TAATCCTCCT GGTACTGCTA CAGAATTTTT CAGTGTAATA GTTTCTCTGG ATAATTTAGG TTCACAAGGT CAAAATATTA GCGAACAAGA ATTGATTATT TTTGAGAATA AATTTGAAAA TGTACAGCAA GTTAATTCTA TTTTGAAAAA TCAAAATGGT TCTGGTACAA ATCCAGCTTT CTTTATTTAT GTCAATAATA GTTTAAATAG CAAAGTTATT CTAGGATATG ACCCTAATCT TCAGGACGAC AATAGTCCTG CTTTTGATTT AGCAGTTATC AATAATATTC CTGTTTTTTC TGATGCTATT AATACTATTA TTGATTCTTC AGACTTTAAG TTTATTTGA
|
Protein sequence | MAQTVKQQVE LAPTIFIESL GEGTPLSDSI IGSSRSDLIR ALSGDDTVFA LDENDSLLGG EGQDYMNGNA GNDFLDGGLG NDCLRGGKGL DIFLGNSGSD ILFGDRENDT ILGNEGDDSI FGGKDNDSII GGGEDDLLFG DRGNDVVNGD IGNDTIVGGV GDDRLLGNDG GDTLFGETGR DILEGNAGDD VLFGGKDADS LEGGVGADFL SGDIGKDTLT GGIGQDTFNL EPNSESPGVS GADFISDFSN EDLINLSNGL TFEDLSIFPS EENPDNTIIS VGGTNGDFLA VLAGVESSTI DSSSFVTVSL PRATPPRVTP TPTPIVTPAP TPIVTPAPTP NIVEPINNPG NTIVSVGGTN SDLLTVTVSI PLGISAGVTP APAPAPIPAP APAPTPTPAP TPTPTPTPSN LPPKAEDDNF ITNKDEPLTI TAEKLLENDS DPEGDPFSIS AIDRPQNGSL LKENSITYVY TPNPAFKGND SFSYSISDGN NNSDTAQVNI KVNDPPQLAN NRGIILLKNG ARQNILNTNL LVTDTDNTIE EITYNLIRTP IQGNLRLVNK RLNQNDTFSQ ADINNNLILY TPGNTAGNFP FFFSVSDGDG GSIASTSFSI RVVDNIIERG NGNNNINGTQ ESDYLIGNSG NDTLSGGNGD DILDGGAGED ILLGEAGNDS LFGGEETDTL FGGIGDDTLD GGADNDSLLG QGGNDLLLGA EQQDTLEGGL GDDTLDGGTD NDSLSGGDDN DSLLGNLGND TLRGDSGNDT INGGADSDSI LGGIGDDSLF GGSGSDTLDG NEGNDFLSGT EGNDSINGSL GDDILNGGFG SDRLTGNQGA DSFFFEVPNE GVDVITDFNG DNVDAFLFRS KNFGNLTNPP GTATEFFSVI VSLDNLGSQG QNISEQELII FENKFENVQQ VNSILKNQNG SGTNPAFFIY VNNSLNSKVI LGYDPNLQDD NSPAFDLAVI NNIPVFSDAI NTIIDSSDFK FI
|
| |