Gene Tery_0424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0424 
Symbol 
ID4241929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp666287 
End bp669340 
Gene Length3054 bp 
Protein Length1017 aa 
Translation table11 
GC content40% 
IMG OID638105743 
Producthemolysin-type calcium-binding region 
Protein accessionYP_720357 
Protein GI113474296 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.316008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.221114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCAA TCTTAAAAAA AGCTACAGAA TCAATAGAAA TCAACCAAAA TTTGCAGCAA 
GTGCGACAGC AGAAAAGATT AATGGTAATT GACACGAGAG TGGAAAATTA TCGCCAGTTA
ACTACAGGAG TTACTCCAGA AACAAAAGTA GCTTTAATTG ACCCTAAGCG TGATGGTATA
GAACAAATTA CCGAACTTTT ATCTAATTAC CCAACTAATA GTTTACATAT TGTTTGTCAC
GGCGACCCTG GCATTTTATA TTTGGGAAAA ACACCAATAA GCGAGCAGAA TCTTCCCAAA
TATACCGGAT ATCTCCAAGA ATGGGGAATT GTCGATATAT TACTCTATAG TTGTAATATT
GCCAGTCAAT CAGATAACTT GCTCCAACTT TTACATAAAT TCACTGGAGC CAATATTGCT
GCTTCAAAAC ATCAGGTGGG CAACCCATTA AAAGGGGGAA CTTGGGATTT AGAAAGCCGC
ATTGGTTCAG TTGCTTCAGA GATAGCTTTT TTACCACAAG TTATTCAAGA TTATCCTGGA
GTATTTCACG GGTTACAATT TTATGTCGGT CAAGATGAAA CAGGTAAAGT CAGAGCCTAT
CTTGGCAAAG ATGATAGGGT GAGTAACGCC TACACCAATA TATTTTCTGA TGACTATCTC
ACTTTAAGTC CAGTTGGGAT TTGGGATGAC CGGGAAATAG AAGAAAAAGA TACTGCCTTT
GACTATCGTT ATACTAATAA CGCATCCTAT TTTCCAGGCT TTACACCACA GATAGATATG
TTCTATGAAA CTCCTGTAGA CGGGAGTCCT GCTGATGATT CTTTCGATCG CGAACGCAAT
ACCTATTTCC AAGTCAACTT TCTAGATGAG CTGAAGGTAT GGAATGGTGA AGATTTTGTG
GCTACAGGTG GAGAGGTGAT GACCTGGTCT CAAGAAAACT GGGATGTTAA TGATAAAGGT
GAATGGTATG CAGAATTTGC CAATGAAGTC ACCACAGGGG AAGGTGTTGT AAAGGGAAAA
CCTTACTATT ATGCAGTATA TGACGAACCA GGTAATGACC ACTGGCATTA CGTCATGGAG
TTACAGGAAG GAACTTCACC TACAGGTATT GATGACGGGC TTTACTTGTT GCCAATTGAG
GTTGAGACTA ATATTCCTGA TAGCATTCCT TCCGATCCTG TTTATATTCT CTACAATCAA
AACCTCTCTG CTGCCCCCGA TGTAGACCAA GATACTATTC TGGCCGCTCA AAATTATCTG
CTAGAAGAGT ACGGAATGAC TGAACCTGCT CAAGAATTTA AAGCTTTCTT AGATAGCAGT
CAAAATGGAG TAGATTCAGA TGCTACAGGA CTTGCAACTT TTAGCCTGAA TGCAGAACAG
ACAGAAATAG AATATACCAT TGAATTAAAT GGACTTGACC TAATTGAAGA TCCTGTAAAG
CGTACAGCTG AGAACGCCAT TACCGAAATT CACTTTCACC ACCGCGGCTA CGGCACTGAT
GGCAGTCACG TTTTGAACGT CTTTGGGACA CCATCGGAAG ATGATGATAA CATTGAGATT
GATTACGAAA ATGAAAGGAT TATAGGTAAG TGGGATTGGT CTGATGCTGC TGCCAATTTT
GCTCGCCATT CTGAGCCTAA ATCTGATGGT GTTTATGGTG ACTGGAAAGC TCCTAAATTA
GAAATTGAAG TTGTTGAAGC CTACAATAAT GGTGCTTTAG GCATCAAATT GCGAGATGAA
GAACCTGAAG ATAAACTCAA CATTTTTGAG GGAACTCAGC TCAACTTTGA CAACGGTGCC
AGGGTTAAAA TAACAGAAAA CATTACCATA TCTAAGGCAA AAAGTAGTAT AGTTGAAGCA
AGCATTCTGG AAGGGGGCGA TATTCCAGCT GGGGAAATAG CCATTCTATC TCCTGCTCAA
ATGCCGGGAA CTACTCCCGT TACGACTTCT CTATTCAACT TATTTCATGG CAACCTCTAC
GTTCAGGTGC ATACCAACCA AAATCCTGAA CCTGGTGATA TTCGGGGCCA AGTATTACCT
GTGAGAACTT TAGCAGATGC CGAAGATGAT GTACTTTCTG GCGGTACAGG TAATGAATTA
TTTGAATTGG ATGCCGCTGA TGTAGGAGGT ATTGAGATTG AAGAATTGGC TGGTCGCGAA
ACTTTGACTC TTGAAGGTAT GGAAACTTCT ATGTCAAATT TGCATAGAGA AGATAACGAT
CTAATTATAG ATGTTAACCA AGATGGCAGT TTCCAAACAG CGGAAGACCT GACCCTCAAA
AACTTTTTTG CAGACGAAGT AGGAGTAGTT GGCAAAGGCT ATATTGAGTT AGTTGATGAG
ATTTCTGGTA ATGAAATTTT AAGAGGTGTT TCTTCTAGTG GGAAGGATTT AGTCCATGGA
ACTTCAGAAG ACGATATTCG GCAAGGTAAG GGGGGTAATG ATGTTATTTC TGGCTTTGGT
GGTAATGACG AACTCTATGG TAACCGGGGT GATGATCTTC TCAAAGGTGG TGATGGTAAC
GATATTCTCA AAGGTGGTTA TGAAAACGAT ACTTTAAAGG GGAATAGTGG TAATGATACC
TTAATTGGTT GGCAAGGTTT CGATATTTTG CTTGGTGGCA GTGGAGAAGA TAATCTGAAG
GGTGGTATCG GTCGCGATCG CCTGAATGGA GGAAAGGATG ACGATCTGCT CAGTGGTGGA
GCGAGTCAGG ATAAATTTAT CTTTGCTGCC AATAAAACAT TTAGTGAAGC AGATTTAGGT
GTGGATGAAA TTACTGATTT TGTATCTGGC CCAGATAAAA TTATCTTGGA TGTAACAGTG
TTCACTGCCA TTAAGACTAC TCCTGGAGAA AGTCTAGATG AAGGCGAATT TGCTGTGGTG
GATAATGAAA CTGATGTATT TAGCGCTGAT GCTACGATCG TTTATAACTC AGCCAATAGT
ACATTATATT ATAATCCTAA TGGAGTTGAA AATGGCTTGG GAGATGGGGG TAGCTTTGCC
CTGTTATCTA ATGAAGCATC TTTGAGTGCA GATGATTTCC TAGTCAGAGC TTGA
 
Protein sequence
MNSILKKATE SIEINQNLQQ VRQQKRLMVI DTRVENYRQL TTGVTPETKV ALIDPKRDGI 
EQITELLSNY PTNSLHIVCH GDPGILYLGK TPISEQNLPK YTGYLQEWGI VDILLYSCNI
ASQSDNLLQL LHKFTGANIA ASKHQVGNPL KGGTWDLESR IGSVASEIAF LPQVIQDYPG
VFHGLQFYVG QDETGKVRAY LGKDDRVSNA YTNIFSDDYL TLSPVGIWDD REIEEKDTAF
DYRYTNNASY FPGFTPQIDM FYETPVDGSP ADDSFDRERN TYFQVNFLDE LKVWNGEDFV
ATGGEVMTWS QENWDVNDKG EWYAEFANEV TTGEGVVKGK PYYYAVYDEP GNDHWHYVME
LQEGTSPTGI DDGLYLLPIE VETNIPDSIP SDPVYILYNQ NLSAAPDVDQ DTILAAQNYL
LEEYGMTEPA QEFKAFLDSS QNGVDSDATG LATFSLNAEQ TEIEYTIELN GLDLIEDPVK
RTAENAITEI HFHHRGYGTD GSHVLNVFGT PSEDDDNIEI DYENERIIGK WDWSDAAANF
ARHSEPKSDG VYGDWKAPKL EIEVVEAYNN GALGIKLRDE EPEDKLNIFE GTQLNFDNGA
RVKITENITI SKAKSSIVEA SILEGGDIPA GEIAILSPAQ MPGTTPVTTS LFNLFHGNLY
VQVHTNQNPE PGDIRGQVLP VRTLADAEDD VLSGGTGNEL FELDAADVGG IEIEELAGRE
TLTLEGMETS MSNLHREDND LIIDVNQDGS FQTAEDLTLK NFFADEVGVV GKGYIELVDE
ISGNEILRGV SSSGKDLVHG TSEDDIRQGK GGNDVISGFG GNDELYGNRG DDLLKGGDGN
DILKGGYEND TLKGNSGNDT LIGWQGFDIL LGGSGEDNLK GGIGRDRLNG GKDDDLLSGG
ASQDKFIFAA NKTFSEADLG VDEITDFVSG PDKIILDVTV FTAIKTTPGE SLDEGEFAVV
DNETDVFSAD ATIVYNSANS TLYYNPNGVE NGLGDGGSFA LLSNEASLSA DDFLVRA