Gene Tery_0419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0419 
Symbol 
ID4241924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp659042 
End bp662110 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content40% 
IMG OID638105740 
Producthemolysin-type calcium-binding region 
Protein accessionYP_720354 
Protein GI113474293 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCAA TCTTAAAAAA AGCTACAGAA TCAATAGAAA TCAACCAAAA TTTGCAGCAA 
GTGCGACAGC AGAAAAGATT AATGGTAATT GACACGAGAG TGGAAAATTA TCGCCAGTTA
ACTACAGGAG TTACTCCAGA AACAAAAGTA GCTTTAATTG ACCCTAAGCG TGATGGTATA
GAACAAATTA CCGAACTTTT ATCTAATTAC CCAACTAATA GTTTACATAT TGTTTGTCAC
GGCGACCCTG GCATTTTATA TTTGGGAAAA ACACCAATAA GCGAGCAGAA TCTTCCCAAA
TATACCGGAT ATCTCCAAGA ATGGGGAATT GTCGATATAT TACTCTATAG TTGTAATATT
GCCAGTCAAT CAGATAACTT GCTCCAACTT TTACATAAAT TCACTGGAGC CAATATTGCT
GCTTCAAAAC ATCAGGTAGG CAACCCATTA AAAGGGGGAA CTTGGGATTT AGAAAGTCGC
ATTGGTTCAG TTGCTTCAGA GATAGCTTTT TTACCACAAG TTATTCAGGA TTATCCTGGA
GTATTTCACG GCTTACAATT CTATCTTGGC CAAGATGAAA CAGGTAAAGT CAGAGCCTAT
CTTGGCAAAG ATGATATGGT GAGTAACGGC TACACAAACA TATTTTCTAA TGACTATCTC
ACTTTAGGTT CAGTTGGGAT TTCGGATCAG CGATTAGTAC CAGAATCAGA TACTGCCTTT
GAGTATAGCT ATACTGATAA TGCATCCTCT TTTCCAGGGT TCACACCACA GGCATATATG
TTCTATGAGA CTCCTTTAGA CGGGAGTCCC GCCGATGATT CTTTCGTTCC CGAACGCAAT
ACCTATTTCC AAGTCAACTT TCTAGATGAA CTGAAGGTAT GGAATGGTGA AGATTTTGTG
GCTACAGGTG GAGAGGTAAT GACCTGGTCT CAACAAAAAA AGTCAGTCAT CAACGAAGAT
CAATTCAGGG AATTGGTAAA TGAAGTCACC ACAGGGGAAG GTGTTGTAAA GGGAGAACCT
TACTATTATT CAGAAATTCG CGAGGCAGGT AGAGACCACT GGCATTACAT CATGGAATTA
CAGGAAGGGA CTTCAACTAC AGGTATTGAT GACGGACTTT ACCTCTTGGA AGTCGAAGTT
GAGACTAACG TTCCTGATAG TATTCCTTCC GATCCTATTT ATATTCTCTA CAATCAAAAC
CTCTCTCCCG CCCCAGATGC AGACCAAAAT ACTATTCTGG AGGCTCAAAA CTATTTGCTG
GAAAACTTCG GGATAACTGA ACCTGCTCAA GAATTTAAAG CTTTCTTAGA TAGCAGTCAA
AACGATGGAG TAGAGTCAGA TGCTACAGGA ATTGCAACTT TTAGGCTGAA TGCAGACCAG
ACAGAAATAG AATATACGAT TGAATTAAAT GGAATCGATC TAATTGAAGA CCCTGCAGAA
CGTACAGCTG ATAATGCAGT CACCAAAATC AACCTTCACC ACCGCGACTA TGGTATTAAT
GGGGATCACG TTTTTAATAT CTTTGGGGTG GCGTCCGAAG ATGACAATGA TATTGAGATC
GACTACGAAA ATGAAACGAT TACAGGTAAA TGGGATTGGT CTGATGCTGC TACCAATTTT
ACTCGCCATT CTGAGCCGAA ATCTGGTGGC ACTTATGGTG ATAGAAAAAG CCCTGAATTA
AAAATTGAAG TTATTGAGGC TTACGATAAT AGTAATGGTA CTTTAGGTAT CAAGCTGCAA
GATGGGGAAC CTCAAGATGA ACTTGACATT TTTGAGGGAA CTAAACTTAA ATTTGAGAAC
GGTACTACAG TTGAGTTCAC AGAAAATATC ACTATATTTC AGGAATTTCA GGAAGAAAGC
AGCACTGCTC AAGTAAGTCT CCTAGAAGGA GATGATATTT TGGATGGGGA AATAGCTATT
TTACCTCCTG CTCAAAGGTC GGCAACTACT CCCGTGACTA CTTCTCTAGT TCACCTATTT
CATGGGAATC TCTACGTTCA GGTACAAACA AAGCAAAATT TTGACCCTGG TGATATTCGG
GGCCAAATAT TACCTGTGAG AACTTTAGCA GATGCCGAAG ATGATGTACT TTCTGGCGGT
ACAGGTAATG AATTATTTGA ATTGGATGCC GCTGATGTAG GAGGTGTTGA GATTAAAGAA
TTGGCTGGTC GCGAAACTTT GACTCTTGAA GGTATGGAAA CTTCTATGTC AAATTTGCAT
AGAGAAGATA ACGATCTAAT TATAGATGTT AACCAAGATG GCAGTTTCCA AACAGCGGAG
GACCTGACCC TCAAAAACTT TTTTGCAGAC GAAGTAGGAG TAGTTGGCAA AGGCTATATT
GAGTTAGTTG ATGAGATTTC TGGTAATGAA ATTTTAAGAG GTGTTTCTTC TAGTGGGAAG
GATTTAGTCC ATGGAACTTC AGAAGACGAT ATTCGGCAAG GTAAGGGGGG TAATGATGTT
ATTTCTGGCT TTGGTGGTAA TGACGAACTC TATGGTAACC GGGGTGATGA TCTTCTCAAA
GGTGGTGATG GTAACGATAT TCTCAAAGGT GGTTATGAAA ACGATACTTT AAAGGGGAAT
AGTGGTAATG ATACCTTAAT TGGTTGGCAA GGTTTCGATA TTTTGCTTGG TGGCAGTGGA
GAAGATAATC TGAAGGGTGG TATCGGTCGC GATCGCCTGA ATGGAGGAAA GGATGACGAT
CTGCTCAGTG GTGGAGCGAG TCAGGATAAA TTTATCTTTG CTGCCAATAA AACATTTAGT
GAAGCAGATT TAGGTGTGGA TGAAATTACT GATTTTGTAT CTGGCCCAGA TAAAATTATC
TTGGATGTAA CAGTGTTCAC TGCCATTAAG ACTACTCCTG GAGAAAGTCT AGATGAAGGC
GAATTTGCTG TGGTGGATAA TGAAACTGAT GTATTTAGCG CTGATGCTAC GATCGTTTAT
AACTCAGCCA ATAGTACATT ATATTATAAT CCTAATGGAG TTGAAAATGG CTTGGGAGAT
GGGGGTAGCT TTGCCCTGTT ATCTAATGAA GCATCTTTGA GTGCAGATGA TTTCCTAGTC
AGAGCTTGA
 
Protein sequence
MNSILKKATE SIEINQNLQQ VRQQKRLMVI DTRVENYRQL TTGVTPETKV ALIDPKRDGI 
EQITELLSNY PTNSLHIVCH GDPGILYLGK TPISEQNLPK YTGYLQEWGI VDILLYSCNI
ASQSDNLLQL LHKFTGANIA ASKHQVGNPL KGGTWDLESR IGSVASEIAF LPQVIQDYPG
VFHGLQFYLG QDETGKVRAY LGKDDMVSNG YTNIFSNDYL TLGSVGISDQ RLVPESDTAF
EYSYTDNASS FPGFTPQAYM FYETPLDGSP ADDSFVPERN TYFQVNFLDE LKVWNGEDFV
ATGGEVMTWS QQKKSVINED QFRELVNEVT TGEGVVKGEP YYYSEIREAG RDHWHYIMEL
QEGTSTTGID DGLYLLEVEV ETNVPDSIPS DPIYILYNQN LSPAPDADQN TILEAQNYLL
ENFGITEPAQ EFKAFLDSSQ NDGVESDATG IATFRLNADQ TEIEYTIELN GIDLIEDPAE
RTADNAVTKI NLHHRDYGIN GDHVFNIFGV ASEDDNDIEI DYENETITGK WDWSDAATNF
TRHSEPKSGG TYGDRKSPEL KIEVIEAYDN SNGTLGIKLQ DGEPQDELDI FEGTKLKFEN
GTTVEFTENI TIFQEFQEES STAQVSLLEG DDILDGEIAI LPPAQRSATT PVTTSLVHLF
HGNLYVQVQT KQNFDPGDIR GQILPVRTLA DAEDDVLSGG TGNELFELDA ADVGGVEIKE
LAGRETLTLE GMETSMSNLH REDNDLIIDV NQDGSFQTAE DLTLKNFFAD EVGVVGKGYI
ELVDEISGNE ILRGVSSSGK DLVHGTSEDD IRQGKGGNDV ISGFGGNDEL YGNRGDDLLK
GGDGNDILKG GYENDTLKGN SGNDTLIGWQ GFDILLGGSG EDNLKGGIGR DRLNGGKDDD
LLSGGASQDK FIFAANKTFS EADLGVDEIT DFVSGPDKII LDVTVFTAIK TTPGESLDEG
EFAVVDNETD VFSADATIVY NSANSTLYYN PNGVENGLGD GGSFALLSNE ASLSADDFLV
RA