Gene Tery_4191 details


Gene Information

Locus tag: Tery_4191
Symbol:
ID: 4245843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6461220
End bp: 6464168
Gene Length: 2949 bp
Protein Length: 982 aa
Translation table: 11
GC content: 38%
IMG OID: 638109090
Product: hemolysin-type calcium-binding region
Protein accession: YP_723668
Protein GI: 113477607
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2931] RTX toxins and related Ca2+-binding proteins
TIGRFAM ID:
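
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (both assumptions agree with the values listed in this record). The function names are hypothetical and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 6461220, 6464168

gene_length = feature_length(start_bp, end_bp)
print(gene_length)                           # 2949
print(expected_protein_length(gene_length))  # 982
```

Both results match the Gene Length (2949 bp) and Protein Length (982 aa) fields, which suggests the record is internally consistent under these assumptions.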


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.735923
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACAAA CAGTTAAACA ACAAGTCGAA CTAGCACCAA CAATTTTTAT TGAAAGTTTG 
GGAGAGGGAA CTCCTCTGAG CGACTCCATA ATTGGAAGTT CAAGAAGTGA TCTAATTAGA
GCATTGTCAG GAGATGATAC AGTTTTTGCT CTAGATGAAA ATGACTCCCT CCTTGGGGGT
GAAGGTCAAG ACTACATGAA CGGCAATGCC GGAAATGATT TTCTCGACGG AGGTCTGGGA
AATGATTGTT TACGGGGGGG GAAAGGTTTA GATATTTTCC TAGGAAATAG TGGTAGTGAT
ATATTATTTG GCGATCGCGA AAATGATACC ATATTGGGAA ATGAAGGTGA CGATAGTATT
TTTGGAGGTA AAGATAATGA TAGTATTATA GGTGGTGGAG AGGATGACTT ATTATTTGGC
GATCGAGGCA ATGACGTAGT TAATGGGGAT ATTGGCAATG ATACAATAGT TGGCGGTGTA
GGTGACGATC GCTTATTAGG AAATGATGGT GGAGATACTT TATTTGGAGA AACAGGTAGA
GATATTCTAG AAGGGAACGC AGGAGATGAT GTACTTTTTG GTGGCAAAGA TGCAGACTCT
CTAGAAGGAG GAGTTGGTGC AGACTTTCTC TCAGGAGACA TTGGTAAAGA TACACTTACA
GGTGGTATTG GACAAGATAC TTTTAATCTT GAACCCAACT CAGAAAGTCC AGGAGTTAGT
GGTGCAGACT TTATAAGTGA CTTCAGCAAT GAAGATTTAA TTAATCTTAG TAACGGTTTG
ACATTTGAAG ATCTAAGTAT TTTTCCATCC GAAGAAAACC CCGATAATAC AATTATTAGT
GTAGGTGGTA CTAATGGTGA CTTCCTAGCA GTATTAGCAG GAGTTGAAAG TAGCACTATT
GACTCTAGTA GTTTTGTAAC AGTCTCACTA CCCAGAGCAA CACCTCCTAG AGTTACACCA
ACACCCACAC CAATTGTAAC ACCAGCACCC ACGCCAATCG TAACACCAGC ACCCACACCA
AATATTGTAG AACCCATAAA CAACCCCGGT AACACAATTG TTAGTGTAGG TGGTACTAAC
AGTGACTTAT TAACAGTAAC AGTATCAATA CCCCTAGGAA TAAGTGCTGG GGTCACACCA
GCACCTGCGC CAGCACCAAT ACCTGCGCCA GCACCAGCAC CAACACCAAC ACCAGCACCA
ACACCAACAC CAACACCAAC ACCTTCTAAT TTGCCACCCA AAGCAGAGGA TGATAACTTC
ATAACTAATA AAGATGAACC ATTAACTATT ACTGCTGAAA AACTCCTAGA AAACGACTCT
GATCCAGAAG GAGATCCTTT CAGTATTTCA GCTATAGATA GACCACAAAA CGGATCGTTA
CTAAAAGAAA ATAGTATAAC TTATGTATAC ACTCCTAATC CTGCATTTAA GGGAAATGAT
AGTTTCAGCT ATAGTATTAG TGACGGGAAT AATAATTCTG ATACTGCCCA AGTTAATATC
AAAGTCAACG ATCCACCACA GTTAGCAAAC AACCGAGGTA TTATTCTTCT CAAAAATGGA
GCCAGACAAA ATATATTAAA TACTAATCTT TTAGTTACAG ATACCGACAA CACAATAGAA
GAAATTACCT ACAACCTTAT TAGAACACCA ATACAAGGTA ATCTTAGATT GGTTAATAAA
CGTCTTAATC AGAATGATAC TTTCTCTCAG GCTGATATTA ATAACAATTT AATTCTGTAT
ACTCCTGGTA ATACAGCAGG TAATTTTCCT TTTTTCTTTT CTGTCTCTGA TGGTGATGGC
GGAAGTATTG CCAGTACATC TTTTAGCATT AGAGTGGTTG ATAATATTAT TGAAAGAGGA
AATGGTAATA ATAACATTAA TGGTACTCAA GAAAGTGATT ACTTAATAGG AAATTCTGGC
AACGATACTC TCAGTGGTGG TAATGGAGAT GATATTTTAG ATGGCGGAGC TGGAGAAGAT
ATCTTATTAG GCGAAGCTGG TAACGACTCT TTATTTGGAG GAGAAGAAAC TGACACATTA
TTTGGAGGAA TAGGGGACGA TACTCTTGAT GGTGGTGCTG ATAATGACTC TTTGTTAGGA
CAAGGTGGCA ACGATCTATT ATTAGGAGCA GAACAACAAG ATACTCTTGA AGGTGGGCTA
GGTGATGATA CCCTTGACGG CGGTACAGAT AATGATTCCC TATCTGGAGG AGATGACAAC
GACTCCCTAT TGGGTAATCT AGGAAATGAT ACTCTTAGAG GTGATTCCGG TAACGATACT
ATTAATGGGG GTGCTGATAG TGACAGTATT CTAGGTGGCA TAGGTGATGA CTCTTTGTTT
GGTGGCTCAG GATCTGACAC TCTTGACGGT AATGAAGGAA ACGATTTCTT AAGTGGAACA
GAAGGTAACG ATTCTATTAA TGGAAGTTTA GGAGATGATA TATTAAATGG TGGATTTGGT
TCTGACAGAC TAACTGGTAA TCAAGGAGCT GATTCTTTCT TCTTTGAAGT ACCTAACGAA
GGAGTTGATG TGATTACTGA TTTTAATGGG GATAATGTAG ATGCATTTTT GTTTAGATCG
AAGAACTTTG GTAATTTAAC TAATCCTCCT GGTACTGCTA CAGAATTTTT CAGTGTAATA
GTTTCTCTGG ATAATTTAGG TTCACAAGGT CAAAATATTA GCGAACAAGA ATTGATTATT
TTTGAGAATA AATTTGAAAA TGTACAGCAA GTTAATTCTA TTTTGAAAAA TCAAAATGGT
TCTGGTACAA ATCCAGCTTT CTTTATTTAT GTCAATAATA GTTTAAATAG CAAAGTTATT
CTAGGATATG ACCCTAATCT TCAGGACGAC AATAGTCCTG CTTTTGATTT AGCAGTTATC
AATAATATTC CTGTTTTTTC TGATGCTATT AATACTATTA TTGATTCTTC AGACTTTAAG
TTTATTTGA
 
Protein sequence
MAQTVKQQVE LAPTIFIESL GEGTPLSDSI IGSSRSDLIR ALSGDDTVFA LDENDSLLGG 
EGQDYMNGNA GNDFLDGGLG NDCLRGGKGL DIFLGNSGSD ILFGDRENDT ILGNEGDDSI
FGGKDNDSII GGGEDDLLFG DRGNDVVNGD IGNDTIVGGV GDDRLLGNDG GDTLFGETGR
DILEGNAGDD VLFGGKDADS LEGGVGADFL SGDIGKDTLT GGIGQDTFNL EPNSESPGVS
GADFISDFSN EDLINLSNGL TFEDLSIFPS EENPDNTIIS VGGTNGDFLA VLAGVESSTI
DSSSFVTVSL PRATPPRVTP TPTPIVTPAP TPIVTPAPTP NIVEPINNPG NTIVSVGGTN
SDLLTVTVSI PLGISAGVTP APAPAPIPAP APAPTPTPAP TPTPTPTPSN LPPKAEDDNF
ITNKDEPLTI TAEKLLENDS DPEGDPFSIS AIDRPQNGSL LKENSITYVY TPNPAFKGND
SFSYSISDGN NNSDTAQVNI KVNDPPQLAN NRGIILLKNG ARQNILNTNL LVTDTDNTIE
EITYNLIRTP IQGNLRLVNK RLNQNDTFSQ ADINNNLILY TPGNTAGNFP FFFSVSDGDG
GSIASTSFSI RVVDNIIERG NGNNNINGTQ ESDYLIGNSG NDTLSGGNGD DILDGGAGED
ILLGEAGNDS LFGGEETDTL FGGIGDDTLD GGADNDSLLG QGGNDLLLGA EQQDTLEGGL
GDDTLDGGTD NDSLSGGDDN DSLLGNLGND TLRGDSGNDT INGGADSDSI LGGIGDDSLF
GGSGSDTLDG NEGNDFLSGT EGNDSINGSL GDDILNGGFG SDRLTGNQGA DSFFFEVPNE
GVDVITDFNG DNVDAFLFRS KNFGNLTNPP GTATEFFSVI VSLDNLGSQG QNISEQELII
FENKFENVQQ VNSILKNQNG SGTNPAFFIY VNNSLNSKVI LGYDPNLQDD NSPAFDLAVI
NNIPVFSDAI NTIIDSSDFK FI
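
The sequences above are printed in space-separated blocks of 10 characters, so they need light cleaning before use. The following is a minimal sketch of stripping that layout, computing a GC fraction, and translating the coding sequence. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCACAAA CAGTTAAACA ACAAGTCGAA CTAGCACCAA CAATTTTTAT TGAAAGTTTG"
print(translate(first_line))  # MAQTVKQQVELAPTIFIESL
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and complete protein sequence listed in the record.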