Gene Tery_1835 details


Gene Information

Locus tag: Tery_1835
Symbol:
ID: 4241911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 2804987
End bp: 2811175
Gene Length: 6189 bp
Protein Length: 2062 aa
Translation table: 11
GC content: 43%
IMG OID: 638106956
Product: peptidyl-Asp metallopeptidase. metallo peptidase. MEROPS family M72
Protein accession: YP_721564
Protein GI: 113475503
COG category:
COG ID:
TIGRFAM ID:
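
The length fields in this record follow from the coordinates. With 1-based inclusive coordinates, End bp - Start bp + 1 = 2811175 - 2804987 + 1 = 6189 bp. A 6189 bp CDS holds 2063 codons, one of which is the stop codon, leaving 2062 aa. The Python sketch below reproduces that arithmetic. It assumes inclusive 1-based coordinates and a single terminal stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS that ends with exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(2804987, 2811175)
print(length_bp)                  # 6189
print(protein_length(length_bp))  # 2062
```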


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATCCTA TATCTGAAGA ATTAAACAAC TTTAGACAAG GTTTTACTTT TACAGGTACA 
GGCGAAGTGT TAGCCGAGGT AGATAAATTC GGTGATGAGC TGCGATTTTC TAGTGGTCTT
GAATTAAGTC CAGCTAGCGA GGAACTAAAA ATTGAAAATA TAGATACATC TAATGCTGAT
GAAAATTTCT TGTTGCCCAA GAGTAATTTG GGAAATTTAG ATTTAAATAT AGAGGACTCA
GAAGATAGGA CTTTAAAGAC TGAAACAGAC CCACTGATAG GACAAGACAA TGATCCCGTT
CAAGAAGTCG ACTCTTTAAT TAGTGGGACA TCTGATCGGG TAAACAATAT ACAAACTAGG
GCTGCTAAAA AGCGGGATAA AGCAGGTAAT AGTCGAGGAA AAGCCCACAA CTTAGGAGTT
TTAGAGGATG ATCAAAAGTT ACAGGAGTTT GTCGGTAAGT CTGACCGGAA GGACTTTTAT
AAGTTTAAGG TTAAGAACAA GACGGATATA GATATTGAGT TGGGCGGCCT CAGTGGAAAT
GCGGATTTGT ATTTGCTGAA TAATAGGGGG AAAGTTATTG AGAAGTCTAA TAAGGGTGGT
AAAAGGGCAG AGGATATTGA GCGGACTTTG AACCCAGGGA CTTATTATGT GAGGGTGCAG
TCAAAAGGTA GAGCAAATGC TAATTACAAC TTGAGTTTGG ATAGTGAGTT GTCAGATCTG
GCGGGTAATA GTTTGAAGAG AGCTCACAAC TTGGGAGTGT TGAAGGATGA TCAAAAGTTC
CAGGAGTTTG TCGGTAAGTC TGACCGAAAG GATATTTATA AATTTAAGGT TAAAGACAAG
ACGGATGTAG ATATTGAGTT AGGCGGCCTC AGTGGAAATG CGGATTTGTA TTTGCTGAAT
AATAGGGGGA AAGTTATTGA GAAGTCTACT AAGGGTGGCA AAAGGGTAGA GGACGTTGAC
CGGACTTTAA ACCCAGGGAC TTATTATGTA AGGGTTCAGT CGAGGAATAA GAGGGTTAAT
GCTGATTATA GTTTGAGTTT AGATACAAAG AGCCCACATT TGCATATTCA CGAACCTATA
ATTATAAACG CCATCGACAC TGCTGCAGAT CAACCCGGTA CATTAGATGA CTCTTCAACA
ATTGATTTGA TGGTTGTTTA CACTTCAGAA GCTCGCCGAG CGGAGGGGGG TATAGATGCT
ATCAAAGATC TAATTGAGTT TGCTGTTGAT GACGCCAATG AAGCCTTTGC TAAAAGCGGA
GTTCAATCCC AGTTACGGCT AGTCCATACA GCAGAGGTTA ACTATACCGA GTCGGGTAAA
AGTATTAAGG AACTTGAACG GCTAAAAAAC GATTCAGACG GCTATATGGA TGAAGTTCAC
GAACTTCGTA ATGATTATGG TGCTGACATT GTAAGTTTAT TTGTCAGCAG TCTTGACGAT
GCTGGTGGTA TAGCTTATCC AATGGGCACG CCAGCGTATC AATTTGAAAG TCACGCTTTC
AACGTTGTCA CCAATTATAA TGCCAAAACT CGTCACACTC TTGCTCATGA AATAGGACAT
AATCTAGGAT TAGCCCATGA TCGCGATAAT GCTGAAGGTC AAGGTTCTTT TCCTTACGGC
TATGGTTACA CTACTCCCAG TGGCGCGGGA ACCATAATGT CTTATGCTAG GAACAGACTT
CCATACTTCT CTAACCCTGC TATTAGCTAC AATGGTGAAG CACTTGGTCA AACCAACCGT
GAGAACTCTG CTTTAGCTAT TAACAAAGTG GCTCAATATG CTGCTAACTG GCGACCTTCT
GGTGGTACAA TTAAATCCCA AGACTCTATC ACTCTCAACT CCCCCAACGG AGGTAATACC
TTAGAACAGG GTTCTAATTA TACTATTACT TGGAATGACA ATATTAGCGA AAACATCAAA
CTGGAACTAT ACAAAGGAGG TTCTTTCTAC AGTACCATTA ACAGTTCTAC TACCAGCGAT
GGCAGTTACA GTTGGAGCAT ACCCACATCC ATAACCAGTG GCAGCGACTA CAAGTTAAAA
ATTAGTAGTG TTAGTGATAG TAGCTTGTAC GACTACAGCG ATAGCAACTT CACTATTGAA
CCAGAAGAGT TCATCACGCT CACAGCTCCC AACGGAGGTA ACAGTCTAGA ACCAGGGAGA
AGCTACTACA TCGACTGGGA GGATAATATA AGCGAGAATG TGAAACTAGA ACTGTACAAA
GGAGGTTCTT TCTACAGTAC TATTAATAGC TCTACCTCCA GTGATGGTCG TTACAATATC
TGGACAGTAC CCACATCCAT AACCAGTGGC AGCGACTACA AGATTAAAAT CAGTAGCGTT
AGTGATAGTG GCTTGTACGA CTACAGCGAT AGCAACTTCA CTATTGAACC AGAGGAGTTC
ATCACGGTCA CAGCTCCCAA CGGAGGTAAC AGTCTAGAAC CAGGAATAAG CTACTACCTT
GACTGGGAGG ATAATATAGG CGAGAATGTC AAAATAGAAC TGTACAAAGG AGGTTCTTTC
TACAGTACCA TTGACAGCTC TACCTCCAGT GATGGTCGTC ACATCTGGGG AGTACCCACA
TCCATAACCA GTGGCAGCGA CTACAAGATT AAAATCACTA GTGTTAGTGA TAGTGGCTTG
TACGACTACA GCGATAGCAA CTTCACTATT GAACCAGAAG AGTTCATCAC GCTCACATCT
CCCAACGGAG GTAACAGTCT AGAACCAGGG AGAAGCTACT ACATCGACTG GGAGGATAAT
ATAAGCGAGA ATGTGAAACT AGAACTGTAC AAAGGAGGTT CTTTCTACAG TACTATTAAT
AGCTCTACCT CCAGTGATGG TCGTTACAAT ATCTGGACAG TACCCACATC CATAAGCAGT
GGCAGCGACT ACAAGATTAA AATCAGTAGC GTTAGTGATA GTAGCTTGTA CGACTACAGC
GATAGCAACT TCACTATTGA ACCAGAAGAG TTCATCACGC TCACAGCTCC CAACGGAGGT
AACAGTCTAG AACCAGGGAG AAGCTACTAC ATCGACTGGG AGGATAATAT AAGCGAGAAT
GTGAAACTAG AACTGTACAA AGGAGGTTCT TTCTACAGTA CTATTAATAG CTCTACCTCC
AGTGATGGTC GTTACAATAT CTGGACAGTA CCCACATCCA TAACCAGTGG CAGCGACTAC
AAGATTAAAA TCAGTAGCGT TAGTGATAGT GGCTTGTACG ACTACAGCGA TAGCAACTTC
ACTATTGAAC CAGAAGAGTT CATCACGCTC ACAGCTCCCA ACGGAGGTAA CAGTCTAGAA
CCAGGGAGAA GCTACTACAT CGACTGGGAG GATAATATAA GCGAGAATGT GAAACTAGAA
CTGTACAAAG GAGGTTCTTT CTACAGTACT ATTAATAGCT CTACCTCCAG TGATGGTCGT
TACAATATCT GGACAGTACC CACATCCATA ACCAGTGGCA GCGACTACAA GATTAAAATC
AGTAGCGTTA GTGATAGTGG CTTGTACGAC TACAGCGATA GCAACTTCAC TATTGAACCA
GAAGAGTTCA TCACGCTCAC AGCTCCCAAC GGAGGTAACA GTCTAGAACC AGGGAGAAGC
TACTACATCG ACTGGGAGGA TAATATAAGC GAGAATGTGA AACTAGAACT GTACAAAGGA
GGTTCTTTCT ACAGTACTAT TAATAGCTCT ACCTCCAGTG ATGGTCGTTA CAATATCTGG
ACAGTACCCA CATCCATAAC CAGTGGCAGC GACTACAAGA TTAAAATCAG TAGCGTTAGT
GATAGTGGCT TGTACGACTA CAGCGATAGC AACTTCACTA TTGAACCAGA AGAGTTCATC
ACGCTCACAT CTCCCAACGG AGGTAACAGT CTAGAACCAG GGAGAAGCTA CTACATCGAC
TGGGAGGATA ATATAAGCGA GAATGTGAAA CTAGAACTGT ACAAAGGAGG TTCTTTCTAC
AGTACTATTA ATAGCTCTAC CTCCAGTGAT GGTCGTTACA ATATCTGGAC AGTACCCACA
TCCATAACCA GTGGCAGCGA CTACAAGATT AAAATCAGTA GCGTTAGTGA TAGTGGCTTG
TACGACTACA GCGATAGCAA CTTCACTATT GAACCAGAAG AGTTCATCAC GCTCACAGCT
CCCAACGGAG GTAACAGTCT AGAACCAGGG AGAAGCTACT ACATCGACTG GGAGGATAAT
ATAAGCGAGA ATGTGAAACT AGAACTGTAC AAAGGAGGTT CTTTCTACAG TACTATTAAT
AGCTCTACCT CCAGTGATGG TCGTTACAAT ATCTGGACAG TACCCACATC CATAACCAGT
GGCAGCGACT ACAAGATTAA AATCAGTAGC GTTAGTGATA GTGGCTTGTA CGACTACAGC
GATAGCAACT TCACTATTGA ACCAGAAGAG TTCATCACGC TCACAGCTCC CAACGGAGGT
AACAGTCTAG AACCAGGGAG AAGCTACTAC ATCGACTGGG AGGATAATAT AAGCGAGAAT
GTGAAACTAG AACTGTACAA AGGAGGTTCT TTCTACAGTA CTATTAATAG CTCTACCTCC
AGTGATGGTC GTTACAATAT CTGGACAGTA CCCACATCCA TAACCAGTGG CAGCGACTAC
AAGATTAAAA TCAGTAGCGT TAGTGATAGT GGCTTGTACG ACTACAGCGA TAGCAACTTC
ACTATTGAAC CAGAAGAGTT CATCACGCTC ACAGCTCCCA ACGGAGGTAA CAGTCTAGAA
CCAGGGAGAA GCTACTACAT CGACTGGGAG GATAATATAA GCGAGAATGT GAAACTAGAA
CTGTACAAAG GAGGTTCTTT CTACAGTACT ATTAATAGCT CTACCTCCAG TGATGGTCGT
TACAATATCT GGACAGTACC CACATCCATA ACCAGTGGCA GCGACTACAA GATTAAAATC
AGTAGCGTTA GTGATAGTGG CTTGTACGAC TACAGCGATA GCAACTTCAC TATTGAACCA
GAGGAGTTCA TCACGGTCAC AGCTCCCAAC GGAGGTAACA GTCTAGAACC AGGAATAAGC
TACTACCTTG ACTGGGAGGA TAATATAGGC GAGAATGTCA AAATAGAACT GTACAAAGGA
GGTTCTTTCT ACAGTACCAT TGACAGCTCT ACCTCCAGTG ATGGTCGTCA CATCTGGGGA
GTACCCACAT CCATAACCAG TGGCAGCGAC TACAAGATTA AAATCAGTAG CGTTAGTGAT
AGTGGCTTGT ACGACTACAG CGATAGCAAC TTCACTATTG AACCAGAGGA GTTCATCACG
GTCACAGCTC CCAACGGAGG TAACAGTCTA GAACCAGGAA TAAGCTACTA CCTTGACTGG
GAGGATAATA TAGGCGAGAA TGTCAAAATA GAACTGTACA AAGGAGGTTC TTTCTACAGT
ACCATTGACA GCTCTACCTC CAGTGATGGT CGTCACATCT GGGGAGTACC CACATCCATA
ACCAGTGGCA GCGACTACAA GATTAAAATC AGTAGCGTTA GTGATAGTGG CTTGTACGAC
TACAGCGATA GCAACTTCAC TATTGAACCA GAGGAGTTCA TCACGGTCAC AGCTCCCAAC
GGAGGTAACA GTCTAGAACC AGGAATAAGC TACTACCTTG ACTGGGAGGA TAATATAGGC
GAGAATGTGA AACTAGAACT GTACAAAGGA GGTTCTTTCT ACAGTACCAT TAACAGTTCT
ACCTCCAGTG ATGGTCGTTA CACCTGGCTA GTACCCACAT CCATAACCAG TGGCAGCGAC
TACAAGATTA AAATCACTAG TGTTAGTGAT AGTGGCTTGT ACGACTACAG CGATAGCAAC
TTCACTATTG AAGCAGATGA TTCAAACTCT GATAAATACT ACTTTACCTA CTTTTATGAT
CTAGGCGACT CCTATGATGG TTTCTTGTAC GAAAAAGCAG GAAGATATTC TTTAGATGAT
TCATTATACA GCAGTAATGG TCGCTACCAG ATATGGGATA TTGAGAGTGG CGTGGGTAGT
AAAAATGATA TTGGCGATGT TTACGTTTAT AGCTACTACG ATGAGAATTA TACTGGTGAA
ACTTACGAAC CATCCTGGTG GACTTGGGGA CTTACGGCAG GTGAAAATGG CTTAGGTAGT
GAGTCTGACA CAATATCGGG TTTCTATGGT GAAGAATATT TTGACCCCTA TAATGAAGCT
GACGGTTAA
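
The GC content listed under Gene Information can be recomputed from the gene sequence above and compared with the reported value. The following is a minimal Python sketch. It assumes the sequence is pasted as plain text, so the spaces and line breaks between the 10-base groups are stripped first; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base group of the gene sequence: 3 G + 2 C out of 10 bases.
print(gc_percent("GTGGATCCTA"))  # 50.0
```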
 
Protein sequence
MDPISEELNN FRQGFTFTGT GEVLAEVDKF GDELRFSSGL ELSPASEELK IENIDTSNAD 
ENFLLPKSNL GNLDLNIEDS EDRTLKTETD PLIGQDNDPV QEVDSLISGT SDRVNNIQTR
AAKKRDKAGN SRGKAHNLGV LEDDQKLQEF VGKSDRKDFY KFKVKNKTDI DIELGGLSGN
ADLYLLNNRG KVIEKSNKGG KRAEDIERTL NPGTYYVRVQ SKGRANANYN LSLDSELSDL
AGNSLKRAHN LGVLKDDQKF QEFVGKSDRK DIYKFKVKDK TDVDIELGGL SGNADLYLLN
NRGKVIEKST KGGKRVEDVD RTLNPGTYYV RVQSRNKRVN ADYSLSLDTK SPHLHIHEPI
IINAIDTAAD QPGTLDDSST IDLMVVYTSE ARRAEGGIDA IKDLIEFAVD DANEAFAKSG
VQSQLRLVHT AEVNYTESGK SIKELERLKN DSDGYMDEVH ELRNDYGADI VSLFVSSLDD
AGGIAYPMGT PAYQFESHAF NVVTNYNAKT RHTLAHEIGH NLGLAHDRDN AEGQGSFPYG
YGYTTPSGAG TIMSYARNRL PYFSNPAISY NGEALGQTNR ENSALAINKV AQYAANWRPS
GGTIKSQDSI TLNSPNGGNT LEQGSNYTIT WNDNISENIK LELYKGGSFY STINSSTTSD
GSYSWSIPTS ITSGSDYKLK ISSVSDSSLY DYSDSNFTIE PEEFITLTAP NGGNSLEPGR
SYYIDWEDNI SENVKLELYK GGSFYSTINS STSSDGRYNI WTVPTSITSG SDYKIKISSV
SDSGLYDYSD SNFTIEPEEF ITVTAPNGGN SLEPGISYYL DWEDNIGENV KIELYKGGSF
YSTIDSSTSS DGRHIWGVPT SITSGSDYKI KITSVSDSGL YDYSDSNFTI EPEEFITLTS
PNGGNSLEPG RSYYIDWEDN ISENVKLELY KGGSFYSTIN SSTSSDGRYN IWTVPTSISS
GSDYKIKISS VSDSSLYDYS DSNFTIEPEE FITLTAPNGG NSLEPGRSYY IDWEDNISEN
VKLELYKGGS FYSTINSSTS SDGRYNIWTV PTSITSGSDY KIKISSVSDS GLYDYSDSNF
TIEPEEFITL TAPNGGNSLE PGRSYYIDWE DNISENVKLE LYKGGSFYST INSSTSSDGR
YNIWTVPTSI TSGSDYKIKI SSVSDSGLYD YSDSNFTIEP EEFITLTAPN GGNSLEPGRS
YYIDWEDNIS ENVKLELYKG GSFYSTINSS TSSDGRYNIW TVPTSITSGS DYKIKISSVS
DSGLYDYSDS NFTIEPEEFI TLTSPNGGNS LEPGRSYYID WEDNISENVK LELYKGGSFY
STINSSTSSD GRYNIWTVPT SITSGSDYKI KISSVSDSGL YDYSDSNFTI EPEEFITLTA
PNGGNSLEPG RSYYIDWEDN ISENVKLELY KGGSFYSTIN SSTSSDGRYN IWTVPTSITS
GSDYKIKISS VSDSGLYDYS DSNFTIEPEE FITLTAPNGG NSLEPGRSYY IDWEDNISEN
VKLELYKGGS FYSTINSSTS SDGRYNIWTV PTSITSGSDY KIKISSVSDS GLYDYSDSNF
TIEPEEFITL TAPNGGNSLE PGRSYYIDWE DNISENVKLE LYKGGSFYST INSSTSSDGR
YNIWTVPTSI TSGSDYKIKI SSVSDSGLYD YSDSNFTIEP EEFITVTAPN GGNSLEPGIS
YYLDWEDNIG ENVKIELYKG GSFYSTIDSS TSSDGRHIWG VPTSITSGSD YKIKISSVSD
SGLYDYSDSN FTIEPEEFIT VTAPNGGNSL EPGISYYLDW EDNIGENVKI ELYKGGSFYS
TIDSSTSSDG RHIWGVPTSI TSGSDYKIKI SSVSDSGLYD YSDSNFTIEP EEFITVTAPN
GGNSLEPGIS YYLDWEDNIG ENVKLELYKG GSFYSTINSS TSSDGRYTWL VPTSITSGSD
YKIKITSVSD SGLYDYSDSN FTIEADDSNS DKYYFTYFYD LGDSYDGFLY EKAGRYSLDD
SLYSSNGRYQ IWDIESGVGS KNDIGDVYVY SYYDENYTGE TYEPSWWTWG LTAGENGLGS
ESDTISGFYG EEYFDPYNEA DG
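
The protein sequence corresponds to the gene sequence translated with translation table 11. The gene starts with GTG, which normally encodes valine but is read as methionine when it serves as the initiation codon, so the protein begins with M. The Python sketch below illustrates this under stated assumptions: the amino-acid string and the set of alternative start codons are written out by hand from the commonly cited definition of table 11 and should be checked against the NCBI genetic code listing. `translate_cds` is a hypothetical helper, not the tool that produced this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed initiation codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate_cds("GTGGATCCTA TATCTGAAGA ATTAAACAAC"))  # MDPISEELNN
```

Passing the full gene sequence to the same function yields a protein that can be compared against the listing above.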