Gene Tery_3359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3359 
Symbol 
ID4243454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5154768 
End bp5158232 
Gene Length3465 bp 
Protein Length1154 aa 
Translation table11 
GC content41% 
IMG OID638108344 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_722934 
Protein GI113476873 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.324183 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACAA CACCGAGCTC GAGGAGAGTT CAGAACTCTA GGAATAATCC TCCAGTAGCT 
GATAAGAACA AAATCACAGT ATACGAAAAT AGCACAGACA CACCTTTAGG AATCACAGCT
CCCACGGACC CTGATGGAGA TCCCCTAACT ATTCGAGTCA TCGGATTACC AAGATTAGGA
ACAGTAACTA AAGCTGACGG TACAGAAGTT AAAAGACGTG ACAAATTAAC ATCAGAGGAA
TTAGTGGGGT TAGAGTATGA TGCCCCTAAC AATTACAACG GTAAAGGTAA CGCTGGAGGT
TTTTTCTACT TTGTTAATGA CGGTACATCC AACAGGTTAG GTAGCACACG TATAACTATT
AATCCTTTAC CTGAAGATTT CAAACCAGGG GAAGTCATTG TTAAACTTAA AGATGTCGAT
AATAAAACAT CTTCTAAGAA GATTGAATCT TTCCGTGATA ATCTAGATAT TGAAGTAATT
TCGACAATTG AAGGGATCGG CGTTGAGTTG TGGAAGTTAC CAAATTCTAC TAATGTACAA
GAGTTTGTTG AAGAATATAG TAGCCGTCCG GAGTTTGATA TTCAGCCGAA TTTTACAAAT
ACTAAGTTGT TTACTCCTAA TCATCCTGAT GATCCAGATT ATAATGTTCT AGAGGGTTCT
TCTCCTCCTG GGCTAGGACG TCTTTGGGGA CTCAACAATA AAGGACAAAC TGGGGGAACA
GATGATGCAG ATATTAATGC ACCAGAAGCT TGGGGTTTTA CAACAACTCC AGTAGTATCT
CCTACAGTTA ATTCTACTGT TCGTGTAGCT GTTATTGACA CAGGTGTTGA CGTAAATCAC
CCAGATTTAA CAGGTAACTT AAATTTAGAT TTGGCGGCTA ATACAATATT TGGCGATGAT
CCTGAGGATG TGACAGACAA CCACGGTCAC GGTACTCATG TTGCTGGAAT CATTGGTGCT
GTAGGAAATA ATCAGACAGG AGTAGTCGGA GTCAATTGGG ATGTTGAAAT CGTGCCGATA
AAAGCCTTTG ATGATATTGA TGGCGATGGT GTTCCAGAGG CAACTGATAT GGCTATTTTA
GAGGCAATTA ACTATGCGAT TAACGTTGCC AAAGTAGATA TAATCAATGC TAGTTGGGGC
AAGCTACCTG ATAATGATAA TGATGATAAT GATGATATAG CGGAACTTTG GAAGAATGTC
ATAGATAATG ATACTGATGA CGAAAGCCAA CCACCACCGT CACCGCCACC ACTATTCGTC
GCAGCAGCTG GTAATCAAGG GGTTGATATT GATGATCCGG AAAATGCTGT TTACCCTGCG
AGCATTGACT CACAAAACAT TATTTCTGTA GCTGCAACAG ACCATGACGA TAACCTTTCC
TCATTTTCCA ACTTTGGAGC ATCCGTTGAT TTAGCTGCTC CCGGCGGTAG CGATATACCA
GGTAATGGTC CTGGAAGTTC TACTGATCCT CGTAATATTT ATAGTACGTT GCCAAATAAT
GATTATGGAT ACAGTGCAGG TACTTCCGCC GCCGCAGCTT ATGTGAGTGG AGCAGCAGCT
TTAATGCTGG GGACAAGAAG AGCCAGGAAT GAGAAAACCG GTCAGCCTGA TTTGAGTACT
CTTCAATTGG AAGAGAAACT TCGAAATGCA ATCACTCCTA TTAATGGTTT GCCAACAGCC
ACAGGTGGTC GCCTTAATTT GTATAATGGA ATTGATCAGG AAGGTATTGG TTGGGGTGAC
GTACACTTTA CCACTTTTGA CGGTCGTAAA TATGACCTTC AATCCTTTGG CGACTTTATT
ATGGCCGAAA CAGCGCGGAA TGATGATGAC TGGGTAGTTC AAACTCGTCA ACAACCTTGG
GCAAAGAATA GTTCAGTTGC TGTTAATACA GCTTTTGCTA CCCGAGTTGA TGGTAAAACG
GTAGTTTTTA ACCAGAAGTT TCCTAACAAC AGACTCCAAG TTGGTGGAGT TGATTTTCCC
TTGGCTAGCG GTGAGACTAA AAACATTGGG GACAGCAAAA TCGAACGTGA TGGCAACAAA
TACACAATTA CCTATGCAGG AAATGATGGT ATTATTGATG TTGATGATGC TAAGTTAACA
GCTTTTGATA ACGGCGATCA TATCAATATT CATATATCTG ACTTTGCTAC AATGCAAGGG
CTGTTAGGAA ATAATGATGG TAACCCTAAC AATGATTTTG CTTTAAGTGA TGATACTCAA
AAGTCTAACA ATGTAACCGC AAAAACAATA CACCAAGAGC ATGGTGAGTA TTGGCGGGTA
CCGAGTGAAG ACAAAGAACA AAAGGGGGAT AGAAAATCTT TGTTTGAAGA CTCAGCCGAG
GTTATAGGCA TTCCTAAAAG ATTTTTAACG CTGGATGGTT TCCCTAAAAA TGACGTGGCA
GCAGTCAATG CAAAGGTAAA AAAAGCAGGA ATTACTGACA AAGACAGAGC AGATGCAGTT
GCTTTCGACC TTCTTGCAAC TGAAGATGAA ACTTTTCTGA CTAGTGCTGT AGAGTTTTTT
AAGTCTGTAG ACGAAGCAAA TAATCCTGAC GTTCAAGCAC CTGTCCAGTT TGACTTCAAT
GCTGATGGAG TAGCAGACAT TCTCTGGCGT GAGGAAAAAG GGCGTCGTGC GCGTAGTGAT
ATTTGGTTTA TGAATGATGA TGGTACACTT AATAAAAGTA CTCCACTCGT AAATTATTAT
TCAAGGTGGG ATGTAGCAGG AGTAGGAGAC TTCAATGCTG ATGGAGTAGC AGACATTCTC
TGGCGCCACA AAAAATACGG CTTTAACTAT ATTTGGTTGA TGAATGATGA AGGCACATTT
AATAGCCGTC TTCACATCAA GCGTCTTAGT TCAAGCTGGA ATGTAGAAGG AGTAGCAGAC
TTCAATGGTG ATGGAGTAGA AGACATTTTC TGGCGTAATA AATCTCAAAA CCAGATTTGG
TTTATGAATG ATGAGGGCAA AGTTAATAAC CGTGCTAGCC TTGACAGTCT TGGTACAAGT
TGGGATGTAG CAGGAGTAGG AGATTTCAAT GGTGATGGAG TAGAAGACAT TCTCTTGCGT
GATACAAAAG GAAGTAACGA GATTTGGTTT ATGAATGATC AGGGCGAAGT TGATAACCGT
GATCGCCTTA GCCGTCTTAG TTCAAGGTGG GATGTAGCAG GAGTAGGAGA TTTCAATAGT
GATGGAGTAG AAGACATTCT CTGGCGTGAT ACAAAAGGAA GTAATCAGAT TTGGTTAATG
AATGATCAGG GGAAAGTTCA GAGTTCTGTT GACCCAGGAA GTTATGATTC AGCTTGGGAT
GTAGCAGGAG TAGCAGAGTT CAACGGTGAT GGAGTTGCAG ATATTCTCTG GCGCGATGAA
AATAATGGAT CTAACCGTAT TTGGTTGATG AATAATGACG GCACACGTAA TCAGATCGTT
GACCCTGGAT CTCTTGGTTC AACTTGGGAC GTAGTTGGGA TGTAA
 
Protein sequence
MMTTPSSRRV QNSRNNPPVA DKNKITVYEN STDTPLGITA PTDPDGDPLT IRVIGLPRLG 
TVTKADGTEV KRRDKLTSEE LVGLEYDAPN NYNGKGNAGG FFYFVNDGTS NRLGSTRITI
NPLPEDFKPG EVIVKLKDVD NKTSSKKIES FRDNLDIEVI STIEGIGVEL WKLPNSTNVQ
EFVEEYSSRP EFDIQPNFTN TKLFTPNHPD DPDYNVLEGS SPPGLGRLWG LNNKGQTGGT
DDADINAPEA WGFTTTPVVS PTVNSTVRVA VIDTGVDVNH PDLTGNLNLD LAANTIFGDD
PEDVTDNHGH GTHVAGIIGA VGNNQTGVVG VNWDVEIVPI KAFDDIDGDG VPEATDMAIL
EAINYAINVA KVDIINASWG KLPDNDNDDN DDIAELWKNV IDNDTDDESQ PPPSPPPLFV
AAAGNQGVDI DDPENAVYPA SIDSQNIISV AATDHDDNLS SFSNFGASVD LAAPGGSDIP
GNGPGSSTDP RNIYSTLPNN DYGYSAGTSA AAAYVSGAAA LMLGTRRARN EKTGQPDLST
LQLEEKLRNA ITPINGLPTA TGGRLNLYNG IDQEGIGWGD VHFTTFDGRK YDLQSFGDFI
MAETARNDDD WVVQTRQQPW AKNSSVAVNT AFATRVDGKT VVFNQKFPNN RLQVGGVDFP
LASGETKNIG DSKIERDGNK YTITYAGNDG IIDVDDAKLT AFDNGDHINI HISDFATMQG
LLGNNDGNPN NDFALSDDTQ KSNNVTAKTI HQEHGEYWRV PSEDKEQKGD RKSLFEDSAE
VIGIPKRFLT LDGFPKNDVA AVNAKVKKAG ITDKDRADAV AFDLLATEDE TFLTSAVEFF
KSVDEANNPD VQAPVQFDFN ADGVADILWR EEKGRRARSD IWFMNDDGTL NKSTPLVNYY
SRWDVAGVGD FNADGVADIL WRHKKYGFNY IWLMNDEGTF NSRLHIKRLS SSWNVEGVAD
FNGDGVEDIF WRNKSQNQIW FMNDEGKVNN RASLDSLGTS WDVAGVGDFN GDGVEDILLR
DTKGSNEIWF MNDQGEVDNR DRLSRLSSRW DVAGVGDFNS DGVEDILWRD TKGSNQIWLM
NDQGKVQSSV DPGSYDSAWD VAGVAEFNGD GVADILWRDE NNGSNRIWLM NNDGTRNQIV
DPGSLGSTWD VVGM