Gene Tery_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2037 
Symbol 
ID4243641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3171686 
End bp3175804 
Gene Length4119 bp 
Protein Length1372 aa 
Translation table11 
GC content40% 
IMG OID638107150 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_721753 
Protein GI113475692 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.884942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0586303 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTACAA CACTACTAAG TTTTCAGGGA CGCCAGAACT CTAGTCAAAA TCCTCCGGTA 
GTCGATAGAA ACAAAATCAC AGTATACGAA AATAGCACAA ACACATCCTT AGGACTCGCG
ACCCCTACGG ACGCCGATGG AGATCCCCTC ACTATCCGAG TCGCCAGAGT GCCAAGATTA
GGAGAAGTAA CTAAAGCTGA TGGTACAGTA GTTAAAAGAA ATGACATACT AACATCAGAG
GAATTAGTGG GGTTAGAGTA TGACGCCCCT GCAAATTATA ACGGTAAAGG CTACCCTGGA
AGTTTTTTCT ACTTTGTCAA TGACGGTAGC TTCGACATAT TAGGTAGTAC ACGTATAACT
CTTAAGCCTT CACCTGAACA TTATAAACCA CGCGAAGTCA TTGTTACCCT TGAAGATCCC
AGTAATGAAA CATTTTCTAA TTCTATTCAA TCTTTACGTG ATGAATTAGA CATTGAAGTA
GTTTCCACAA TTGAAGCCTT CGACCGTGAA GTGTGGAAAT TACCAATTTC TACTACTGTA
AAAGAATTTA TTGAAGAATA TGATAGTCGT CCGGAAATTG ATATTCAGCC CAACTATAAA
AATAGGTTGT CGCGTCTTTA TCATCCTAAT GATCCAGATT ATAATAATAT TCCAGACTCT
GTTGTTCCTC CTGGCCGTGG AGTAGGTCGT CTTTGGGCTC TCAACAATAA AGGACAAACT
GGGGGAAAAG ATGGTGCAGA TATTAACGCT CCAAAAGCTT GGGGTTATGA AACTCCAGAA
CTCATAACTC CGGTAGACGA TAACGGTAGT CCGAAAGTTA GGGTAGCTGT TATTGATACA
GGTGTTGACG TAGATCATCC AGATTTAATC AATAATTTAG ACTTATCTGC GGCCAGGAAT
TTTGTTGATG GGTATAATAA TACAGATAAT ATTTCTAAGG AAGTGGAAGA CTTAGACGGC
CACGGTACTC ATGTTGCTGG AATCATTGGT GCTATAGGAA ATAACAATGA AGGAATAGTT
GGGGTAAGTT GGAATGTCGA AATCGTTCCA ATAAAAGCCT TTGACTTTGA TGAGAATGAT
GATCCTATAG GGTTTGATGC TGATATCATT GAGGCCATAG ATTATGCGAT TAACGATGCT
CAAGTAGATA TAATCAATGC TAGCTGGGGT AAGCCGGTTG GGACGCCTTA TTCTAAAGAA
TTGAAAGAAG CGATTAGTAA TGTCAACCAT CCTTCAGGAC GAGTACCACC ACTATTTGTG
GCAGCGGCTG GTAATGAGAG CAATGATAAT GATAATGCTA ATTTAAAGAT GAGAACTTAT
CCCGCCAGCT ATGATTTAAA CAACATTATT TCTGTAGCTG CAACAGACCA TAATGATCGG
CTTTCCCCTT TTTCCAATTA TGGAAGAAAG TCTGTTGATT TAGCTGCTCC TGGTGGTAGC
AATCTACCTG ATAATAATAA TAACCCTCAT GATTCTAGTG ATATTTATAG TACGGTACCG
GTTGGTACTG GTATTGATGG TGGTAATTAT GAATACAGTG CAGGTACTTC CGCAGCAGCA
GCTTATGTGA GTGGAGCAGC AGCTTTAATG CTGGGGACAA GAAGAGCCAG GAGGCAGACT
TATCCTGTTG GGAGTCCTAT GTATGAAGCT TTGGATGATT TGAGTGCTGT TGAGCTGAAA
GATAAAATTC TTAAGATCAC GACTCCTATT GATGGTTTGA AAACAGCCAC AGGTGGTCGC
CTTAATTTGT ATGAGGGAAT TAGACGTCAA GGTATAGGTT GGGGTGACGT ACACTTTGCC
ACTTTTGATG GTCGTAAATA TGATCTTCAA TCCTTTGGCG ATTTTATTAT GGCGGAAACA
GCGCGTAAGG ATGATGACTG GGTAGTTCAG ACTCGTCAAG AACCTTTTGT ACACAATAGA
TCAGTTTCTG TTAATACAGC TTTCGCCACC CTAGTTGATG GTCAAAGAGT AGTTTTTAAC
CAGAAATTTC CTAATAACAG GCTCCAAATT GATGGAGTTG ATTTTCCCTT AGCTAGTGGT
GAGACTAAGA GTATTGGGAA CAGTAAAATC GAACGTAATA ACAATAAATA CACAATTACC
TACGCAGGAA ACGATGGTAT TATTGATATT GATGACACTA CATTAACAGC TTTTGATTGG
AGCAGTTATA TCAACATTTA TATATCTGAT TCTGCTAGAA TGCAAGGGCT GTTAGGAAAT
AATGACGGTA ACCCTAATAA TGATTTTGCA TTGCGTGATG GTACTCAACT TCCTAATAAT
TTAACCGTAA GACAAATACA TCAACAATAT GGTGAAAGTT GGCGAGTAAA AGAAGGAGAA
TCTTTATTTA AAAATCCAGC CACTGTTATA GATCTTCCTG AAAAATTTAT ATCGCTGGAT
GACTTTCCTC AAGATGAAGT TGCAGCAGCC AAGGCAAAGG TAAAAAAAGC AGGAATTACT
GATGAAAACA GAGTAAATGC AGTTGCTTTC GATATTCTTG CAACCGAGGA TGAAAGTTTT
CTAAATAGTG CTGTAGAAAT GTTTAATTCT TTAGACGACA ACTCTGCTCC CACAGATATC
AAGCTTGACA ACAATACCAT TGACGAGAAT GTAACCCCAG GTGCTACAGT TGGCAAATTT
TCCACTACCG ACCCCGACAA TGAGGATAGC TTTACTTATG CATTAGTAGA AGGTGTCGGA
GATAGGGATA ACGCAGGTTT CAATGTTGAT GGTGATCAAC TGAAAATTAA TGGTTCTCCA
GACTATGAAA CCCAGTCTAG TTATAGCATC CGAGTTAAAA CTACTGATGG AAAAGGAGCA
AGTTATGAAG AACAGTTAAC CATTAATGTT AACGATCTTG ACGATAAAGC ACCTACAGCA
GCAACCTTCA CCCCAACTGA TAATGCCACA GATATCGCCA TCGGAACTAA TTTAGTCATA
AACTTTGATG AAAACATCCA AGCAGGGACA GGCAACATAA TTATCAAACA ATTCAGCGAT
AATTCAGTAG TAGAAACTAT AAATGTTACT TCTAGTCTAG TTAGTATCTC AAACAATACC
CTCACCATTA ACCCCACTGC AGACTTAGCA GAACGAACAA AGTATTATGT TGAGGTAGAA
GCAGATGCTA TACAAGACAC TTCAGGCAAT AATTACCTTG GTATTAAAAA CAACAGCACC
TGGAACTTTA CGACTATTAA TAATAGTCCA GTCCAGTTTG ATTTCAACGG CGATGGAGTA
GCAGACATTC TCTGGCGTAA AAAAATAGAT AGTCCTGCAA ATGCAGAGAA CCAGATTTGG
TTTATGAATG ATGACGGCAC AGTTAATAAT AGTGCTCCCC TGAAAAGTAA TTATTCAACA
TGGGGTGTAG CAGGAGTAGG AGATTTCAAT GCTGATCAAG TACCCGACAT TCTCTGGCGT
AATAAGTACA AACGTAACGA GATTTGGTTT ATGAATGATG ACGGCACATT TAATAGTCGT
GCTCGTCTCA AGCGTCGTGG TTCAAGCTGG TCTGTAGGAG GAGTAGCAGA TTTCAATGCT
GATCAAGCAA CAGACATTCT CTGGCGTAAT AAATACGGGT ATAACGAGAT TTGGTTTATG
AATGATGAGG GTGCACTTAA TCATCGTGCT CGCTCCCTAG GTCCTGATTC AAGCTGGGAC
GTAGCAGGAG TAGGAGATTT CAATGCTGAT CAAGTAGCCG ATATTCTCTG GCGTGATCAA
AATGAAAATA ACATGATTTG GATGATGAAT GATGACGGCA CAGTTAATAA TAGTGCCCGC
CCTGATAGTC TTAATTCAAG CTGGGATGTA GTAGGAGTAG CAAATTTCAA TGCTGATCAA
GTAGCCGATA TTCTCTGGCG TGATGAAAAA GGAAGTAGCC AAATTTGGTT AATGAATGAT
CAGGGAAAAG TTCAGAATTC TATTAGCCTA GGAAGTTATG ATTCACCCTG GAATGTAAAA
GGAATGCCAG ATTTGAATGG TGATGGAGTC GCAGATATTC TCTGGCGCAA TGAAAACAAT
GGAGCTAACC ATATTTGGTT AATGAATGAT GACGGCACAC GTAATCAGAT CGTTGACCCT
GGATCTCTTG ATTCAACTTG GGACATAGTT GGAATGTAA
 
Protein sequence
MGTTLLSFQG RQNSSQNPPV VDRNKITVYE NSTNTSLGLA TPTDADGDPL TIRVARVPRL 
GEVTKADGTV VKRNDILTSE ELVGLEYDAP ANYNGKGYPG SFFYFVNDGS FDILGSTRIT
LKPSPEHYKP REVIVTLEDP SNETFSNSIQ SLRDELDIEV VSTIEAFDRE VWKLPISTTV
KEFIEEYDSR PEIDIQPNYK NRLSRLYHPN DPDYNNIPDS VVPPGRGVGR LWALNNKGQT
GGKDGADINA PKAWGYETPE LITPVDDNGS PKVRVAVIDT GVDVDHPDLI NNLDLSAARN
FVDGYNNTDN ISKEVEDLDG HGTHVAGIIG AIGNNNEGIV GVSWNVEIVP IKAFDFDEND
DPIGFDADII EAIDYAINDA QVDIINASWG KPVGTPYSKE LKEAISNVNH PSGRVPPLFV
AAAGNESNDN DNANLKMRTY PASYDLNNII SVAATDHNDR LSPFSNYGRK SVDLAAPGGS
NLPDNNNNPH DSSDIYSTVP VGTGIDGGNY EYSAGTSAAA AYVSGAAALM LGTRRARRQT
YPVGSPMYEA LDDLSAVELK DKILKITTPI DGLKTATGGR LNLYEGIRRQ GIGWGDVHFA
TFDGRKYDLQ SFGDFIMAET ARKDDDWVVQ TRQEPFVHNR SVSVNTAFAT LVDGQRVVFN
QKFPNNRLQI DGVDFPLASG ETKSIGNSKI ERNNNKYTIT YAGNDGIIDI DDTTLTAFDW
SSYINIYISD SARMQGLLGN NDGNPNNDFA LRDGTQLPNN LTVRQIHQQY GESWRVKEGE
SLFKNPATVI DLPEKFISLD DFPQDEVAAA KAKVKKAGIT DENRVNAVAF DILATEDESF
LNSAVEMFNS LDDNSAPTDI KLDNNTIDEN VTPGATVGKF STTDPDNEDS FTYALVEGVG
DRDNAGFNVD GDQLKINGSP DYETQSSYSI RVKTTDGKGA SYEEQLTINV NDLDDKAPTA
ATFTPTDNAT DIAIGTNLVI NFDENIQAGT GNIIIKQFSD NSVVETINVT SSLVSISNNT
LTINPTADLA ERTKYYVEVE ADAIQDTSGN NYLGIKNNST WNFTTINNSP VQFDFNGDGV
ADILWRKKID SPANAENQIW FMNDDGTVNN SAPLKSNYST WGVAGVGDFN ADQVPDILWR
NKYKRNEIWF MNDDGTFNSR ARLKRRGSSW SVGGVADFNA DQATDILWRN KYGYNEIWFM
NDEGALNHRA RSLGPDSSWD VAGVGDFNAD QVADILWRDQ NENNMIWMMN DDGTVNNSAR
PDSLNSSWDV VGVANFNADQ VADILWRDEK GSSQIWLMND QGKVQNSISL GSYDSPWNVK
GMPDLNGDGV ADILWRNENN GANHIWLMND DGTRNQIVDP GSLDSTWDIV GM