Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0488 |
Symbol | |
ID | 4241700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 781534 |
End bp | 783267 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638105803 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_720417 |
Protein GI | 113474356 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.718847 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.491604 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAATTC TGAAAAAAAT GAATGATGAA AAAACTGTGA TTGGTTCTAA CTATTCTGGT ATTTCTTATC TATTGGATAA TAATCATAAT CAATTTACTG TACCTGTAAA TTTACTAATT CCATATTCAG CAGGTCTACA AGCTTTAGAT GGAAATGATA CAATAATAGG CTCATTAAGC CCTGAATTGA TTAATGGTAA TCAAGGTAAC GATAATATTT TTGGTGGAAG TGGTTCTGAT ACTTTACGAG GTGGTAGAGG TAATGATTTT ATTGAAGCTG ACCAAGGTAA CGATCAAGTT TTTGGAGATT TAGGTAAGGA TACAGTTTAT GGAGAAATCG GAAATGATCA AATTTATGGA GGGAAAGGAG AGGATATTTT ATTCGGAGGC AATGGTAATG ATACAATTTA TGGGGATTTA GGAAAAGATA CATTAATTGG AGAAGCAGGC AATGATATAT TTGTTTTGCG AGATTCACCA AATAATAACA ATTTAGATAC TGCAGATATT ATTTATGATT TTAATCCTAA TTTTGACAGT ATTCAAATGC CAGCAAACTT AACAGAAAGT GACATTCTAT TAAGAGAAGA TTTTTATTAT GGAGGTACAT TAATTCAAGT TCAGGCAAAT GGTTCCATAT TAGCAATAGT TAAAGACATA TCTAATACAA ACGTTAAAAG TGAGTTAATT TTTGGAGATA CCGCAAATAC TAATGAACTT TTACAAACGA ATAGTTCTGT AAGACCAACC TTTAATAATA TTTTTGGATA TGGTTTAGTC GATGCCTCAG CAGCAGTAGC CAGTGCTATT GGTAGTACTT CCTTTCCAGA AGTTCCTGAT TTAGGAGGAA ATCAGTGGGG ACTAGACTTG GTTAAAGCAC CCGAAGTTTG GAATCAAGGC TTTCTGGGAG ATGGTATTGT AGTAGCCGTT ATTGATAGTG GTGTAGACTA TACCCATCCA GAATTAACAG GCCAAATTTG GAAGAATAGC CGTGAAATTC CTAACAATAA TATTGATGAT GATGCTAATG GCTATGTGGA TGATTTTCAG GGTTGGGATT TTATCAATGA TGATAATGAC TCAAGAGATG AAAAAGGTCA TGGAACTCAT ATTGCAGGCA CTATAGCTGC CAAGAGAGAT GGGATAGGGA CAACTGGTAT AGCTCCAAAT GTCCAAATTA TGCCTCTCAG GATACTTAAT GATCAAGGAA CAGGTAAAGT TAGCGATGGT ATAGAGGCTA TTCGTTATGC TGTTGATAAT GGAGCAGATG TGATTAACTT TAGCTCTGGT GATAGAAATT TAGTTAGTGG GGAAATTGAA GCTATTCGTT ATGCTGCTGA ACGAGGTGTT GTATTTGTTT CTGCTGCAGG TAATGGTAGT TTAAGTAGTC CTGATTATCC AGCAAAGTTA GCTGATAAAC AGGGAATTGC GGTTGGGTCA GTAGAGAAAA ATGGGAAATT TTCTTCTTTT TCCAATGAAG CTGGAAACCA ACCTTTAGAT TATGTCGTTG CTCCAGGGGG GGATGGTTTT CCTGAAGATG CAGGAGATAT CTATGCCCCT GTACCTCTTT CTATAAAAGG TAATTTATAT AGTTTCTTGA CAGGTACTTC AATGGCTACA CCTTATGTTA CAGGTATAGT AGCTTTAATT AAACAAGCTA ATCCAAGTTT GTCTGTTGAG GCCATTGAAA ATATAATTAC TTATACTACT AACTCAGCAG ATGTGATTGT CTAA
|
Protein sequence | MLILKKMNDE KTVIGSNYSG ISYLLDNNHN QFTVPVNLLI PYSAGLQALD GNDTIIGSLS PELINGNQGN DNIFGGSGSD TLRGGRGNDF IEADQGNDQV FGDLGKDTVY GEIGNDQIYG GKGEDILFGG NGNDTIYGDL GKDTLIGEAG NDIFVLRDSP NNNNLDTADI IYDFNPNFDS IQMPANLTES DILLREDFYY GGTLIQVQAN GSILAIVKDI SNTNVKSELI FGDTANTNEL LQTNSSVRPT FNNIFGYGLV DASAAVASAI GSTSFPEVPD LGGNQWGLDL VKAPEVWNQG FLGDGIVVAV IDSGVDYTHP ELTGQIWKNS REIPNNNIDD DANGYVDDFQ GWDFINDDND SRDEKGHGTH IAGTIAAKRD GIGTTGIAPN VQIMPLRILN DQGTGKVSDG IEAIRYAVDN GADVINFSSG DRNLVSGEIE AIRYAAERGV VFVSAAGNGS LSSPDYPAKL ADKQGIAVGS VEKNGKFSSF SNEAGNQPLD YVVAPGGDGF PEDAGDIYAP VPLSIKGNLY SFLTGTSMAT PYVTGIVALI KQANPSLSVE AIENIITYTT NSADVIV
|
| |