Gene Information |
Locus tag | Tery_2362 |
Symbol | |
ID | 4245010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 3647884 |
End bp | 3649884 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 638107455 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_722055 |
Protein GI | 113475994 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
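The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS is unspliced and ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for locus tag Tery_2362.
length = gene_length(3647884, 3649884)
print(length, protein_length(length))
```

With the values in this record, the sketch reproduces the listed Gene Length (2001 bp) and Protein Length (666 aa).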
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.814035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAA TTTCTTCAAT TCCTGGAATC AAACAACTAT GGGCAAAAAC CAAAGGTAAC TCTGAGGTTT GCGTGGCAGT GCTTGACGGT CTAGTTGACC TGAAACATCC TTGTTTTGAG GGAGCTAATT TGACTCAACT ACCAAGCTTA GTTCAAGGTC AAGCTACTCC TCAAAGCGAG ATGTCTCTCC ATGGGACTCA TGTGGCCAGT ATAATTTTTG GTCAGCCAAA CTCAAGCGTC TCTGGTATTG CTCCCCACTG TCGAGGTTTA ATAGTTCCCA TATTCTCAGA CTATCATCGC CGAACTTCTC AGTTGAATTT AGCACGGGCC ATCGAACAAG CGGTGAATGC TGGGGCTAAT ATTATTAATA TTAGTGGGGG TGAACTGACA GATTATGGTG AAGCTGAAGA CTGGCTTAAC CGTGCGGTGA GTTTATGCCA AAATAATAAT GTTTTGCTTG TTGCTGCAGC GGGTAATGAC GGCTGTGAAT GTTTGCATGT TCCGGCAGCA CTACCCACTG TTCTAGCAGC AGGAGCCATG GGAGAAAACG GACAGCCCCT AGATTATAGT AACTGGGGCG AAAATTATCA GACTCAAGGT ATTCTGGCTC TGGGAGAAAA TATTTTAGGT GCTGAACCAG GAGGTGGTAC AAGACAGTTG AGTGGGACCA GCTTTGCCAC CCCAGTAGTG TCAGGAGTTG CTGCATTATT GATGAGTTTG CAGTTGCAAA GAGAGGAAAA ACCAGACTCA CAAAAAGTGC GTACTGCCTT ACTCAAGACA GCTGTTCCAT GTCATGCTCA AGAAAAACGC CGTTGTTTAG TTGGCCAGAT GAATATTTCA GGTGCCATTG CACATATAAC AGGAGAAACT ATGTCAGAAT CAGAACAAGA TAATAGTAAT GGTATTGAAG CTAGCTGTGG TTGTGAGTCA ACTCCAGAAG CTAGCTTGCC AGGTTCAGTT GGCCTAGAAA ATAGTCTACC GACTCCCGCC GACAATGGTG TGGTAGCTGC AGGGGTCGTT GAAGCTGGCG TCACGGCTTC ACAACCATTA TCATCAACTA ATACATCTAA TACATCTAAT AATCAAATTT CTGCTATGCC AAATAATAGT CAAAGTAACA ATAGTAATGG AATCACACCC AGTCAACCTC CTCAAGACGT GACAAATATA GTCTACGTTA TTGGTACATT GGGTTATGAC TTTGGAACTG AAGCACGGCG GGACTCATTT AAACAGTTGA TGCCAGCAGT TACTATTGGA AATACCCAAA TTCCAGCTAA CCCTTATGAT GCTCGTCAAA TGGTGGATTA TTTGGCGAAT AATCTTTCGG AAGCTAAATC TTTGATTTGG ACTTTGAATA TGGAATTGAC TCCTATCTAT GCTATTGAAG CTGTGGGGTC ATTTGCACGG GAAGTTTATG AAGCTTTGCA AGAGTTACTT GCAGGGGAAG TAGAAGCGGA AGATGCTGAG AGATATATTG AGCGGGTGAG TATTCCGGGA AAATTAACTG GGCGTACAGT TAAGTTGTTC TCAGGCCAAG TGGTGCCTGT GATTGAACCT GTTAGTCCTC GTGGTATTTA TGGCTGGCGG GTGAATACAT TGGTTGGTTC TGCTTTGGAA GCAGTTCGTG GAGAACAAGC AGAGGCTGAC GATGAGCAAA TGCGTCGGAG TTTGAGCAGT TTCCTAAATC GAGTTTATTA CGATCTACGC AATTTAGGGC AGACTTCTCA AGACCGAGCT TTAAATTTTG CAGCTACTAA TGCTTTCCAA GCGGCTCAAA CTTTCTCTAC AGCAGTGGCA 
GCAGGCATGG AGTTGGATAG TATTGCTGTG ACCAAGAGTC CATTCTGTCG GATGGATAGT GATTGTTGGG ACGTGCAGTT GAAGTTTTTC GATCCAGAAA ATAACCGTCG GGCGAAGAAG GTGTTCCGGT TTACCATTGA TGTTAGCGAT TTCATTCCGG TAACTTTGGG CGAAGTTCGT TCTTGGTCTT CTCCTTATTA G
|
Protein sequence | MPEISSIPGI KQLWAKTKGN SEVCVAVLDG LVDLKHPCFE GANLTQLPSL VQGQATPQSE MSLHGTHVAS IIFGQPNSSV SGIAPHCRGL IVPIFSDYHR RTSQLNLARA IEQAVNAGAN IINISGGELT DYGEAEDWLN RAVSLCQNNN VLLVAAAGND GCECLHVPAA LPTVLAAGAM GENGQPLDYS NWGENYQTQG ILALGENILG AEPGGGTRQL SGTSFATPVV SGVAALLMSL QLQREEKPDS QKVRTALLKT AVPCHAQEKR RCLVGQMNIS GAIAHITGET MSESEQDNSN GIEASCGCES TPEASLPGSV GLENSLPTPA DNGVVAAGVV EAGVTASQPL SSTNTSNTSN NQISAMPNNS QSNNSNGITP SQPPQDVTNI VYVIGTLGYD FGTEARRDSF KQLMPAVTIG NTQIPANPYD ARQMVDYLAN NLSEAKSLIW TLNMELTPIY AIEAVGSFAR EVYEALQELL AGEVEAEDAE RYIERVSIPG KLTGRTVKLF SGQVVPVIEP VSPRGIYGWR VNTLVGSALE AVRGEQAEAD DEQMRRSLSS FLNRVYYDLR NLGQTSQDRA LNFAATNAFQ AAQTFSTAVA AGMELDSIAV TKSPFCRMDS DCWDVQLKFF DPENNRRAKK VFRFTIDVSD FIPVTLGEVR SWSSPY
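Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before further use. The following is a minimal sketch of that cleanup, together with a simple GC fraction calculation. The helper names are hypothetical, and the example runs only on the first three blocks of the gene sequence, so its result is not expected to match the GC content reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three ten-base blocks of the gene sequence shown in this record.
head = clean_sequence("ATGCCCGAAA TTTCTTCAAT TCCTGGAATC")
print(len(head), gc_fraction(head))
```

Applying `clean_sequence` to the full Gene sequence field and then `gc_fraction` to the result would give a value comparable to the GC content field.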