Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1624 |
Symbol | |
ID | 4242408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 2481037 |
End bp | 2482254 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638106765 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_721375 |
Protein GI | 113475314 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTAT CTTTTAAACA ATTAACTTTG TATTTTTCTC TGTTGTCCAT TGGTACTGCT ACGGGATGGT TGGGTCATCA TTATCTCGAA GCTAATAAAT GGTCAAATGA CTCTGATGTA ATATCTTCTG TAGTTAAGAA ACAAGCACAA CCATCTACTC CAAACTCTGG AAATAACCTA GTTTCCTTTT CTCATCATAA TTTTATTGCC GAAGCAGTCA AAAAAGTTGG CCCATCAGTA GTCCGTATTG ATGCAGCTAA AAAGTTAACA ACTGAAGCTC CAGAAGCTTT AAAGAATCCT CTATTGAAAC GTTTCTTTGG GGAAAATTTG CCGGTTCCAG AAGAACGAAC TAAGCGTGGT ACTGGGTCAG GGGTAATTAT TAGTTCTGAT GGCCGCTTAA TTACAAATGC TCATGTTGTT CATGGAGCAA ATACGGTTAA GGTGACATTG AAAGATGGCC GGGTATTTGA TGGTGTGGTT AAAGGGGTGG ACTCACTGAC TGATATAGCA ATAATTAAAA TTGAGGCCAC AGATTTACCA GAGGTATCTA TTGGCAAATC AGAACAATTA ATTCCTGGAC AATGGGCGAT CGCTATTGGT AATCCTTTGG GTTTGGACAA TACTGTAACA GTGGGAATTA TTAGTGCTAT TGGTCGCACC AGTTCTCAAG TAGGTATTCC AGATAAACGA GTTCGCTTTC TTCAGACAGA TGCTGCAATT AATCCTGGCA ACTCTGGTGG GCCACTTTTG AATGATCAAG GTGAAGTAAT TGGTATTAAT ACAGCTATTA GAGCGAATGC TCAGGGGTTA GGGTTTGCTA TTCCCATAGA AACTGCAAAA AGAATTGCTG ATGAATTATT TGTCTATGGG AAAATAGAGC ACCCATTTTT AGGTATTTCA ATGGTTGATT TAACTCCTGA GGTCAAGGAT GAAATTAATA GAAAACTGGA TACGAAAATT AAGGATAATC AAGGTGTAGT AATTATGAGA GTTATAGAAG ATTCTCCTGC ACAAAAAGCT GGTTTACGTC AAGGAGATGT GATTCAAAAA GTAGGGGGAG TAGTAGTGAA AAGTCCAACA GAAGTTCAAC AAGAAGTAGA AAAAAGTTTA GTAGGAAAAA ATTTGGCAGT GGAGGTAATT CGTAATCGGA AAATTGCCAA AATTTTGGTT AAACCTGATG CTTTTCCTGA ACCACTTGAG TTAGAACTAA AGGAATAG
|
Protein sequence | MALSFKQLTL YFSLLSIGTA TGWLGHHYLE ANKWSNDSDV ISSVVKKQAQ PSTPNSGNNL VSFSHHNFIA EAVKKVGPSV VRIDAAKKLT TEAPEALKNP LLKRFFGENL PVPEERTKRG TGSGVIISSD GRLITNAHVV HGANTVKVTL KDGRVFDGVV KGVDSLTDIA IIKIEATDLP EVSIGKSEQL IPGQWAIAIG NPLGLDNTVT VGIISAIGRT SSQVGIPDKR VRFLQTDAAI NPGNSGGPLL NDQGEVIGIN TAIRANAQGL GFAIPIETAK RIADELFVYG KIEHPFLGIS MVDLTPEVKD EINRKLDTKI KDNQGVVIMR VIEDSPAQKA GLRQGDVIQK VGGVVVKSPT EVQQEVEKSL VGKNLAVEVI RNRKIAKILV KPDAFPEPLE LELKE
|
| |