Gene Information |
Locus tag | Tery_4416 |
Symbol | |
ID | 4246069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6801445 |
End bp | 6802677 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638109300 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_723877 |
Protein GI | 113477816 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
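The coordinate and length fields in this table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 6801445
end_bp = 6802677

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values agree with the Gene Length (1233 bp) and Protein Length (410 aa) fields.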
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.464298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTT CAGATCAAAA AAATCCAGCT AGACCTCAAA TAGGTGTTTA TGTAGTCGCT ATAGCTGTAA GTACAGGTTT GACTTTAACT GCCATACGCG CTTTTCCTAG AATATTTCTA CCCACAGATA ACAGGGAAAC GAGTCAAAAT AAACCTCAAA GTCAGCTAGT AGTAAATACA AAAGTTCCTC AGATAGCACA AGTACCAATA AAAGCTGATA GTTTTGTCGC CACTGCGGTT GAGAAAGTAG GACCGGCTGT CGTACGCATA GATACAGAAC GTACAGTAGC GCGTAATACA CCCAATTTTT TTAATGACCC ATTTTTCCGT CGCTTTTTTG GAAATGATAG TTTTTCCCAA GTTCCTAAGA AGTTTCAACA ACAGGGACAA GGCTCTGGTT TTATTACTGA TAGTAGTGGT ATTATTTTGA CTAATGCCCA TGTTATTAAA GGTGCAGATT CAGTTACAGT TAAGCTTAAA GATGGGCGGA GTTTTGAGGG AGAAGTAAGA GGTCTTGATG AACCTTCTGA TTTAGCAGTG ATCAAAATTG ATGGGGAAAA TTTACCTGTT GCATTTTTAG GAAATTCTGC TCGGGTCAAA GTCGGCGACT GGGCGATCGC TGTAGGAAAT CCCCTGGGGT TAGATAATAC GGTAACTTTG GGTATTGTTA GTTCTCTAAA CCGCGCTAGT TCGGAAGTTG GTATCCCTGA TAAACGTCTT GATTTTATTC AAACTGATGC TGCTATTAAT CCTGGTAACT CTGGAGGTCC TTTGGTAAAT TCTCAGGGAG AAGTTATTGG TATTAATACA GCTATTCGTG CTGATGGGCA AGGTATCGGA TTTGCTATAC CTATAGATGA GGCAAAGGTG ATTCAAGAAA AGTTAGTTAA AGGTGAAAGT ATACCTCGTC CTTATATTGG GGTGCGGATG GTTACTTTGA CTCCAGAAAT TATTGAAAAA ATTAATAAAA ATCCCAATTC CTCAATACAG TTGCCTGAGA CTGATGGTGT TTTAATCGCA CAAGTAATTT CTAATAGTCC AGCAGCTAAA GGGGGTTTAC GACTTGGGGA TGTGGTTACA GAAATTGATG GTCAAAAAAT TGCTACTGCT GAAGAATTAC AGAGTATAGT TCAGAAAGGT CAAATTGGTA AACCTCTAAA TATTACGGTA AAACGTGGTA AAGAGACTCA AACGTTTTCT GTGAGTCCAC AAGAATTACA GGATGCTAAT TAA
|
Protein sequence | MKISDQKNPA RPQIGVYVVA IAVSTGLTLT AIRAFPRIFL PTDNRETSQN KPQSQLVVNT KVPQIAQVPI KADSFVATAV EKVGPAVVRI DTERTVARNT PNFFNDPFFR RFFGNDSFSQ VPKKFQQQGQ GSGFITDSSG IILTNAHVIK GADSVTVKLK DGRSFEGEVR GLDEPSDLAV IKIDGENLPV AFLGNSARVK VGDWAIAVGN PLGLDNTVTL GIVSSLNRAS SEVGIPDKRL DFIQTDAAIN PGNSGGPLVN SQGEVIGINT AIRADGQGIG FAIPIDEAKV IQEKLVKGES IPRPYIGVRM VTLTPEIIEK INKNPNSSIQ LPETDGVLIA QVISNSPAAK GGLRLGDVVT EIDGQKIATA EELQSIVQKG QIGKPLNITV KRGKETQTFS VSPQELQDAN
|
| |
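The sequences above are printed in space-separated blocks of 10 characters, so the spaces need to be removed before any analysis. The following is a minimal sketch of how that could be done and how a GC percentage could be computed for comparison with the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo uses only the first three blocks of the gene sequence, so its result is not the whole-gene value.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, used only as a short demo input.
fragment = clean_sequence("ATGAAAATTT CAGATCAAAA AAATCCAGCT")
print(len(fragment), fragment[:3], round(gc_percent(fragment), 1))
```

Passing the full Gene sequence field through the same two helpers would give a figure to compare against the reported GC content.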