Gene Information |
Locus tag | Tery_3922 |
Symbol | |
ID | 4244005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6057969 |
End bp | 6059126 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638108845 |
Product | peptidase M50 |
Protein accession | YP_723427 |
Protein GI | 113477366 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
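The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(6057969, 6059126)
print(length)                  # 1158
print(protein_length(length))  # 385
```

Both results match the Gene Length (1158 bp) and Protein Length (385 aa) fields in the record.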
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.558191 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGCAG GTTGGCGAAT TGGAAGTTTA TTTGGTATTT CTTTATTGTT GGATTATTCC TGGTTTATAA TTCTAATTTT GGCAGCATAT TTTCATGGTC AATATTACCA ACAGGAATGG GGGAGTTTTT TGGCTTGGGG TGCGGGATTG GTAATTGCTA TATTACTATT TTGCTCTGTA GTTTTACATG AATTGGGTCA TAGTTTAGTT ACTATTTCTC AAGGGATAAA GATTAATTCT ATAAGGTTGT TTCTATTTGG TGGAGTTGCT TTAAGAGAGA GGGATTATAG GAGTCCTGGA GAGGCTTTTC AGGTGGCGAT CGCTGGACCT TTGGTGAGTT TGGTTTTGTT CTTTTTATTA GGTGTAATAA GTTTACTATT TCCAACATCA AGTTTAATTG GGGAGTTAAT TAATAGGGTA GCAGAAATAA ATTTGATTTT AGGGGTTTTT AATATAATTC CTGGGTTACC TTTGGATGGG GGACAAATAT TAAAGGCGGT TGTTTGGAAA ATAACTGGTA GTCGTTTTAC TGGTATAAGA TGGGCTGCTA AGGGGGGTAA GGGTTTAGGA TGGTTTGGGG TTGGTTTGGG GTTGATAATA GTTTTTATGA CTAGAGATTA CAATGTTTTT TTGATGGCTT TAATTGGCTG GTTTGCTTTG CGTAATGCTA GAATTTATCA GTACATGACT GATTTAAAGT CGACCTTAAT TCATATTAAG GCTGTAGAGG TAATGACTAG AAACTTTCGT GTAGTAGATG CAGATTTAAC TCTGAGTCAG TTTCTGAGGA AATACCGTTT AAAAAGTTCT AAGTTTTCAA CTTATTTTGC TGCTAGTATG GGTCGTTATG TTGGTTTTGT TTCTGCTGAT GCTATTCCCT ATATTGAAAA AAGTTATCGG GATACTCAAA CTTTAAGGAT GATTATTTGC CCTCTAAGTC ATATGGTTAC TGTGTCAGAA AAGGTGAGTT TATTAGAGGT TCTTAAGAAG ATGGAATGTC ATCAACAAAA GCAAATAACT GTGCTTTCTC CTGCTGGAAC AGTGGCGGGA ATAATTGATC AAGGTGATAT TGTTCGAGGA ATGGCTAAAT ACTTGAAGTT GAATATTTCA GAGGCGGAAA TTAAGCTTGT GAAAACGGAG GGGGTGAGGA GGGGATAG
|
Protein sequence | MQAGWRIGSL FGISLLLDYS WFIILILAAY FHGQYYQQEW GSFLAWGAGL VIAILLFCSV VLHELGHSLV TISQGIKINS IRLFLFGGVA LRERDYRSPG EAFQVAIAGP LVSLVLFFLL GVISLLFPTS SLIGELINRV AEINLILGVF NIIPGLPLDG GQILKAVVWK ITGSRFTGIR WAAKGGKGLG WFGVGLGLII VFMTRDYNVF LMALIGWFAL RNARIYQYMT DLKSTLIHIK AVEVMTRNFR VVDADLTLSQ FLRKYRLKSS KFSTYFAASM GRYVGFVSAD AIPYIEKSYR DTQTLRMIIC PLSHMVTVSE KVSLLEVLKK MECHQQKQIT VLSPAGTVAG IIDQGDIVRG MAKYLKLNIS EAEIKLVKTE GVRRG
|
| |
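The gene and protein sequences above can be related to each other with a short translation routine. This is a minimal sketch in plain Python, not the method used by the database: it assumes the standard codon assignments (which translation table 11 shares with the standard code for internal codons; the alternative start codons that table 11 permits are not handled), and the names `clean`, `translate`, and `gc_percent` are hypothetical helpers. Whitespace is stripped first because the record prints sequences in blocks of ten.

```python
# Standard codon assignments, listed in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 18 bases of the gene sequence in this record.
print(translate("ATGCAGGCAG GTTGGCGA"))  # MQAGWR
```

The output agrees with the first six residues of the protein sequence in the record. Passing the full Gene sequence field to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.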