Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3191 |
Symbol | |
ID | 4243863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4874834 |
End bp | 4875961 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638108197 |
Product | peptidase M50 |
Protein accession | YP_722788 |
Protein GI | 113476727 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00586872 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATGGAT CTTTTCGTGT CGGCAACCTA TTTGGCATAC CATTTTACAT TAACTCATCC TGGTTTATAG TCTTAGGTCT CCTTACTTTA ACTTATGGTA ACGACCTAGC AACTCAATTT TCTCAAGAAT TGGGTAATAC CTTACCTTGG ATACTAGGAT TAATAACAGC ATTATTATTA TTTTCCTCTG TCTTAGCCCA TGAGTTAGGG CATAGTTTTG TTGCTCTATA TCAGGGAATA AAAGTAAAAT CAATTACCCT ATTTCTCTTC GGAGGTTTAG CTAGTTTAGA TAGAGAATCT AAGACTCCTA TAGAAGCATT TTTGGTAGCA ATTGCTGGCC CTTTAGTGAG TATATTATTA TGCGGTTTCT TTGTATCAAT TAATATATTT ACATCCATTA CTGGACCAGC AGAATCCATT GTTCAACTTT TAGCTTATAT CAACTTATTC CTAGCATTAT TTAACCTAAT TCCAGGTTTA CCACTTGATG GTGGTAATAT CCTTAAATCT ATTGTTTGGA AAATCACTAA TAACCCTTAT AAAGGAATTA TTTTTGCAAG TAGAGTAGGT CAAGTATTTG GTTGTTTAGC AATAATTTCT GGTTTAATTC CCGCATTTTT ATTTAGTAGA ATTCCTAATT TTTGGAATAT TCTCATTGGT TGGTTTCTAC TACAAAATGC TGGTCGCTCT GCCCAATATG GAGAAATTCA AGGTATGCTT GCTGATTTAA ATGCAGTAGA TGCTATTATT CCTGATAATC CAATTGTATC AAACAATCTC TCTTTACGAG AATTTGTGAA TGAATATGTT ATTGGGAAAG AAGCTAGAAA GAAGTTTTTA GTGATAAATG AAATGGGGCA GTTTGTAGGA GTAATTAACG TTGATGATTT AAAAATAGTT AATACATCCC AATGGCCTTT GGTTCAGGTA AAAACATTAA CAAAACCTTT AGCAAAGATA GAGACTGTAA CCGCTAAAAC TTCTTTATTA GAAGTAATTT CTTTGTTAGA GCAAAAGCAA ATCAGTGAAC TAACTGTTAT TGATGAAAAT GGGATCTTAG TTGGGTCTAT TGAAAAAGCT TCAATTAGGC GTTTGTTAAC AAGAAAGGAG CAAGCTAAAA CTAATTAA
|
Protein sequence | MNGSFRVGNL FGIPFYINSS WFIVLGLLTL TYGNDLATQF SQELGNTLPW ILGLITALLL FSSVLAHELG HSFVALYQGI KVKSITLFLF GGLASLDRES KTPIEAFLVA IAGPLVSILL CGFFVSINIF TSITGPAESI VQLLAYINLF LALFNLIPGL PLDGGNILKS IVWKITNNPY KGIIFASRVG QVFGCLAIIS GLIPAFLFSR IPNFWNILIG WFLLQNAGRS AQYGEIQGML ADLNAVDAII PDNPIVSNNL SLREFVNEYV IGKEARKKFL VINEMGQFVG VINVDDLKIV NTSQWPLVQV KTLTKPLAKI ETVTAKTSLL EVISLLEQKQ ISELTVIDEN GILVGSIEKA SIRRLLTRKE QAKTN
|
| |