Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4572 |
Symbol | |
ID | 4246226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 7033113 |
End bp | 7036100 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638109445 |
Product | peptidase M23B |
Protein accession | YP_724021 |
Protein GI | 113477960 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.109216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTGATA TTTTAGGAGA CTTTAGCAAC TTTGAACAAG ACTTGAGCAA CCTAATATTA ATTGGTGATA AGCAAGAAAA TTTCGATGAT GGGCAGAGAT GGACTAGTGG CATTGAGCCA ACTGGGAATA ATTCTAAACT AAAAGTCGAA GAATTCAATA CAGGTGGCGA TGAACCATCA CAGCCCCTTC AGGGACAAAA ATTCTTTGAT TTAGGTGAAC TTAACCATTT ATCTGCCTCA AATTCTCCAC CGGGGAAGAA AGACCCACTA GTAGGTGAGG ACAATGAAGC AGTTGTCAAA AAAAGTGACA ATTTAATAAA TCCAAATCCA ATTAACCGTC GAAGCGGTAA CAGTAGGAAT AGGGCTGATA ATATTGGAAC TCTCAGCAGC AGTAGTAGTT TCACTGGTTT TGTTGGAACA ACAGATACCA ACGACTATTA TCGCTTTTAT CTGAGTGGGG AAAGGGAGTT TAACCTCACT CTCAACGGCT TAAGCGGTGA TGCGGACGTA CGATTACTCA ATAGTAGTGG TGGTACTATT AGTAGTTCTA CCAAGGGTGG TAGTAGTTCC GAGAGCATCA GTGAAACTCT CAATTCGGGT ACATATTATA TTAGGGTTTA TCCAATGAGT GGGGTGAATA CTAATTACAA TCTCAATATA GAAGCAACTT CATCTTCATC TGATGAAGAG GTAAATATTA CTTCCCCAAG CAGCAGGACC AGTATTGAAC CGGGAGAAAG GTATAATATT CGCTGGACCG ACAACTTTAG GGATAATGTC AAACTGGAAT TGTACAAAGG AAGTTCCCGG CAACAAACAA TTGCTCGTTC CACCTCCAGC GATGGCAGTT ACTCTTGGAG GGCGCCCACA TCTTTAAGCA GCGGTACTAA TTACAGAATC AAGATTCGCA ATGTGAACGA TAGTAGTGTT TACGACTACA GTAGTTATTT CACTATTGAA CCAGATGAAC CAGATGGAGT GGTAAATATT ACTTCCCCAA GCAGCAGTAC CAGTATTGAA CCGGGAGAAA GGTATAATAT TCGCTGGACC GACAACTTTA GGGATAATGT CAAACTGGAA TTGTACAAAG GAAGTTCCCG GCAACAAACA ATTGCTCGTT CCACCTCCAG CGATGGCAGT TACTCTTGGA GGGCGCCCAC ATCTTTAAGC AGCGGTACTA ATTACAGAAT CAAGATTCGC AATGTGAACG ATAGTAGTGT TTACGACTAC AGTAGTTATT TCACTATTGA ACCAGATGAA CCAGATGAAA AGGTAAATAT CACTTCCCCA AGCAGCAGTA CCAGTATTGA ACCGGGAGAA AGGTATAATA TTCGCTGGAC CGACAACTTT AGGGATAATG TCAAACTGGA ATTGTACAAA GGAAGTTCCC GGCAACGAAC AATTGCTCGT TCCACCTCCA GCGATGGCAG TTACTCTTGG AGGGCGCCCA CATCTTTAAG CAGCGGTACT AATTACAGAA TCAAGATTCG CAATGTGAAC GATAGTAGTG TTTACGACTA TAGTAGTTAT TTCACTATTA AACCAGATGA AGAGGTAAAT ATTACTTCCC CAAGCAGCAG TACCAGTATT GAACCGGGAG AAAGCTATAC TATTCGCTGG ACCGACAACT TCAGCGATAA TGTCAAACTG GACTTGTACA AAGGCAGTTC CTGGCAACAA ACAATTGCTA GTTCCACCTC CAGCGATGGC AGTTACTCTT GGAGGGTGCC TACATCTTTA AGCAGCGGTA CTAATTACAA TATCAAAATT CGCAATGTGA ACGATAGTAG TGTTGACGAC TACAGTAATA GTTTCACTAT CCAAAGCACA ACATGGCCTC CAAGCGTTAC CAAGGATTTG AAAACCTATA CTGGTCGAGA AGAGTACAGT GGCTATGTAG GCAACGATGA CTACTACAAA TTTTCTGTTG ACTCCCCTGG ATACCTACAA TTTGCACTGC GGGGAATGAG TGCAGATGCT AACTTGCAAT TGTTGAATTC TAGCGGTAAA GTCTTAGAAA GTTCCAGCAA ATCAGGCAAT AGTGATGAAT ATGCAAATGA AAACCTCGGC ATTGGCACTT ATTATATGCG GGTATACGGC CATAATGGTG CCGATACCAA CTATCGCTTA GTACTGAACC TCGACAAAGC AAAAAATGAC AGGAGTAATG CTCGTTGGCT CGGTGAGCTA GCAGGACAGC GTAAAGAATA CAAGGACTTT ATAGGAACCA GCTCAGGGGA TCAGTATGAT TACTATAAAT TTACTGTACA AGAACCTCGT TTCTTGGAAT ATGCGCTTCG AGACTTGACA GCTCCTGCTG ACATCGATAT ACTCAATTCC AGTGGCGCTC GAATTACTCC TAAAGAAAAT GATAAAGATA ATCATCGATA TAACCTGCAC GAAACTATGG GGTTACAAGC TGGTACGTAT TACGCGAGAG TTACGGCACC AACTGACTCT AGTCAGCAAA CTAACTATAA ATTAGTGCTG AATCTTAAAG GTGAGTATAA GCCCTTACAC TCTAACCCAA CAGAATCTAA CCCCCTGAAG GGATTCCAAT CTCCTGTCAG AGGTGAGAGA TGGTATGTTT CACAGTCACC TGGTGGCAGT TATAGTCATA CTGGCAATTT GCGTTACGCT ATAGATATCA GTATTCCAGG CTGGGATGAT TTTGGTGAAC CAATATATGC CATGCGTTCA GGAACTGTTA AAAAAGTGGT CGATGATCAT CCAGATATCG CCGATTCTAA GCGAAATAAC TTAGTGGAAA TCCAACACGA AAATGGCTAT GTTGCTAGGT ATTGGCATCT TCAGCAATAT TCTAATTCAG ATGCAGGGTT ACAAGTCGGT CAAAAGGTGG ATGCAGGTCA AATGATAGGT CGAGTAGGAA ACTCTGGTTT CAGTACTGGT CCTCATTTGC ATGTCGATGT TGTAGATTCT AGCTTGATAA CAAGGCCATT TGAGATAGAA GGAATTTTTG ATTACTAA
|
Protein sequence | MSDILGDFSN FEQDLSNLIL IGDKQENFDD GQRWTSGIEP TGNNSKLKVE EFNTGGDEPS QPLQGQKFFD LGELNHLSAS NSPPGKKDPL VGEDNEAVVK KSDNLINPNP INRRSGNSRN RADNIGTLSS SSSFTGFVGT TDTNDYYRFY LSGEREFNLT LNGLSGDADV RLLNSSGGTI SSSTKGGSSS ESISETLNSG TYYIRVYPMS GVNTNYNLNI EATSSSSDEE VNITSPSSRT SIEPGERYNI RWTDNFRDNV KLELYKGSSR QQTIARSTSS DGSYSWRAPT SLSSGTNYRI KIRNVNDSSV YDYSSYFTIE PDEPDGVVNI TSPSSSTSIE PGERYNIRWT DNFRDNVKLE LYKGSSRQQT IARSTSSDGS YSWRAPTSLS SGTNYRIKIR NVNDSSVYDY SSYFTIEPDE PDEKVNITSP SSSTSIEPGE RYNIRWTDNF RDNVKLELYK GSSRQRTIAR STSSDGSYSW RAPTSLSSGT NYRIKIRNVN DSSVYDYSSY FTIKPDEEVN ITSPSSSTSI EPGESYTIRW TDNFSDNVKL DLYKGSSWQQ TIASSTSSDG SYSWRVPTSL SSGTNYNIKI RNVNDSSVDD YSNSFTIQST TWPPSVTKDL KTYTGREEYS GYVGNDDYYK FSVDSPGYLQ FALRGMSADA NLQLLNSSGK VLESSSKSGN SDEYANENLG IGTYYMRVYG HNGADTNYRL VLNLDKAKND RSNARWLGEL AGQRKEYKDF IGTSSGDQYD YYKFTVQEPR FLEYALRDLT APADIDILNS SGARITPKEN DKDNHRYNLH ETMGLQAGTY YARVTAPTDS SQQTNYKLVL NLKGEYKPLH SNPTESNPLK GFQSPVRGER WYVSQSPGGS YSHTGNLRYA IDISIPGWDD FGEPIYAMRS GTVKKVVDDH PDIADSKRNN LVEIQHENGY VARYWHLQQY SNSDAGLQVG QKVDAGQMIG RVGNSGFSTG PHLHVDVVDS SLITRPFEIE GIFDY
|
| |