Gene Information |
Locus tag | Tery_1179 |
Symbol | |
ID | 4244063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1844523 |
End bp | 1847486 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638106398 |
Product | peptidase M16C associated |
Protein accession | YP_721010 |
Protein GI | 113474949 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0156529 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTTAT TAACAGAAAG AACAATTGAA GTTGGACAAA AACTACAAGG CTTTGAAGTC AAAGCTATTA CTGACCTGAA GCAACAGCGA ATGGTGGCAT ATCAGCTAGA ACATATCAAA ACAGGTGCAA AACTGTTACA CCTATATTCA GAAGATGCTG AAAATCTCTT TTCAATTAGT TTTCCCACTC CCCCTCCAAA TAGCACAGGA GTATCCCATA TCCTAGAACA TTCTGTATTA GCAGGTTCAA AAAAATATCC TGTACGCGAA CCATTTTTTG AAATGCTAAA AATGAGTCCA GCAACCTTCA TTAATGCCAT GACTGGCCCT GACTGCACTT ACTATCCAGT TTCCAGTAAA GTTAAACAAG ACTTATTCAA CTTGGCAGAA GTATACTTTG ACGCTGTATT TCACCCACTG CTAACCGAAA ATACGTTTAA ACGAGAAGGA CACCACTTAG CTCCTACCGA CAAGGAAAAC CCAACAGGGG AATTAAAATT TACTGGGGTA GTTTACAATG AAATGAATGG AGCTTTTTCT GACCCTGAAC AAAGGTTAGA TAGTATTGCT AACCAAAGTC TATTTCCTGA TAACATTTAT GGTCTTGAAT CTGGAGGGAA CCCCCAAAAT ATTCCTGAAC TCACTTACAA AGACTTCCGT GACTTCCATT CCAGCTATTA TCATCCCAGT AATGCTTACT TTGTTTTTTA CGGCAATATT TCTACTCCTG AATATCTTGA ATTTTTAGCC AAAAAGCTAG AAGCTTTTGA AAGACAGAAA CCTAATATCA ATATTAATCC TCAATCTCGC TGGAGTGAAC CTCGTTTTAA AGAAGATTCC TACCCTATTA GTGCAGCGGA TGAAACAACA CAAAAAACTT ACATAATGAT CAAATGGTTG GTGGGAGATA GCACTGACTC TGAGGAATGG GTAGCTTTAG ATATTCTCAG TCGCATCTTG TTGGGAAATG AAGCGGCACC ATTGAAAAAG GCGATTGTTG AATCCCAAAT TGGTCAAGAT CTACTCGGCT CTGGAGTAGA TTCTGTGGGT AAGGAGGTTA CTTTTCATCT GGGTATTCAA GGAAGCGAAC CTAACCAGGG TGAAGCATTC AGTCAGTTAG TAATCAAGAC TTTAAAAGAA ATTGCTGAAG AGGACATTGA ACCTAGTATT GTTGAGGCAG CGTTTCAACA GGCAATTTAC CAACACCAGG AAATAGGTAG TATGTATCCT TTACGGATGT TGTTTCGGGT GATGCAAACC TGGATTTATA GTAATGATCC ATTGAAGTTT TTACATATTA GCGATCGCCT CGCCGAATGC AAACAACGTT ACCTGGAAAA ACCTAGATAT TTTAATAACC TAATTCGTGA AAAATTACTC AATAACCCCC ACCGTTTGAC ACTGGTATTA AAACCAGATA AAGAATGGCA ATCTAATTAT GACAAAGCAG TAGTAGCACA GGTGGAACAA GTACGTTCTC AATTAACTTC AGAAGAATTA GAACGCATAG CTACTGAAGC AACAGAATTA GAAATAGAGT CAGGAACTCC TAATTCTCCT GAAGAAATAG CTAAACTGCC CCAACTACAA GTGAAGGACT TACCTGACAA ACCAGAACAT ATTCCTACTG ATGTCGAAGA ACTTGATGGC CAAGTGACAC TTTTAAGGAA TCATGTGCTA GCAAATGGAG TAAATTATTT ACAACTAGAT TTCAGTTTGC GTGGGTTGCC TGAAGATTTG TGGTTGTATC TATCTATTTA TATAGATGCA CTGCGGAAGT TGGGTGCAGG AGAAATGAAC 
TACGAACAGG TGGCTCGCGG CATTGCTTCT TATACCGGGG GAATTAGTTT TCAATCTCTG TTACGGACTT CTACTAAAGA TGCCTATCAT TCTGTGCGTG GACTTCGCGT CACCATTAAA ACCCTAGATG AACAGATTGA ACCAGCATTA GAGTTGTTAC ACAACATGAT TTTTGCAGTT AATCCTCGGG ATACAGCTCG ACTACGGGAA GTGATGATTC AGTCTTATTC TCAATCTAAT TCAGATTTGA TTTATAACGG TATCTACACT GCTATATTGC GAGCTAGTGC TGGTATGACA TCAGAAGCTA AGATTAGTGA GATTGTTAAC GGTTTACCTC AACTGGAGTT GTTGAAAAAA GTATGCGATC GATTTGATGA ACATGGGGCA AATTTGATGA GCAAGGTTGA AACTATCCGG GATTATGTAG CCAATCAACC TTTGACTGCT AGTTTTACTG GTTCAGATAA TGCTTATAAT GTGGTCAAGA AGACTCTCTC GGAGTGGGGT CATCAGCAAA AGCAACAAGA AGGAGATACT TTTGGTAGTC GCTTTGAACC AGTTTATAAT ATGCGGGAAG CTTTAGCAGG TCCAGTACAG GTGGCTTATT GCGTTCAGAC TATGCCAGCT CCTCACTTTA GTGATGAAAG AGCGCCATTT TTAAGGTTAG GTACTCATTT ATTAGGTTTG GGTTATCTAT TTACAGAAGT TCGTCTTAAG GGTAATGCTT ACGGTGCAGG ATGTCGTTAT AGTGGTTTAG GAAAAGTTAT TTCTCTCTAT TCTTATCGCG ATCCTCATGT CAGTCGTACT CTTGATGTAT TTGCTGGTTT GATAGATTAT CTTAAGGATG TAGATTGGAC TCAGATTGAT GTTGACCGGG CAATTATTGC TACAATTCAA GATGATTCTC CAGTTTTGCG CCCAGAAGTA GCCACAAGCT TAGCTTTGGA ACGTCATTTG ATAGCGCAAA CTGCTGAACT TAGGGAGGAA CGTTATCAGC GAACGCTCAA AGCTACAGTT GCAGATGTGA AAGAAACTTT GTTAGATGTT TTTACTGCGG GGATGGAACG TAGTAATGTT TGCGTAATGT CTTCTCGCGA AAAATTGGAA GAAGCAAACC GTTCTCGGGA GGCGGATCCA TTGACAATTT CTGATATTAT GTAA
|
Protein sequence | MPLLTERTIE VGQKLQGFEV KAITDLKQQR MVAYQLEHIK TGAKLLHLYS EDAENLFSIS FPTPPPNSTG VSHILEHSVL AGSKKYPVRE PFFEMLKMSP ATFINAMTGP DCTYYPVSSK VKQDLFNLAE VYFDAVFHPL LTENTFKREG HHLAPTDKEN PTGELKFTGV VYNEMNGAFS DPEQRLDSIA NQSLFPDNIY GLESGGNPQN IPELTYKDFR DFHSSYYHPS NAYFVFYGNI STPEYLEFLA KKLEAFERQK PNININPQSR WSEPRFKEDS YPISAADETT QKTYIMIKWL VGDSTDSEEW VALDILSRIL LGNEAAPLKK AIVESQIGQD LLGSGVDSVG KEVTFHLGIQ GSEPNQGEAF SQLVIKTLKE IAEEDIEPSI VEAAFQQAIY QHQEIGSMYP LRMLFRVMQT WIYSNDPLKF LHISDRLAEC KQRYLEKPRY FNNLIREKLL NNPHRLTLVL KPDKEWQSNY DKAVVAQVEQ VRSQLTSEEL ERIATEATEL EIESGTPNSP EEIAKLPQLQ VKDLPDKPEH IPTDVEELDG QVTLLRNHVL ANGVNYLQLD FSLRGLPEDL WLYLSIYIDA LRKLGAGEMN YEQVARGIAS YTGGISFQSL LRTSTKDAYH SVRGLRVTIK TLDEQIEPAL ELLHNMIFAV NPRDTARLRE VMIQSYSQSN SDLIYNGIYT AILRASAGMT SEAKISEIVN GLPQLELLKK VCDRFDEHGA NLMSKVETIR DYVANQPLTA SFTGSDNAYN VVKKTLSEWG HQQKQQEGDT FGSRFEPVYN MREALAGPVQ VAYCVQTMPA PHFSDERAPF LRLGTHLLGL GYLFTEVRLK GNAYGAGCRY SGLGKVISLY SYRDPHVSRT LDVFAGLIDY LKDVDWTQID VDRAIIATIQ DDSPVLRPEV ATSLALERHL IAQTAELREE RYQRTLKATV ADVKETLLDV FTAGMERSNV CVMSSREKLE EANRSREADP LTISDIM
|
| |
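The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1844523
END_BP = 1847486
GENE_LENGTH_BP = 2964
PROTEIN_LENGTH_AA = 987


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the derived values with the ones reported in the record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported Gene Length and Protein Length agree with the coordinates: the span covers 2964 bp, which is 988 codons, one of which is the stop codon.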