Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3157 |
Symbol | |
ID | 4243828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4820301 |
End bp | 4822412 |
Gene Length | 2112 bp |
Protein Length | 703 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638108166 |
Product | prolyl oligopeptidase |
Protein accession | YP_722758 |
Protein GI | 113476697 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.385606 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000562539 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGAGA AAAAACAACC TTTAACATAT CCTATCACTG AGAAAACAGA TACAGTAGAA AATTATCACG GTGTTGAAGT TGCAGATCCT TATCGATGGT TAGAAGACCC TAACTTAGAA AAAACTAAAG AGTGGGTAAA ATCTCAAAAT GAAATTACTT TTAACTATTT GGCAGAAATT TCCGAAGGAG AAACTATCAA AAAACGACTA ACTAAAATCT GGGATTATGA AAAATATAGT GTCCCATTTA AAGAAGGCGA TCGCTACTTC TATTATAAAA ATGATGGTTT ACAAAATCAA AGTATATTAT ATACATTGCC TACTCTAGAT GCAGAACCAA AAGTTTTGAT AGATCCTAAC CAATTCTCAG AAGATGGAAC AGTCGCTCTT TCAGGAATTG CTATTAGTAA AGATGGCAAA TATATCGCCT ATGGTATTTC CAAGTCTGGT TCAGACTGGC AAGAATGGCG CATCAAAAAT ATTGATACTG GAGAACATTT CCCGGATGTT TTGCAATGGA TTAAATTCTA TATACCAACT TGGAAAAATG ATAATCAGGG TCTGTTTTAT AGTCGTTACG AACAACCAAA AGAAGGGAAG TTAAAAGATA CTAACTATTT ACATAAGGTT TACTATCATA GTTTAGGAAC TTCTCAAGAT AATGATGTAC TAATTTATGA AAAACCTGAA CAAAAAGAAT GGAGTTTTAA TTGTCATGTC ACAGAAGATA ATAAGTATTT AATTATTACA GTTTGGCAAA GTACAGAACG CAAAAATCTA GTTTTTTATC AGGATTTAAG TATACCAAAT GCGCCAATAG TGGAACTTAT TAGCGAATTT GAAGCTGAAT ACCTTTTGAT AGATAATTAT CAGAATATTT TTTGGTTTTT CACAGATTTA AATGCTCCAA AAAGACGAGT TATTGCCATT GATATTAACA ACCCTCCATC TCCTTCTTTA GTGAGAGGAG AAAATCAAAA TAAATGGCAA GAAATTATTC CTGAAGCTAC AGATGCATTG CAAGGGATAG GAACACTTAA TAATCAGTTT GTTACTTTCT ATTTAAAAGA TGCTCATACT CAGATAAAAA TATTTAATCT TGATGGTTCC CCAGTCAGAA ATGTAGAATT ACCTGGCATT GGTTCAGTAG TTGGTTTTTA TGGTAGACGT CACGACACAA GCACATTTTA TAGTTATGTT AGTTTCACAA CTCCCTCAAC TATTTATCAT TACGATATGG TGAGTGGTGA AAGTAAAATT TATCGTCAGT CAAATGTAGA TTTTAATCCT AATCAATTTG AAACAAAGCA AGTTTTTTAT AGTAGTAAGG ATGGTACAAG CATCCCCATG TTTATTACTC ATAAAAAAGG TGTAAAACTA GATGGCAATA ACCCAACTAT TCTTTATGGA TATGGAGGAT TTAATATTTC TCTGACTCCT AACTTTTCTA TTAGTAGATT AATTTGGTTA GAGATGGGGG GAGTCTACGC AGTGCCCAAT ATTCGTGGAG GTGGAGAATA TGGTGAAGGG TGGCATCAAG CAGGAATAAA ACAACAAAAA CAAAATGTAT TTGATGATTT TATTAGTGCT GCTGAATGGT TGATAGAAAA TAATTGGTCT TCTTCTCCAA AGTTAGCTAT TACTGGTGCT AGTAATGGTG GTTTATTAGT AGGTGCTTGT ATAACTCAAA GACCAGAATT ATTTGGTGCT GCTTTACCCG CAGTAGGAGT AATGGATATG TTACGTTTCC ATAAATTTAC TATTGGTTGG GCATGGACTG CTGAGTATGG TTCCCCAGAT GATCCCGAAG AATTTAAAGC TTTATATGCT TATTCTCCTC TACATAATTT AAAGCCAAAA ACATCTTATC CTCCAACTTT TATTACTACT GCTGACCATG ATGATCGGGT TGTTCCAGCT CATAGTTTTA AGTTTATTTC TACCTTACAA GAAGTTCATA TAGGAGATCA TCCAGTGTTA ATTAGAATTG AAACTAAAGC AGGTCATGGA GCCGGAAAAC CTACTACGAA AATAATTGCA GAAATTACAG ATGAATTTGC TTTTTTGCTG AGAAACCTTA AGATAGAATT ACCTGAAAAT TTTGGTAATT AA
|
Protein sequence | MKEKKQPLTY PITEKTDTVE NYHGVEVADP YRWLEDPNLE KTKEWVKSQN EITFNYLAEI SEGETIKKRL TKIWDYEKYS VPFKEGDRYF YYKNDGLQNQ SILYTLPTLD AEPKVLIDPN QFSEDGTVAL SGIAISKDGK YIAYGISKSG SDWQEWRIKN IDTGEHFPDV LQWIKFYIPT WKNDNQGLFY SRYEQPKEGK LKDTNYLHKV YYHSLGTSQD NDVLIYEKPE QKEWSFNCHV TEDNKYLIIT VWQSTERKNL VFYQDLSIPN APIVELISEF EAEYLLIDNY QNIFWFFTDL NAPKRRVIAI DINNPPSPSL VRGENQNKWQ EIIPEATDAL QGIGTLNNQF VTFYLKDAHT QIKIFNLDGS PVRNVELPGI GSVVGFYGRR HDTSTFYSYV SFTTPSTIYH YDMVSGESKI YRQSNVDFNP NQFETKQVFY SSKDGTSIPM FITHKKGVKL DGNNPTILYG YGGFNISLTP NFSISRLIWL EMGGVYAVPN IRGGGEYGEG WHQAGIKQQK QNVFDDFISA AEWLIENNWS SSPKLAITGA SNGGLLVGAC ITQRPELFGA ALPAVGVMDM LRFHKFTIGW AWTAEYGSPD DPEEFKALYA YSPLHNLKPK TSYPPTFITT ADHDDRVVPA HSFKFISTLQ EVHIGDHPVL IRIETKAGHG AGKPTTKIIA EITDEFAFLL RNLKIELPEN FGN
|
| |