Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3860 |
Symbol | |
ID | 4243522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5963776 |
End bp | 5964636 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638108790 |
Product | prolyl aminopeptidase |
Protein accession | YP_723373 |
Protein GI | 113477312 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01250] proline-specific peptidases, Bacillus coagulans-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.4706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0554671 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGCTA AAATACGAGA TACAGAAATC TATTTTGATA TTGAAGGTGC TGCCCTAATT CCAGATGGTG ATCGGATGCA AGAAAAACCT ATTGCTTTTG TAATTCACGG CGGCCCTGGT GCAGATCATA CTTCTTATAA ACCTACCTTT TCTCCTCTTA GTCAAAAATT ACAATTAGTA TATTTTGACC ACCGGGGACA AGGGCGATCG GCAAGGGGGC TAAAAGAAAG TTATACCCTA GAAAATAATG TGGAAGATAT GGAGGCATTA CGTCAATATT TGGGCATAGA AAAAATTGTT CTTATTGGTA CTTCTTACGG CGGAATGGTT GCCCTTAGCT ATGCAGTACG TTATCCAGAA AGTGTTCAAT CTTTAATTGT CATTGCTACA GCTGCAAGTT ATCGTTTTTT GGAATTGGCA AAAGTTAATC TAGCTAAAAA AGGAACAACA GAACAACAAG CGATCGCTCA ACTTTTATGG GATGGTAAAT TTGAAAATGA GGCACAACTC AAAGAATATT TTCAGGTGAT GATGTCTATG TATTCAATTA CCTATAAAGC AGAAACATCA GGGAAAAGTT GGAATGGAGC TATTTTATCT CCCGATGCTA TTAATGTTGC CTTCGGTGGT TTTTTGCGTT CTTACAATGT ACTCGATCAA CTACATAAAA TTACTGCTCC TACTTTAGTA ATAGGGGGCA AACATGATTG GATCTGTCCC CCGGAATTAT CAGAACAAAT TGCTGCAGCT ATTCCTAATA CAGATTTAAG AATCTTTGAA AACAGCGGTC ATTTAATTCG GGTTGATGAA CCAGAAGCAC TTCTAGATGC AATTATGGTT TTTTTAGTGA ACAAGGGATA A
|
Protein sequence | MRAKIRDTEI YFDIEGAALI PDGDRMQEKP IAFVIHGGPG ADHTSYKPTF SPLSQKLQLV YFDHRGQGRS ARGLKESYTL ENNVEDMEAL RQYLGIEKIV LIGTSYGGMV ALSYAVRYPE SVQSLIVIAT AASYRFLELA KVNLAKKGTT EQQAIAQLLW DGKFENEAQL KEYFQVMMSM YSITYKAETS GKSWNGAILS PDAINVAFGG FLRSYNVLDQ LHKITAPTLV IGGKHDWICP PELSEQIAAA IPNTDLRIFE NSGHLIRVDE PEALLDAIMV FLVNKG
|
| |