Gene Information |
Locus tag | Tery_2052 |
Symbol | |
ID | 4245700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3199942 |
End bp | 3201834 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638107163 |
Product | peptidase S9, prolyl oligopeptidase active site region |
Protein accession | YP_721766 |
Protein GI | 113475705 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
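
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(3199942, 3201834)
print(length_bp)                  # 1893
print(protein_length(length_bp))  # 630
```

Both values match the Gene Length (1893 bp) and Protein Length (630 aa) fields of this record.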
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.55137 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.714509 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGCA AAGAAAAAAC TACGGCTCAA CTCACACCTC TCATTCCTCG TGAAATTTTA TTTGGGAACC CTGAACGTAC TAAACCAAAA ATTTCCCCCG ATGGAAAATA TCTTGCCTAC ATCGCCCCAG ATGATAAAAA CGTTTTGCAA GTATGGGTAC AAACTATTGG TAAACAAGAC GATCGCCAAG TTACTGCTGA CCAAAAACGA GGCATTCGTA TGTATCTTTG GACTTACAAG CCAGACCAAC TGATATACCT CCAAGATGCT GGTGGTGACG AAAACTTTCA TCTTTATCAA GTAAATATTC AATCAAACAT AGTGCGCGAC CTGACCCCAT TCCAAGGAGT TAAAGCGCAA ATAGTAAATT TAGATCATTG TTTTCCCGAC CAAATATTGG TGGGAATGAA TCTAAAAAAC CCCCAAAGTT TTGATGTTTA CCGCATTGAT CTTAATAATG GTGCAGTAGA ATTTGATACC CAGAACCCAG GTAATATTGT TGGTTGGACT GCTGATGCTC AATTTCAAGT CCGGGCAGCG ATCGCCAGCA CTGAGGATCG AGGTTCTAAT TTATTTTATC GGGAAACTAC AGAAAAACCA TGGGAAACTT TGCGTCATTG GGGACCTGAT GAACAAGGTG GTCCGGTTCT CTTCTCCGAT GATGGGAATA TTCTCTATAT ATCGGGTAAC CATGATGCTA ATGCTCAACG TCTACTTGCT CTAGACCTTA GAGATGGTCA AGAAAAAGTA ATTGCTGAAG ACTCCCAATA TGATATTAGT GGCATCATAA CTCATCCTAC AACTCGTACT ATTCAAGCAG TAGGTTTCTA CAAAGATAAG CTAGAATGGC AAGTTCTGGA CGATAGCATT ACTGCAGATT TTGAATTTCT GAAAGCAGCT CATAAAGGGG AGTTTCGCAT CGTAGACCGG ACTCTTCAGG ATCTTAACTG GTTAGTAGCT TATTTCACTG ATGACGGACC AGTTTATTAT TATGCCTACG ACCGTACTGC TAAAACTACA ACTTTTTTGT TTACGAACCA ACCAAAACTA GAAGGTTTAC AGTTGGCTCC TATGGAACCT ATTTCCTATG TTGCTAGGGA TGGTTTAACT ATTCATGGAT ATTTGACTAA ACCTGTGGGG GTTTCTACTC CAACTGCAGC AGTGCTTTTA GTTCATGGAG GACCTTGGGC TCGGGATACT TGGGGTTATA AAGGGCAAGC TCAATGGTTG GCTAACCGGG GTTATGTGGT GTTGCAAGTT AATTTCCGCG GTTCAACTGG TTATGGTAAA GACTTTCTCA ATGCTGGTAA CCGGGAGTGG GGAGCGAAAA TGCATGATGA TCTCATTGAT GGAGTTAACT GGCTAGTTGA AAAGGGTATC GCTAACAAGG ATGAAATTGC TATTATGGGT GGTTCTTATG GTGGTTATTC AACTTTAGTA GGGTTAACTT TTACTCCGGA AGTCTTTGCT GCTGGTGTTG ATATTGTCGG ACCTAGTAAT TTGATCACTT TAATGGAAAC TATTCCACCT TATTGGAAAC CTCTGAAGAG GGTTTTTAGT CATCGTATGG GAGATATTGA AACAGAGCCA GAGTTTTTAA GGTCGCGATC GCCTCTGTTT TTTGTGGATA AAATTCAGAA ACCTTTGTTA ATTGGCCAAG GAGCAAATGA CCCACGGGTG AAGGAGTCAG AAAGTGAACA AATTGTTCAG GCTATGAAGG ATGCTGGTAA GCCTGTAGAA TATGTTTTAT ATGAAGATGA AGGTCATGGT TTTGCACGCC CAGAAAACCG TTTACATTTT TATGCGATCG CTGAGGAATT TTTGGCAAAA TACTTAGGTG GAAAATTTGA ACCGGCGGGT TCTATAGATG GGCACTCTGG TGTGGTTAAG TAG |
Protein sequence | MTSKEKTTAQ LTPLIPREIL FGNPERTKPK ISPDGKYLAY IAPDDKNVLQ VWVQTIGKQD DRQVTADQKR GIRMYLWTYK PDQLIYLQDA GGDENFHLYQ VNIQSNIVRD LTPFQGVKAQ IVNLDHCFPD QILVGMNLKN PQSFDVYRID LNNGAVEFDT QNPGNIVGWT ADAQFQVRAA IASTEDRGSN LFYRETTEKP WETLRHWGPD EQGGPVLFSD DGNILYISGN HDANAQRLLA LDLRDGQEKV IAEDSQYDIS GIITHPTTRT IQAVGFYKDK LEWQVLDDSI TADFEFLKAA HKGEFRIVDR TLQDLNWLVA YFTDDGPVYY YAYDRTAKTT TFLFTNQPKL EGLQLAPMEP ISYVARDGLT IHGYLTKPVG VSTPTAAVLL VHGGPWARDT WGYKGQAQWL ANRGYVVLQV NFRGSTGYGK DFLNAGNREW GAKMHDDLID GVNWLVEKGI ANKDEIAIMG GSYGGYSTLV GLTFTPEVFA AGVDIVGPSN LITLMETIPP YWKPLKRVFS HRMGDIETEP EFLRSRSPLF FVDKIQKPLL IGQGANDPRV KESESEQIVQ AMKDAGKPVE YVLYEDEGHG FARPENRLHF YAIAEEFLAK YLGGKFEPAG SIDGHSGVVK |
| |
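
The protein sequence in this record follows from the gene sequence by codon-wise translation with translation table 11. The sketch below shows one way to reproduce that step and to compute a GC percentage. It is an illustration under stated assumptions, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the input is assumed to be in coding orientation, as the gene sequence listed here is (it starts with ATG even though the gene lies on the minus strand). Table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so the standard 64-codon layout is used.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGACCAGCA AAGAAAAAAC TACGGCTCAA"))  # MTSKEKTTAQ
print(translate("GTGGTTAAG TAG"))                     # VVK
```

The two fragments translate to the first ten and last three residues of the listed protein sequence. A gene that starts with an alternative start codon such as GTG or TTG would need its first residue set to M explicitly, which this sketch does not do.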