Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3675 |
Symbol | |
ID | 9157855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3786787 |
End bp | 3788817 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | Prolyl oligopeptidase |
Protein accession | YP_003648592 |
Protein GI | 296141349 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.169529 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCCGT ACCTCTGGCT CGAAGAAGTG GAATCCGACA CGTCGCTGGA TTGGGCGCGC GAACGCAATG CCGATTCGGC GGCGCGGCTG GCCGGGGAGC GGTTCGAGAC GATCGAGTCC GAAGTGCTCG CGATCCTCGA TTCGGATGAG CGGATCCCGT CGGTGCGCCG GCGCGGCGAG TGGCTGTGGA ACTACTGGAT CGATGGGGAG CACCCGTACG GCCTGTGGCG GCGCACCACG CTGGAGTCCT ATCGCACCGA TTCTCCCGAG TGGGACGTGG TGATCGACCT GGACGCCCTG CGCGAGCAGG ACGGCGAGAA CTGGGTCTGG GGCGGCGCCG CCGTGTTGCG CGGCGAGTCG CCCGACGGTG CGCCCTGGGA CCGCGCGCTG GTGTCGCGCT CGCGCGGCGG TGCCGACGCG ACCGTGGTGC GGGAGTTCTC CATCTCGCGC CGCGAGTTCC TCGATCCCGA CGCCGGCGGC TTCGCCCTGG ACGAGGCGAA GAGCCGGATC TCCTGGATCG ACGCCGACAG CGTGTACGTG GGCACCGACT TCGGGCCGGG TTCGCTCACC GACTCCGGCT ACCCGCGGGT GGTGAAGCGG TGGCACCGTG GCACGCCGCT GTCCGAGGCG GTCACCGTCT ACGAGGGCGC GGAGTCCGAT GTCTCCGTGG GCGCCGTGTA CGACGACACC CCGGGCTACG AGCGGCACTT CGTGGGTCGC AGCACCGATT TCTACAACCA CGAGGAGTAC CTGCTCGATC CGGGATCGGG TGCGTTGGAG CTGATCGACG TGCCCACCGA CGCCGAGGCC GATGTGCATC ACGACCTCCT CCTGGTGTCG CCGAAGTCGC CGTGGCAGCT GCCGTCGGGG ACGGTCGACC CGGGCGCTCT GGTGGCCTTC GACTTCGATG CGTACCGGGC GGGCGGGCGC ACGTTCACCA CGATCTTCGC CCCGGACGCG CACACCAGCC TGCAGGGCTA CGCCTGGACG AAGTCGTACC TGCTGCTGGC GACGCTGCAC GATGTGCGCT CGGAGTTGCG CACGCTCGAC CCGAAGGATT GGTCCGAGGT GACCACGGCC GGGCTGCCTC CGCTGGCCGA GATCGGCGTC GCCGGAACCG ATCCACGGGA ATCCGACGAG GTATTCCTCT CCGCGAGCTC GTTCACCCTG CCGCCCAGCC TGCTGTACGG CGAGGCCGGC GGCACCGTCG AACCGCTCAA GTCCACGCCC GCGATGTACG ACGCCGACGG TGTGGTGGCC GAGCAATTCT TCGCCACCAG CGCCGACGGG ACACAGATCC CCTATTTCGT GGTGCGCAAG CCGGCGAACG GCCCGCAACC CACCCTGTTG TACGGCTACG GCGGATTCGA GATCTCCTTG ACGCCGGGCT ATTTCGCGGG TGCCGGCCGC ACCTGGATCG AACGCGGCGG GGTGTACGTG CGGGCGAACA TCCGCGGCGG CGGCGAGTAC GGGCCCACCT GGCACACCTC GGCGCTCAAG GAGAACCGGA TGCGCTGCTA CGAGGACTTC TCGGCGGTGG CCCGGGACCT GGTGGAGCGC GGTATCACCA CCCGCGATCA GCTCGGCGCG ATGGGCGGCA GCAACGGCGG CCTGCTCATG GGCGTGATGT ACACGATGTA CCCGGAGTTG TTCGGCGCCA TCGTCTGCCA GGTGCCGCTG CTCGATATGA AGCGGTTCCA TCTGTTGCTG GCGGGAGCAT CGTGGATGGC CGAGTACGGC GATCCCGACG ATCCCGAGCA GTGGAAGTAC ATCAGCGAGT ACTCGCCGTA CCAGAACCTG CCCGACGACG CGGCCGGGTA CCGGCCCGCG CTGCTGGTCA CCACGTCGAC CCGCGACGAT CGCGTGCATC CCGGGCACGC CCGTAAGTTC ATCGCGGCGC TGCGCGAGCG CGGGATCGAC GTGAACTACT ACGAGAACAT CGAGGGCGGC CACGGCGGCG CGGCCGATAA CAAGCAGGCG GCGTTCATGG CGGCCCTGGC CTACGAATTC CTGTGGAAGG AGCTGTCATG A
|
Protein sequence | MDPYLWLEEV ESDTSLDWAR ERNADSAARL AGERFETIES EVLAILDSDE RIPSVRRRGE WLWNYWIDGE HPYGLWRRTT LESYRTDSPE WDVVIDLDAL REQDGENWVW GGAAVLRGES PDGAPWDRAL VSRSRGGADA TVVREFSISR REFLDPDAGG FALDEAKSRI SWIDADSVYV GTDFGPGSLT DSGYPRVVKR WHRGTPLSEA VTVYEGAESD VSVGAVYDDT PGYERHFVGR STDFYNHEEY LLDPGSGALE LIDVPTDAEA DVHHDLLLVS PKSPWQLPSG TVDPGALVAF DFDAYRAGGR TFTTIFAPDA HTSLQGYAWT KSYLLLATLH DVRSELRTLD PKDWSEVTTA GLPPLAEIGV AGTDPRESDE VFLSASSFTL PPSLLYGEAG GTVEPLKSTP AMYDADGVVA EQFFATSADG TQIPYFVVRK PANGPQPTLL YGYGGFEISL TPGYFAGAGR TWIERGGVYV RANIRGGGEY GPTWHTSALK ENRMRCYEDF SAVARDLVER GITTRDQLGA MGGSNGGLLM GVMYTMYPEL FGAIVCQVPL LDMKRFHLLL AGASWMAEYG DPDDPEQWKY ISEYSPYQNL PDDAAGYRPA LLVTTSTRDD RVHPGHARKF IAALRERGID VNYYENIEGG HGGAADNKQA AFMAALAYEF LWKELS
|
| |