Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0347 |
Symbol | |
ID | 9154482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 361672 |
End bp | 363732 |
Gene Length | 2061 bp |
Protein Length | 686 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | peptidase S15 |
Protein accession | YP_003645329 |
Protein GI | 296138086 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCTGA ATCGGCTCGG GCGCGCGCTC ACCGCGCTCC TCGCAGTAAC CGTGATCGTC GCCGGAGCGG CGGTGCCCGC ACTGCGCGCG CCGGCGGCCG CGGAACCGTG GAAGCCGGGT GCCGGTGTGG CGGGGATGAT CAATCCAGAG TGGATCGCCT CGCACGACGG CCCTTCGCAG TACCCGCGGA TGGCCACCGA GTGGGATGTT CCTATTCGCA TGTCCGACGG CACTGTATTG CGCGCCAACA TCTTCCGGCC CGCCGATGCG TCCGGCAAGG CGATCGAGAC CGCGATGCCG ACGATCGTCA ACATGACGCC CTACACCAAG TTCATCAGCA CCATCGTGAC ATTGGTGACC AACGTCCCGG TGCTGTATCC GGCACTGGTG GGACTGCTCA ATCTGTTCAA CTTCTCCGGC ACGCCGATCG CCGGGGTGGA CGATCTGCGC AACACGCTCA ACGGTGGCCT GCTCAACACC TTCACCGTGG ATCAGAAGCT GGTACAGAGC GGGTACACCC AGGTGGTGGT GGACGTGCGC GGCACCGGCA ATTCGCAGGG CGTGTGGCAG GTCTTCGCTC AGCGCGAGCA GCAGGACACC GTCGAGGTGC TGGACTGGAT TCGCAAGCAG AGCTGGACCA ACGGCCGGTT CGGTATGGCA GGCGTGTCGT ATTCGGCGAT CAACCAGTTG CAGGTGGCGT CGAAGAATCC CGAGGGACTA CAGGCCCTGT TCCCGGTGGT GCCCGGCGCC GATATCGCCG CGGACATCGT GGCGCCCGGC GGCGGGCTCG GCGTCGGCTT CATCGGTCCC TGGCTCGCCA TGGTCAACAT CCTCAAGTTC ATTCCCGATC TCCGGTCGCT GCTCAACGGC ACCTTCGATT GGAAGTGGTT GCAGGACAGG CTGCAGAACC CCGCGGTGTT CGTCCCCGAG CTTCTCACCG GTGTCTTCTC GCCCACCGTG GAGGGCCTCA CCCCGACCAC GAAGGAACTG ATCAACAAGG ACTCGACGCT GCGCGCCGCC TTCCAGACGC CTCTGGACAA GGTGACCACG CCCACGTTCG CGCTCGGTGG CTGGAACGAC CTGTTCACCA ACACCGAATG GCGTCTGCCC ACGGCGCTGA GTGCGCTGTC CACGGCCAAG AAGAAGCTCA TCATGGGCGA CGCCGCACAC GTCACCGTCA CCAACGATAT GGGCGGAGTG GGGCAGCCGA CCCGTGCCGA CGTCCTGCAG AAGGCGTGGT TCGACAAATG GCTCAAGGAC GCCGATAACG GTATCGACAA CTATGCGCCG GTCACGGTGC ACCGGATGGG TGGCGGCTGG TGGCAAGGGG ACACCTTCCC CGAGCCCGAG CAGAAGTACC AGCGCATGTA CCTCTCCAGC CTGCCCTCGG GTACCGCCCC GACGGCGCTC TCGGACAACA GTCTTCAGAC GTCACCTCCG AAGGTTCCGG CGCGCCGCAC GGTGGGGCCG GGGCTGTCGA CGCTGTGCTC GAACGACACC GGCCAGGCGA TGTTCGGTCT GCTGGTATTC CAGGGGTGCA CCAAGGACAA TCGCGTCGCC GAGATGAACG CGCTCACCTT CACCTCCAAG CCGGTGGGCA AGCCGACCGT GATCTCCGGA CCGATCAACC TGCGGCTCAA CACAACCCAG GACGCCAAGG ACGCCTACTG GAGCGTGATG GTCACCGACG TCGGCCCCGA CGGCAGGTCC GAGAAGATCA GCTCCGGTCA GCTCACCACC TCGCTGCGGC AGATCGACGA ATCCCGCAGC ACCCGCACTG CGGGTGGAGA CATGGTCGAT CCGTACTACA AACTCAATCT CGCCGACCGG CAGCTGGTAG CCCCCGGACA GGTGGTGCCG CTCGACATCG GCACCCACGC CGTGAGTGCC GTGCTCAAAC CCGGGCACCG GCTGCGTGTC GATGTCTTCG CGCTCAATCT GATCAAGGCG ATGACCGTCG GACCGGTGAC CGCGGAGACG CAGTTCCGGC CGCAGCACGT ACTGATCGAT CCGAAGCAGC CCAGCTACCT GGTGGTGCCA TCGGATCGGC CCCTGCCGTG A
|
Protein sequence | MILNRLGRAL TALLAVTVIV AGAAVPALRA PAAAEPWKPG AGVAGMINPE WIASHDGPSQ YPRMATEWDV PIRMSDGTVL RANIFRPADA SGKAIETAMP TIVNMTPYTK FISTIVTLVT NVPVLYPALV GLLNLFNFSG TPIAGVDDLR NTLNGGLLNT FTVDQKLVQS GYTQVVVDVR GTGNSQGVWQ VFAQREQQDT VEVLDWIRKQ SWTNGRFGMA GVSYSAINQL QVASKNPEGL QALFPVVPGA DIAADIVAPG GGLGVGFIGP WLAMVNILKF IPDLRSLLNG TFDWKWLQDR LQNPAVFVPE LLTGVFSPTV EGLTPTTKEL INKDSTLRAA FQTPLDKVTT PTFALGGWND LFTNTEWRLP TALSALSTAK KKLIMGDAAH VTVTNDMGGV GQPTRADVLQ KAWFDKWLKD ADNGIDNYAP VTVHRMGGGW WQGDTFPEPE QKYQRMYLSS LPSGTAPTAL SDNSLQTSPP KVPARRTVGP GLSTLCSNDT GQAMFGLLVF QGCTKDNRVA EMNALTFTSK PVGKPTVISG PINLRLNTTQ DAKDAYWSVM VTDVGPDGRS EKISSGQLTT SLRQIDESRS TRTAGGDMVD PYYKLNLADR QLVAPGQVVP LDIGTHAVSA VLKPGHRLRV DVFALNLIKA MTVGPVTAET QFRPQHVLID PKQPSYLVVP SDRPLP
|
| |