Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2542 |
Symbol | |
ID | 9156703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2634996 |
End bp | 2635982 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | protein of unknown function DUF199 |
Protein accession | YP_003647484 |
Protein GI | 296140241 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGTTGA CATCCCAGGT CAAGGATGAG CTGAGCAGGC TCTCCATCAC GCAGGTGAGT TGCCGCCGCG CCGAGGTGGC CTCTCTGCTG CGCTTCGCGG GGGGACTGCA CATCCAGGGC GGCAGGGTGA TCGTCGAGGC GGAGGTCGAT ATGGGCATCA TCGCTCGCCG TCTCCGCAAG GAGATCCTCG ATCTCTACGG TTACAACTCC GATGTCCACG TGCTCAGCGC AGGTGGCCTC CGTAAGGCGG CCCGGTACAT CGTCCGCGTG GTCAAGGACG GCGAAGCGCT CGCGCGACAG ACGGGACTGC TCGATATGCG CGGGCGGCCG GTGGTGGGGC TGCCGTCGCA TATCGTCGGC GGGTCCGTCG GCGATTCGGA AGCGGCGTGG CGCGGCGCCT TCCTCGCCCA CGGGTCTCTG ACCGAACCCG GCCGCTCCAG CGCCCTCGAG GTTTCCTGTC CCGGGCCCGA GGTGGCGCTC GCGCTCGTCG GCTGTGCTCG CCGCCTCGGC GTCACCGCGA AGGCACGCGA AGTGCGCGGC GCCGACCGCG TGGTGGTCCG CGATGGCGAG GCCATCGGCG CGCTGCTCAC CCGGATGGGT GCCAACGACA CCCGGTTGGT GTGGGAGGAG CGCCGGATGC GGCGCGAGGT ACGCGCCACC GCCAACCGCC TCGCCAACTT CGATGACGCC AACCTGCGCC GCTCCGCCCG GGCCGCCGTC GCGGCCGCCG CGCGCGTCGA GCGGGCGCTG GAGATCCTCG GTCCCGACGT CCCCGATCAC CTCGCGCAGG CCGGTTCGCT GCGGGTGCAG CACCGTCAGG CGTCCCTGGA GGAGCTCGGT CAGCTCGCCG ACCCGCCGAT GACCAAGGAC GCCGTGGCCG GCCGCATCCG CCGCCTGCTG TCCATGGCCG ACAAGCGCGC CGCCGCCGAC GGTGTCCCGG ATACCGAATC CGCCGTCACC GCGGACATGC TCGACGACGG AGAATGA
|
Protein sequence | MALTSQVKDE LSRLSITQVS CRRAEVASLL RFAGGLHIQG GRVIVEAEVD MGIIARRLRK EILDLYGYNS DVHVLSAGGL RKAARYIVRV VKDGEALARQ TGLLDMRGRP VVGLPSHIVG GSVGDSEAAW RGAFLAHGSL TEPGRSSALE VSCPGPEVAL ALVGCARRLG VTAKAREVRG ADRVVVRDGE AIGALLTRMG ANDTRLVWEE RRMRREVRAT ANRLANFDDA NLRRSARAAV AAAARVERAL EILGPDVPDH LAQAGSLRVQ HRQASLEELG QLADPPMTKD AVAGRIRRLL SMADKRAAAD GVPDTESAVT ADMLDDGE
|
| |