Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2959 |
Symbol | |
ID | 9157127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3066097 |
End bp | 3069132 |
Gene Length | 3036 bp |
Protein Length | 1011 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | MMPL domain protein |
Protein accession | YP_003647894 |
Protein GI | 296140651 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.301766 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACACG AGAGCAAGCG TTTCGAGGCA CTGGGCCGAT GGTGTGCTCG CCACGCGTGG CTCGTGCTCG GGCTCTGGGT GGCACTGGCG GGCGTGCTGA ACGTCGCCGT GCCCCAGTTG GAGAAGGTGG TCTCGCAGCA TTCGGCGCCG TTCGTCGCGC AGAATCTCGA GGCGGTCGAC AACCTGCGGG CCATGGCCGC AGACTTCGGC ACCGTGCCCA CCACCGGGAT CGGTTCCGTG GTGATCACCA ATCCCGCTGG AATCTCCGAT GCCGACCGCG CGTACTACCA GCGGCTACTG GAGAAACTGC AGGCCGATCG GGAGAACGTG GCCTGGCTGT TGGACACGTA CTCGAAGCCG GAGACCCGCG CCGTCGGTCT GTCACCGGAC GGGAAGGCCA TCAACCTCGT CTTCGCCGTC ACCGGCGACG CCGGTTCCAC GCAGGCGCAC CACGCCACCA CCGCGGTACG GGCGATGGTC GACGACGCCG CGGCGCCGGA AGGCACCGAG GTCCACTACA CCGGGCCGTC CCCCACGCTC GCAGATCTGT TCTCCGCCAT CGACTATTCG CTGCTGATCA TCACCGTGAT CTCGGTCCTG CTCATCACGA TGGTTCTGCT CGTGGTGTAT CGATCACTGT GGACCGCGAT GGTGCCGCTC GTGACGATCG GGCTGGGACT CGCGGTGTCC CGGCCGATCA TCGGCCTCCT CGGGATGAAC GAGGTTCTCT CGATATCCAA TTTCACGATC GCCATCGGCA CCGCGCTGGT GCTCGGCGCC GGTACCGATT ACGCCATCTT CGCCATCGCG GCCTACCACG AGGGGCGCCG GCGCGGGATA CCGGCACGTG AGGCCGCCGC GTACTCCTCG CTGAAGATGT CCACCATTCT GGTGGCTTCG GCGCTCACCA TCGCCGCTGC GTGCTGCTCG ATGGCGTTCA CTGAGATCGG CATGTTCCGC ACTGCGGGGC CGCCGACCGC GATCGCGGTG GCCGTCACTT TGCTCGTGGC ACTGACGGTT CCGCCCGCGC TGCTGCGGCT GCTGGGTGAA CGCGGCAAGG CCGAGCCCCG CCCGCTCGAC GAGCGCAAAT GGCGCCACCG GGGCGCCCGC ATCGTGCGAC ATGCCGTGCC GATCACGGTG GTGTGCCTCG CGATCCTGGT GGCGGCGGCC TCCGTGCTGC CTACCCTGCG AATCGGCTTC GACGAGAACA GCATGCAGCT CCGCAGCACC GATTCCAAGG TCGGTTACGA CCGCGTCTAC GAGCACTGGG GCGTCAACGA GGTGAGCCCC GAGTACATCC TCATCAGGTC CGACCGCGAT ATGCGTAACA CCAGGGACCT GGCCGCCCTC GAACTCGCGG CCGCCCAGGT GGCGAGCCTG CCCCAGATCG CGTACGTGCG ATCGATCACC CGGCCCGACG GTCAACCGAT CGCCGAATCG GCGGTCGGGT TCCATACCGG CCAGGTGGCC GACCGGCTGG CCGGCGGCGA GAAGCAGATC GCCGATGCCA CACCGCAATT GAAGCGGCTG GCGTCCGGAG TCGCCGAACT GCACGACGGC GCGCAGCAGG CCAGTGACCG CCTTCCGGAG CTCAAGGCCG GAACCGACCA GGTGGTTGCC CTCGCCAACG GCGTGCTGTC GGCGTACGGC GCCGTCGAAG ACGTGGTGCG CACGGCCTCG TCAGGGCGCG CCGGCAGTCG CGAAGCACTC GCGACCGCCA CCGGCGCCAT CGACGCGGTG CGCGGGACGT TACCGGCGTT GACCAGGCTG GCCACGCAGG TCGACCCACT GGCTCGCGAT GCCCGGTCGG TCCTCGGGCC GCTGGTCACC GGCGAACCGT CCGCCGCGTG CCGGGCCGAC GGTGCCTGCA TGCGCGCCCG GGCCGCTCTG GCCGAGCTCG ACGCCGCCAC CGGCGGCCGG GCGCGACCCG TTCTGGAACA GACCGTCGCT CTACAGCCGG TCGGCGCGGT CACCCTACTG GGCACCGTCA TCGACAAGGT CCAGGTGGCG ATCGCGAGCC TGGGCGGCCT GATGGACCAG CTCGGCGACC GCAGCCCCGA ACAGGTCCGG GCCGAGCTCC AGCGCCTCAG TGCGGGCGTC GGCGAGCTGC AGTCCGGGAT GGCCCGGCTG ACCGCGGGAC TCGGCGAAGT GCGTGCCGGG ACCGGGCAGA TGCTCACCAT GACCGGCGCT CTGCAGGCCG GGTTGCAGCA GGCCGTCGAC TACCTGCGCG GCGTGCAGAC GGGCACCGAT GCGGGCGCCG GCCGCGGCTT CTACCTCCCG GAACAGGGTC TGACCGACCC TCGCTTCGTG ACCGGTTCCC AGTTGTTGAT GTCGCCGGAT GGCCGCAGCG CCCGGATGCT CGCGGTGTGG AAGATCAATC CGTACGGTTC CGAGGCGCTC GACGAGGTGC CGCAACTGAC CGACGCGGCC CGTCATGCAC TGGTGGGAAC CGGTCTCGAG GGATCGGAGG TCCGCACCTC GGGCCTGACC TCACTCTCGG CGCAGATGCG CGATCAGGTG TGGAAGGACT TCGCCACCTT CGGTGTGATC GCGGTGCTCG CGGTGCTGAT CCTGCTCTGC GTACTGCTGC GCAGCCTGGT AGCCCCTCTG GTGATGGTGG CGGCGGTGAT CGTCTCGTTC GCCGCAGCGG CGGGCGTCAG TACCTTGGTG TGGCAGTACA TCATCGGGAT CGACCTGGAC TGGAGCGTGT TGCCGATCGC GTTCATGGCA CTGGTCGCGG TGGGCGCCGA CTACAGCATG CTGTTCGCCG CACGGATTCG GGAGGAATCC GGTTCCGGCA TGATCAGCGG CATCCTGCGC GGCTTCGGTA GCACGGGCTC CGTGATCACC ACCGCCGGAA TCGTCTTCGC GCTCACCATG CTCGGACTGA TGGGCGGTAC CGTGATCAAC CTGTTGCAGA TCGGCTCGAC CATCGCGCTC GGCCTGGTCC TGGACATCAC GGTGGTGCGC ACCTACCTCC TGCCCTCGGT GATGGCCATC GCGGGCAATC GGATCTGGTG GCCCGCCAAG GCGTGA
|
Protein sequence | MKHESKRFEA LGRWCARHAW LVLGLWVALA GVLNVAVPQL EKVVSQHSAP FVAQNLEAVD NLRAMAADFG TVPTTGIGSV VITNPAGISD ADRAYYQRLL EKLQADRENV AWLLDTYSKP ETRAVGLSPD GKAINLVFAV TGDAGSTQAH HATTAVRAMV DDAAAPEGTE VHYTGPSPTL ADLFSAIDYS LLIITVISVL LITMVLLVVY RSLWTAMVPL VTIGLGLAVS RPIIGLLGMN EVLSISNFTI AIGTALVLGA GTDYAIFAIA AYHEGRRRGI PAREAAAYSS LKMSTILVAS ALTIAAACCS MAFTEIGMFR TAGPPTAIAV AVTLLVALTV PPALLRLLGE RGKAEPRPLD ERKWRHRGAR IVRHAVPITV VCLAILVAAA SVLPTLRIGF DENSMQLRST DSKVGYDRVY EHWGVNEVSP EYILIRSDRD MRNTRDLAAL ELAAAQVASL PQIAYVRSIT RPDGQPIAES AVGFHTGQVA DRLAGGEKQI ADATPQLKRL ASGVAELHDG AQQASDRLPE LKAGTDQVVA LANGVLSAYG AVEDVVRTAS SGRAGSREAL ATATGAIDAV RGTLPALTRL ATQVDPLARD ARSVLGPLVT GEPSAACRAD GACMRARAAL AELDAATGGR ARPVLEQTVA LQPVGAVTLL GTVIDKVQVA IASLGGLMDQ LGDRSPEQVR AELQRLSAGV GELQSGMARL TAGLGEVRAG TGQMLTMTGA LQAGLQQAVD YLRGVQTGTD AGAGRGFYLP EQGLTDPRFV TGSQLLMSPD GRSARMLAVW KINPYGSEAL DEVPQLTDAA RHALVGTGLE GSEVRTSGLT SLSAQMRDQV WKDFATFGVI AVLAVLILLC VLLRSLVAPL VMVAAVIVSF AAAAGVSTLV WQYIIGIDLD WSVLPIAFMA LVAVGADYSM LFAARIREES GSGMISGILR GFGSTGSVIT TAGIVFALTM LGLMGGTVIN LLQIGSTIAL GLVLDITVVR TYLLPSVMAI AGNRIWWPAK A
|
| |