Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3479 |
Symbol | |
ID | 9157654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3577680 |
End bp | 3580913 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Acyl transferase |
Protein accession | YP_003648399 |
Protein GI | 296141156 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAG CCTCCGAGAT CACCGCAGCA CAGCTCCTCG ACCGCCTCAA GAAGGCGGCG GTCGACCTCA AGCAGGCCCG CGACGATCTC CGCGGGCTCC GTGCCGCCGC GAACGAGCCC ATCGCCATCG TGGGAATGGG CTGCCGCTTC GGCGGAGGTA TCGACTCGCC CACAGATCTG TGGGAGGCCG CGTGTGCGGG CGAGGAAACC GTCGAGAACT TCCCGACCGA CCGGGGCTGG CCGGAGTTCG ACAGCGCGAG CCGGCGCGGA TCCTTCCTCC GCGACGCCGC GGGCTTCGAC GCAGGCTTCT TCGGCATCGG CGGATACGAG GCGACCGCCA TGGACCCGCA GCAGCGGCAC GCGCTGGAGA TCGCCTGGGA GGCCATCGAA GACGCGCGCA TCGACCCACG GTCGTTGCGC GGCAGCCGAA CCGCGGTGTA CCTCGGCGCC ACGTCGTTCG GCTACGGTGG CGACTACCTG GGCGCCGGCG ACGGTCTCGC CGGACACCTG GTGACCGGCA ACGTGACCAG CGTGCTGTCG GGCCGGATCA GTTACCTCCT CGGTCTGACC GGACCGGCGG TCACACTCGA CACCGCATGC TCCTCGGCGC TCTACGCGGT GCACCTGGCC ATGGCGGCGC TGCGATCGCG CGAATCCGAT CTCGCGCTCG CAGGTGGAAT CACGGTGATG TCCACCCCCG GCGTCTTCGC CGAGTTCACC CGACAGGGCG GGCTGGCCGC CGACGGGCGC TGCAAGTCCT TCTCCGCGAC CGCGGACGGA ACCGGATGGG GAGAAGGCGC CGGAGTCGTC GTCCTGCAGC GACTCTCGGA TGCCATGGCC CAAGGACGCC GGGTTCACGC GGTCATTCGC GCCGGCGCCG TCAACCAGGA CGGCGCCAGT AACGGTATGA CGGCCCCGAG CGGGGCGGCG CAGCGGGCGG TGATCACGGC CGCTCTCCAC GCTGCGGGCC TGGCGGCCGA GGACGTCGAC GCCGTCGAAG CGCACGGCAC GGGAACTGTC CTGGGAGACC CGATCGAGGC GTCCGCGGTG CTGGGTACCT ACGGGCAGGG CCGCCCGCCC GGACGTCCCC TGTGGCTCGG GTCGATCAAA CCCAACATCG GGCATACTCA GGCCGCCGCA GGTGCTGGGA GCCTGATCAA GGCGGCGTTC ATGGTCGGTA CCGGCGTCCT GCCCCCATCG CTGCACAGCG GTACACCCTC GGGCGTCGTC GACTGGTCGT CCGGGGCCGT CGAACTGCTC ACCCGGGAAC AGCGATTCCC GGATGCGGGC CGGGAGCGCC GAGTGGGCGT CTCCGGCTTC GGAATCAGCG GCACCAACGT GCATCTGATC GTGGAACAGG CCCCCGGCGA TCCGATCGGC GACGCCGGTG TCCAGGCCGT GCCCCGAGCG ATCGTACCGG TGGTCTACTC GGGCCGCACC ACCGCGGCAG CGCGCCGCGC CGCGCAGGCG CTCGCCGCAC GCCTTTCGGA AGCAGCCGGA GCCGAAGGGG CGGGCCTCCT GCGCGATCTC GCCTTCTCTC AGATCACCAC CCGGGCGCAT CACGAGGAGC GGGCAGTGGT CCTCGCGGAT TCCGCTCCCG CGGCCGTCGC CGCCCTCCGA ACACCACCGG CCACCGTGCG CGCGCTCGCC GAACCCCGGC CGGTCTTCCT CTTCCCCGGC CAGGGTGCGC AGTGGGCCGG TATGGGAACC GACCTGCTCG AGGGGTCCCC GATCTTCGCC GAACGGTTCG CGCAGTGCGC GGCCGCCCTC GCCGAATACA CCGATCACTC GCCCTACGAA GGGCTTCGGG CCGCCGCCAC GGTGGACACC GTCCAACCCA CGCTCTTCGC CATGATGGTG TCGCTGGCCC ACTTGTGGGC CGCACACGGG GTGCGGCCCA CCGCGTTGAT CGGCCACAGC CAGGGTGAGA TCGCCGCCGC CGTCGTCGCC GGAGCCCTGT CCCTGGACGA CGGTGCCCGT ATCGTTGCGG TGCGCAGTCG CGCCCTGGTA CCCGCCTGTG GGAACGGCGG TATGGCGGCG ATCGCAGCCG CTCCGGGCGT CGTCGAGGAC CTGCTCGCGT CCCGCCACTG GGACATCGAT ATCGCAGGCC GGAACGGACC CACCTCGACG GTGGTCTCGG GGCCATCCGA CCAGCTCGAG CGGCTGCGCA GCGAACTCCT CGCCCGCGAC ATCCGGTGCT GGATCATCGA TGTGGATTAC GCCTCCCACG GGCGACAGAT GGACCAGCTG CGCGAGCGGC TGCGCCGGGA CATCGGCCCG GTCACGGCCC GCTCCACGGC GACCCCTTTC TTCTCCACGG TCACCGGAAC GGAGATCGAT ACCGCCCGAA TGGATTCCGA CTACTGGTTC GCCAACCTGA GGAACCCGGT GCTGCTGCAG GACGCCGTGA CCGCCGCCCA CGACCGCGGC CACCGCGCCT TCGTCGAGAT CAGCCCCCAC CCCGTGCTCA ACATCGCCCT GCTGTCCACC TTGGAGCAGT GCGCATCGAC ACCCACCGCG GTGCTGGCCA CCCTGCGCCG TGATCAGGGC TCCTCCGACG ACTTCCTCCG CTCACTCGCC GATGCCTACT GCGCCGGTCT CCCGGTCGAC TTCGCCGGAT ACCTCGCCGG CGGACGACCG ACCGATGTGC CGCACTACCC GTTCGAGCAC CGCCGGTACT GGTACACCCC ACCCCGTCGC ACCGGCGGCG AAGGAGGCGA CGACATCGGC GGCGCCGCCA CGCTCGACCT GCCGGAGCCG GACGATTCCG GTTCGGAGAG CTCCGATCAC GAAGCTGCCG CGTTCGCCGC GGAAATCCGG AGTCTCGGTG CGGCCCGTGG CCGGCGCGCG GTGGTCGCCG TCGTCCTGGA GGCGCTCGCG AGCGCGCTGG GAGCCGATGC CGCATCCGAT CTCGCCGCCG ACCGCTCGTT CCTCGACCTC GGGGTGGGAT CTCTGGCCGC AGTGGCACTG CGCACCGCGC TCAGCGCGCG AACCGGACTC GCGCTGTCGA CCACGATCGC CTTCGAGTTC CCGACTCCCG CCGCCCTCGC AGAGCACGTT CACAGCCGTC TCACCGGAGA AGCCGACGCT CCAGCGGCCG GAACCGAATC GAGCGCGACG GAGCAAACCG GTGCATCGCG CAGCCAGGAT GCCGCGATCC TCGCCGAGTT GGCGCAACTC ACCGAGGTCG ACGACGACGA CATCTTCGCC GCCCTCGACC GAGAGTTGGG ATGA
|
Protein sequence | MSEASEITAA QLLDRLKKAA VDLKQARDDL RGLRAAANEP IAIVGMGCRF GGGIDSPTDL WEAACAGEET VENFPTDRGW PEFDSASRRG SFLRDAAGFD AGFFGIGGYE ATAMDPQQRH ALEIAWEAIE DARIDPRSLR GSRTAVYLGA TSFGYGGDYL GAGDGLAGHL VTGNVTSVLS GRISYLLGLT GPAVTLDTAC SSALYAVHLA MAALRSRESD LALAGGITVM STPGVFAEFT RQGGLAADGR CKSFSATADG TGWGEGAGVV VLQRLSDAMA QGRRVHAVIR AGAVNQDGAS NGMTAPSGAA QRAVITAALH AAGLAAEDVD AVEAHGTGTV LGDPIEASAV LGTYGQGRPP GRPLWLGSIK PNIGHTQAAA GAGSLIKAAF MVGTGVLPPS LHSGTPSGVV DWSSGAVELL TREQRFPDAG RERRVGVSGF GISGTNVHLI VEQAPGDPIG DAGVQAVPRA IVPVVYSGRT TAAARRAAQA LAARLSEAAG AEGAGLLRDL AFSQITTRAH HEERAVVLAD SAPAAVAALR TPPATVRALA EPRPVFLFPG QGAQWAGMGT DLLEGSPIFA ERFAQCAAAL AEYTDHSPYE GLRAAATVDT VQPTLFAMMV SLAHLWAAHG VRPTALIGHS QGEIAAAVVA GALSLDDGAR IVAVRSRALV PACGNGGMAA IAAAPGVVED LLASRHWDID IAGRNGPTST VVSGPSDQLE RLRSELLARD IRCWIIDVDY ASHGRQMDQL RERLRRDIGP VTARSTATPF FSTVTGTEID TARMDSDYWF ANLRNPVLLQ DAVTAAHDRG HRAFVEISPH PVLNIALLST LEQCASTPTA VLATLRRDQG SSDDFLRSLA DAYCAGLPVD FAGYLAGGRP TDVPHYPFEH RRYWYTPPRR TGGEGGDDIG GAATLDLPEP DDSGSESSDH EAAAFAAEIR SLGAARGRRA VVAVVLEALA SALGADAASD LAADRSFLDL GVGSLAAVAL RTALSARTGL ALSTTIAFEF PTPAALAEHV HSRLTGEADA PAAGTESSAT EQTGASRSQD AAILAELAQL TEVDDDDIFA ALDRELG
|
| |