Gene Information |
Locus tag | Tpau_3478 |
Symbol | |
ID | 9157653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3572955 |
End bp | 3577676 |
Gene Length | 4722 bp |
Protein Length | 1573 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Acyl transferase |
Protein accession | YP_003648398 |
Protein GI | 296141155 |
COG category | |
COG ID | |
TIGRFAM ID | |
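The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Tpau_3478
length = gene_length_bp(3572955, 3577676)
print(length)                     # 4722
print(protein_length_aa(length))  # 1573
```

Both results agree with the Gene Length and Protein Length fields of the record.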
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGC AGGAAGCCAA GCTGCGCGAC TACCTCAAGC GCACTCTCGC CGAACTGCGA GTGGTGCGCC GCGAACTCGC CGAGGCCACG GCGGGCGGCG GTGACGGCCG CCGGAGTGAA TCTGGCGATC CCATCGCGAT CGTCGGGATC GGCATCCGGT TCCCCGGCAT CCACGGACCA GCGCAGATGT GGTCGGCACT GGAGCGGGGC GCCGACCTGG TCGGCTCGTT CCCCACGGAC CGGGGATGGG ATCTCGCCGG CCTGTACCGG CCACCGGCGG GGGATGACCC GCGCTCCGGT GAGCGGGAGC CCGGCACCAG CTACGTGGAC GCAGGCAGCT TCCTGTCCGA TGTCGCGGAG TTCGACGCCG AATTCTTCGG GATCTCGCCC CGTGAGGCGC TCGCCATGGA TCCGCAGCAG CGGCTCCTGC TGCACACCGG TTGGGAGGCC ATCGAGCACG CGCGCATCGA CCCCACCTCC CTCGCCGGTG AGCGGGTCGG CGTCTACCTG GGCGCCACTG ACCACGACTA CGCGCGGGAC CGCGCCACCT GGCCCACCGA TATCGAGGGC AACGCGATGA TCGGCCGCTC CGGGGCTGCC TCCGCCGGGC GCATCTCCTA CACCCTCGGC CTCACCGGCC CGGCGATCAC CGTCGACACC ATGTGCTCGT CATCGCTGGT GGCGACGCAC CTCGCCGTGC GGGCACTGCG CCAGGGCGAG TGCACCATGG CGCTCGTGGC CGGCACCACG GTGATGTCCA CCCCCGAGGG CTTCACCGAG TTCAGCGCGC AGGGGGCGCT GAGCCCCAGC GGTCGCTGCC GGTCCTTCTC GGACGACGCC GACGGCACCG CATGGTCCGA GGGCATCGCC GCCGTCGTGC TCCAGCCTCT GAGCGAGGCC CAGCGGGACG GCCGCCGGGT GCTCGCGGTC ATCGAGGGCT CCGCCGTCAA CCAGGACGGT GCGAGCAACG GCCTCACCGC TCCGAGCGTG TCCGCGCAGC GCGATGTGAT CACCGCGGCC CTCCGCGACG CCGGCCTCGA CCCGGCGGAC ATCGATGCGG TCGAGGCGCA CGGAACGGGC ACGGTCCTCG GTGATCCGAT CGAGGCGACC GCCTTGCTCC AGACCTACGG TGCGGCACCT GCGCGCCGTC GCCCGCTGCT GCTGGGATCG TTCAAATCGA ATGTCGGGCA CTCGGCCGCC GCTGCCGGTG TCGCCGGACT CATCAAGATG ACCCTCGCAC TGACCCACGG GTCTCTGCCC CGCACACTGC ACGTGGACGC GCCCAGCACC AAGGTGGACT GGTCGCAGGG CGCCGTCGAA CTGCTCACCG CGCCCCGCCC GTGGTCCGCG ACGCCCGGGC GTCGTCGCCG CGCCGGTATC TCCGCCTTCG GCGCCAGCGG GACCAACGCG CACGTGATCG TCGCCGACGC CCCACCGGTC CCCGAGCCCA CCGGACCGGA TCCCGAGATA CTGTCGTCCA CCGGGGTCGG ACGCGGCGCT CTGGCGTGGC CACTGTCGGC CCGGTCCGCG GCCGCCCTCC CGGCGCAGGC GCAGGCGCTG CTCACCGAGG TCGATTCCGC CTTCGCCGCC GACCATCCGG ACGACGACGT CGCGCGGGAA CGCTATTGCG CCGATCTGGC GCTCTCGCTC GCCGGCCGCA CCCAGCACCC GCACCGCGTC GTGATCACCG GCGGCCCAGC ACGACTCCGC GAGCGGCTCG CCGGACTCGC CCGCTCCGAG GCCGTTCCCG AGCCCGGCGT GGTCGCCGTC GACACTGCGC GCGACCGCCT CGGTCCCGTT 
TTCGTCTTCC CCGGCCAGGG CGGCCAGTGG GCCGGTATGG CCGCGGGCCT GCTGCGTGAC TGTCCGGCGT TCGCCCACCG GTGGGAGCAG TGCGCCGAGG CGCTCGCACC GCACGTCGAT TTCGACCTCA GCGACACCGT CGCCGATCCG GAGGGGGCGT GGCTCGAGGA CGTGAGCCGC GTGCAACCGA TCCTCTGGGC GACCATGGTG TCCCTCGCCG AGGTGTGGGC GGCGGCCGGG GTGCGCCCGG CCGCCGTGAT CGGCCACAGC CAGGGCGAGA TCGCTGCCGC CTGCGTGATC GGTGCGCTCT CGCTCGCCGA CGGTGCCCGC CTGGTGGCCA CCCGCAGCCG CCTTCTGACA CGGCTGTCCG GTCGGGGAGG GATGCTCGCC CTCGGTGCGA ACCGGGAGGA AGCATTGCGT CGGCTGATCC CCGGCGTCCA GCTCGCCGCC GAGAACGGTG CCGGGTCGGT GGTGCTCGCG GGTGCTGCCG ACGCGTTACG ACGCTACGCC GATACCGCGG AAGCGGACGG CATCCGCACG CGGATGATCC CGGTGGATTA CGCCTCGCAT TCCGAGCAGG TCGACGAGAT CGCCACCGCC CTCACCGCCG CGACCGGTGA CATCGTCGCC CGGGACGCGC CCGACTCCGA GTTCTACTCC ACGGTGGCCG GCCGCACCGG CGAGCCCATC GCCACCGAGG CGCTCGGCGG CGGGTACTGG TTCGAGAATC TCCGGAATAC AGTCGAATTC GCGACAGCAG TGCGCCGAGC GATCACCGAC GGCTACGGAC TCTTCATCGA GGTCAGCCCG CATCCGCTGC TGCTCACCGC GATCGGCGAG ACGGCAGACA CACTCGGCGG AGCGGACGAT GTGGCGGTCG TGGGCACTCT CGCCCGCGAC CGGGGCGGTC TCGACCAGAT CTGGTACTCG CTGGGACAGG TGCATGCCGC CGGCGGTGCC GTGGACTGGG CGCGTGCGCT CGCCGCGTAC GCGCCGCGCA TCGTCGACCT ACCGACGTAC CGGTTCAGTA CTCGCCGGTA CTGGCTTCCG GACGGACGCG CGGCCCTGAA CCGGTACGCG CCCAACCCGA TCGGATCCGT CGACGAGTGG CGCTACCGCG TCGCCTACCT GCCCGTTGCG GCATCGACCG GATCACTGCC CGGCACCGTG ACCGTGATCA CCGATCACCA CCGCACCGGC GCGGCCGACG CACTGCGGGC CGCGGGTGCC GCCGTCACCA CCTGCCGGAT CGGCGAATGG CGAACGGATG CACCCGCTGC CGATGCATCC CTGGTGCTGC TCGGCGGCGA GGCTGAGCCC ACGGCCCACG GCGGCGCACA GTCGGTTCCA CCGGCGCTCG CCGAGGCCTT CGAACTGGTG TCCCGGCACA TCCGGGGACC CGCTCATCCG CTCTGGTTCG CCGGTGACGA AAGCGCCCCG GACGTCGCCG CCGCGTTCGC GCTGATCCGC GTCGCCGCTT TGGAATTCCC GAACCACATC GGTGGCACCG TCGATCTGCC CGACGGCGCC CTGGACGCCG ACCGACTCCT CGCGGCTCTG AGGGGTGAAC ACGATCAGGT CTCGCTGCGC ACGCCATCGG CCGGCGAGAT CGGGGTGCGC CGCCTGCTCG CCGGGCCGTT CCCCGGGAGC GGACCGCGAA CCGGTGACTG GACGCCACGC GGGACCGTCC TGATCACCGG CGCCACCGGT GGCATCGGTG GCCAACTCAC CCGCTGGCTG GCCGCCCGCG GCGCTCCACG GATGATCCTG CTCAGCCGCC GGGGTGAGGC CGCACCCGGC GCGGCCGACC 
TGGCCGACGA GCTCGCCGCC GCAGGTGCGC AGCCGCGGAT CGTGGCCGCC GATGCCTCCG ATACCGACGC GCTGGTCCGG ATCCGCGACG AGTACGCCGC CGCCGGAACG CCGATCACCT CCGTCTTCCA TCTCGCCGGT GGCGGGACCC TTGCGGATCT CATCGACACC GACGCGGCGG AGTTCGCAGC GACCGCGCAC GCGAAGATCG ACGGTGCCCG GGCCCTGGAC GCGGTGTTCC CGGATGTCGA GGACTTCGTG CTGTTCTCCT CGATCTCGGC GGTGTGGGGC AGTGGATCCC ACGGCGCCTA CGCCAGCGCC AACGCCTACC TCGATGCGCT CGCGCGCGGT AGGCGTGCCG CCGGACGAGA CGCCGTATCG ATCGTGTGGG GCATCTGGGA CCCGGCGGAC GGCGGAGGGA TGGCTGCGAA TCTGGTCCGT GAGCAACTGG CCGCCCGGGG GATCCCATTC ATGGATCCGT CCCGATCACT GCGCGAACTC GGTGCCGTTC TCGGTGGCGA CCCGCAGCCG GTCGAGGTGA TCGCGGCCGT CGACTGGGAA CGGTTCCTGC CGGTCTTCAC CTCTGCCCGC TCCAGTGGCC TCTTCGACGA GCTGCCCCGC GGCGGTGTAC CGGATGCCGG CGAATCACGT TCCGGCGGAG AACTGCCCGA TCTCGCGCGC AGGGTGCTCG ACCTGCCGGA TCGTGAACGC GATGCAGTCG TCAGCGACGT GGTCCGCGAC ACGATCGCCG CGGTGTTGCG TCTCGATCCC GGCGAGGTCG ATACCGAAAG GGCCTTCCGG GATGTGGGCA TCGACTCGCT GACCGCGATC GACACCCGCA ACCGGCTGCG CGCAGCGACC GGTATCCCGC TCCCCGTGAC CATGGTCTTC GACCACCCCA CGGTCACCGC GCTGGCCCGG TACATCACAG GGCGGCTGCT TGAAGGCAGC GAGGAGCCCA CCGCACCATC GGTCGACCGA ACGGCCCCAT CGCCAGGTCG GGGGACGTCG GTGGACCTCA CGACGCCGGC GGACGACCTG CTCGACGTGG ACGAGATGGA CATCGCGGAC CTGATCCGCG CCGCGAACGA CGCAGAGGAG GCGGCCCGAT GA
|
Protein sequence | MSEQEAKLRD YLKRTLAELR VVRRELAEAT AGGGDGRRSE SGDPIAIVGI GIRFPGIHGP AQMWSALERG ADLVGSFPTD RGWDLAGLYR PPAGDDPRSG EREPGTSYVD AGSFLSDVAE FDAEFFGISP REALAMDPQQ RLLLHTGWEA IEHARIDPTS LAGERVGVYL GATDHDYARD RATWPTDIEG NAMIGRSGAA SAGRISYTLG LTGPAITVDT MCSSSLVATH LAVRALRQGE CTMALVAGTT VMSTPEGFTE FSAQGALSPS GRCRSFSDDA DGTAWSEGIA AVVLQPLSEA QRDGRRVLAV IEGSAVNQDG ASNGLTAPSV SAQRDVITAA LRDAGLDPAD IDAVEAHGTG TVLGDPIEAT ALLQTYGAAP ARRRPLLLGS FKSNVGHSAA AAGVAGLIKM TLALTHGSLP RTLHVDAPST KVDWSQGAVE LLTAPRPWSA TPGRRRRAGI SAFGASGTNA HVIVADAPPV PEPTGPDPEI LSSTGVGRGA LAWPLSARSA AALPAQAQAL LTEVDSAFAA DHPDDDVARE RYCADLALSL AGRTQHPHRV VITGGPARLR ERLAGLARSE AVPEPGVVAV DTARDRLGPV FVFPGQGGQW AGMAAGLLRD CPAFAHRWEQ CAEALAPHVD FDLSDTVADP EGAWLEDVSR VQPILWATMV SLAEVWAAAG VRPAAVIGHS QGEIAAACVI GALSLADGAR LVATRSRLLT RLSGRGGMLA LGANREEALR RLIPGVQLAA ENGAGSVVLA GAADALRRYA DTAEADGIRT RMIPVDYASH SEQVDEIATA LTAATGDIVA RDAPDSEFYS TVAGRTGEPI ATEALGGGYW FENLRNTVEF ATAVRRAITD GYGLFIEVSP HPLLLTAIGE TADTLGGADD VAVVGTLARD RGGLDQIWYS LGQVHAAGGA VDWARALAAY APRIVDLPTY RFSTRRYWLP DGRAALNRYA PNPIGSVDEW RYRVAYLPVA ASTGSLPGTV TVITDHHRTG AADALRAAGA AVTTCRIGEW RTDAPAADAS LVLLGGEAEP TAHGGAQSVP PALAEAFELV SRHIRGPAHP LWFAGDESAP DVAAAFALIR VAALEFPNHI GGTVDLPDGA LDADRLLAAL RGEHDQVSLR TPSAGEIGVR RLLAGPFPGS GPRTGDWTPR GTVLITGATG GIGGQLTRWL AARGAPRMIL LSRRGEAAPG AADLADELAA AGAQPRIVAA DASDTDALVR IRDEYAAAGT PITSVFHLAG GGTLADLIDT DAAEFAATAH AKIDGARALD AVFPDVEDFV LFSSISAVWG SGSHGAYASA NAYLDALARG RRAAGRDAVS IVWGIWDPAD GGGMAANLVR EQLAARGIPF MDPSRSLREL GAVLGGDPQP VEVIAAVDWE RFLPVFTSAR SSGLFDELPR GGVPDAGESR SGGELPDLAR RVLDLPDRER DAVVSDVVRD TIAAVLRLDP GEVDTERAFR DVGIDSLTAI DTRNRLRAAT GIPLPVTMVF DHPTVTALAR YITGRLLEGS EEPTAPSVDR TAPSPGRGTS VDLTTPADDL LDVDEMDIAD LIRAANDAEE AAR
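The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the database's own method: it builds the codon table from the amino-acid assignments of NCBI translation table 11 (which match the standard code for sense codons), ignores alternative start codons, and uses hypothetical names (`CODON_TABLE`, `translate`). Only the first 30 and last 12 bases of the gene sequence above are used as input.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; amino acids as assigned in
# NCBI translation table 11 ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGAGCGAGC AGGAAGCCAA GCTGCGCGAC"))  # MSEQEAKLRD
# Last 12 bases of the gene sequence (ends with the TGA stop codon)
print(translate("GCGGCCCGAT GA"))                     # AAR
```

The two outputs match the first ten and last three residues of the protein sequence in the record.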