Gene Information |
Locus tag | Tpau_4321 |
Symbol | |
ID | 9158503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014159 |
Strand | - |
Start bp | 80430 |
End bp | 83558 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | Acyl transferase |
Protein accession | YP_003649221 |
Protein GI | 296141979 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
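The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length(80430, 83558)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the result agrees with the listed gene length (3129 bp) and protein length (1042 aa).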
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.113206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCCC CGACCCTCTC CCGGTCCGGT TCGACGCCGG GGCTCCGTCC CCGGATCGAC GGTGCACCCC TGCTCCCGGA CGGCCTCCCG GCCTTCGTGC TCAGCGCCGA CGGTGCCGAT TCCCTCGCCG CCGCCGCCGC GCAGCTCCGC CGCTACCTCA CCGAGCACCC GCAGGTACCG CTGCCCGCCG TCGCCGCGGC GCTGGTCGCG ACGCGCGACC TGCGGCGGCA CCGCGCGATC GTGCACACCG ACGACCGCGG CGAGCTCGTC ACCGCCCTCG ACGCGCTGGC AGCGGACCCG GTGCGCCCGG TCGGTACCGC CGTTCCCGGC CTGCACACGG GCGTCGCCGC CGCGGGCGCG CCGGCGTTCG TCTTCCCGGG CCAGGGCAGC CAGCGCCGGG GCATGGGCGC GCTGTTCCAC CGCGAGTCCG CGGTGTACCG CGACACGGTC GAGAAGATCC ACGCGATCGC GCTGGAGGTC TTCGGCTCGT CGGCCCGCGA CTACCTGCTC GGCACCGGCG AGTGGGAGCA GGGCGGCCGG CCCGTCCCCG TCGAGGTGGT CCAGCCCGCG ATCTTCATGC AGGCGCTCGG CCTGGCCGCG ATGTGGCGGG CAGCCGGCAT CGAGCCGCGG ATCCACGTCG GGCACAGCCA GGGTGAGATC GCCGCGGCGG TCGCCGCGGG CACCGTCGGC CTCGCCGACG GGCTGCGGCT GGTCACGCGG CGCGCCCTCG CGGTCCGCGA CAACGCGCCC ACCGGGCATT CGATGGCGGT GCTCGGCACC GACCGCGAGC GGTGCGCGGC CATGCTGGCC CGGACCGTCG GCTTCGCGGA ACTGTCCGTG GTGAACTCGG CGCACGTGCT GTGCGTCAGC GGTGAGCGCG ACGTGGTGGC GGGGCTGGTC GCGCAGGCCA CCGAGCGGGG GATCTTCGCC CGCGAGATCC GGGTCGAGTA CCCCGCGCAC ACCTCGCTCG TGGGCTCCAT GTGGCACCTG CGGGAGAAGT GGATGGGCGT GATGGACCAT CCCGCGTTCC TGCCCACCGA GCACCTGCTC ATCGGCGGCA CGCTCGGCGA GGCCGTCCCC GCCGACACCG ATTTCCTCGA CTACTGGTTC TGGAACCTCA AGAACCCGGT CCGGTTCGAC CTCGCGACGC GGGCCGCACT CGACGCGGGC GCCGACCGGC TGATCGAGCT CGCCGAGCAC CCCACGCTGC AGCTCGCACT GCACGAGAAC ATCGCCGACG CCGGTGCCCG CGCCACCGTG ACCGGCAGCT CGCGGCGCGA CGCGACCGAC CTGTCGGAAT TCTCCTCCGC GGTCGCCGAC GTGCTCGTGA CCCACGCCCG CGACCTCGAC GTCCAGCGCA CACTGGTCGC GGAGGAACTG CCGCCCGGCT TCCCCGCGGC CCCGCTGTCC CGGCAACGCC TGTGGGCCGC CCTCCCGGGC CGGCCCGCGA CCGCGGGGCC CGTCCGGCCC CGCACGCGTG TCCTCGACAC CGTCTGGACC GATCTCGACG CACCGGTCAG CGCGCCACCG CGACCGCTCG CGATCATCGA CCCCACCGGG GCGCACGCCG ACCTGGCCGC CGCCCTCCTC GACGCCGCGG CCCGGTACGG GACACCCGCC CGCCGTAGCG ACCGCGCCGG TGACGACGAG ATCGCGGTGG TCCTCGTCCC GGGATCCGCG GAGACCGACA CGACGATCGC CGCCGTCGGC GAGTTGCTCG CCGATCGCCG CTGGTGGGCC GGAGTACAGC CCGCGGCCGG GATCGCCGCC GTCACGGCGG GGGCGGTCGT CGCGGACCCG 
GCCGACCCGG GGCCCGACGG TGCCGCCGCG GCGATCGCCG TCGGTTTCCG CGCGTTCGGC GCGGACCTGC CCGGCGTCGA GGTCCGGCAC CTCGATCTCG ATCCGCGCGC CGACGCGGCG GCGCAGGCCG GCACCGCGAT CCACGCGCTG CACGTGGCGG GCGAGCCGCG TCTGGCCCTG CGGTCCGGCG CCGTGCGCGC CGAACGCTGG GTCGACGCCG AGCCCACCGA AACGGGCACG GAGCCTCCTG CGGGACTGCT GGAGACTGCG CGGAACGTAG TGATCAGCGG GGGTACCGGA CACCTGGGGC TCGCGTTCGC GGCGCACGCG GCGGCCCACG GCGCCGCCTC GGTCACTCTG CTCTCCCGCT CGGGCGGGGG AACGACCGTC CGGCACGCGC TGGCCCGGAT CGCCCGCCGC CATCCCGCCT GCGCGGTCAC GGTGGTGCCG TGCGATGTCA CGGACCCGGC GGCCGTCGCC ACCGCCCTCG CCGGCGCCGG GCGCGCGATC GACCTCGTCG TGCACGCCGC CGTCGGATAT CGACGTTGTG CCGCAACGGA TCTCGATGCA TCGGACTTCA CCGCCGCGGC CGCGGCCAAG GTGGGCGGAC TACGCACGCT CGCGCAGGCG GTGCCGGATG CGACGCTGCT GACCTGCAGC TCCGCCGCCG CCGCGCTTCC CGGCGCGGGC CAGGCCTGGT ACGCCGCGTC GAACACGCTG GCCGAGGCCG AGGCCGCCGC GCTCCGGCGG GCCGGACGAC GGGCCGCCGC CGTGCGGTGG GGCCTGTGGG AGCAGGCCGG TCCGCTCGAC GAGGCGGGCT TCGCGGCCGT CACGGCGGCC GGGGTGATCC CGCTGGCCGC CCCCGACGCG CTGGCCGCGC TCGCCCGCTC GACCGGTCCC GAGCCAGTGA TCACCGCCGT CGACCTGCCG CGGCTGCGCG ATGTCGCCGC GGCGTTCGGC GCGGCGGCGC TGCTCACCGA TCTCACCGAG GACACCACCC CCGCCGTTCC GGCACCGTCG GACGCGGTCC CCGGGACAGC GCCCGACGCC CCCGCGACCG TGACCGTGAC CACCGACGCC GCCCCCGGAC CCGCGCCGGG CGGGGGCGCC GATGTCGCGG CGGTGCTCCG GCACCACCTC GCGCGGGCGC TCGCGGTGCC CGCGGACACG CTGGACCCCG ATGTCGCGCT CGTCGCCCTG GGACTCGACT CCCTGCAGGC CCTGGAGCTG CGCACCGCGG TCCGCGACGA ACTGGACGCC GAACTCCCGC TCGAGGCGAT CCTCGGCGGC GCGACGCTCG CCGAGGTCAG CGCGACCCTG GCCGGCTGA
|
Protein sequence | MSSPTLSRSG STPGLRPRID GAPLLPDGLP AFVLSADGAD SLAAAAAQLR RYLTEHPQVP LPAVAAALVA TRDLRRHRAI VHTDDRGELV TALDALAADP VRPVGTAVPG LHTGVAAAGA PAFVFPGQGS QRRGMGALFH RESAVYRDTV EKIHAIALEV FGSSARDYLL GTGEWEQGGR PVPVEVVQPA IFMQALGLAA MWRAAGIEPR IHVGHSQGEI AAAVAAGTVG LADGLRLVTR RALAVRDNAP TGHSMAVLGT DRERCAAMLA RTVGFAELSV VNSAHVLCVS GERDVVAGLV AQATERGIFA REIRVEYPAH TSLVGSMWHL REKWMGVMDH PAFLPTEHLL IGGTLGEAVP ADTDFLDYWF WNLKNPVRFD LATRAALDAG ADRLIELAEH PTLQLALHEN IADAGARATV TGSSRRDATD LSEFSSAVAD VLVTHARDLD VQRTLVAEEL PPGFPAAPLS RQRLWAALPG RPATAGPVRP RTRVLDTVWT DLDAPVSAPP RPLAIIDPTG AHADLAAALL DAAARYGTPA RRSDRAGDDE IAVVLVPGSA ETDTTIAAVG ELLADRRWWA GVQPAAGIAA VTAGAVVADP ADPGPDGAAA AIAVGFRAFG ADLPGVEVRH LDLDPRADAA AQAGTAIHAL HVAGEPRLAL RSGAVRAERW VDAEPTETGT EPPAGLLETA RNVVISGGTG HLGLAFAAHA AAHGAASVTL LSRSGGGTTV RHALARIARR HPACAVTVVP CDVTDPAAVA TALAGAGRAI DLVVHAAVGY RRCAATDLDA SDFTAAAAAK VGGLRTLAQA VPDATLLTCS SAAAALPGAG QAWYAASNTL AEAEAAALRR AGRRAAAVRW GLWEQAGPLD EAGFAAVTAA GVIPLAAPDA LAALARSTGP EPVITAVDLP RLRDVAAAFG AAALLTDLTE DTTPAVPAPS DAVPGTAPDA PATVTVTTDA APGPAPGGGA DVAAVLRHHL ARALAVPADT LDPDVALVAL GLDSLQALEL RTAVRDELDA ELPLEAILGG ATLAEVSATL AG
|
| |
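The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes a GC fraction, and translates the reading frame. It assumes the amino acid assignments of translation table 11 match the standard code (the tables differ in permitted start codons), and all function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate frame 1 up to, but not including, the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
sample = "ATGAGTTCCC CGACCCTCTC CCGGTCCGGT"
print(translate(sample))
print(gc_fraction(clean_sequence(sample)))
```

Translating the first three blocks gives `MSSPTLSRSG`, which matches the first block of the listed protein sequence. Running the same functions on the full gene sequence would be one way to check the reported protein length and GC content.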