Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0066 |
Symbol | |
ID | 9154200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 68178 |
End bp | 69494 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | O-antigen polymerase |
Protein accession | YP_003645059 |
Protein GI | 296137816 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.059458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGCGAT TGGGGCTGAA CCGGATCGCT GCGCCGGGAG CGCTGTTCCT CGGCGTCGCC GGCTTCTTCC TGCTCAGTAT GCGACAGGTT CTTACGGTCC CCGGTACGTT CGGCTACAGC CTCGCCCAGA CACTGTTCAT GGGCGGCGCC CTTCTCTGGT TCGCGACGGT ACTCACCGGT CACACCCGTC TGGTGCGAGA TTGGGCGGTG ATCGCCGCCG TGCTGGGTTA CGTCTGCGCG TCGTTCATCT CGTATGCCGC GGCAGCTGCT CGCGGAATTC TGCCGATCTC GCAGAGCGCT GTCGATCGGT ATGTCCTTAC GGACCTGATG CTTGCCGGCA CTGTACTGCT GACTCTCACC GTGCTCCGCA CCCAACACGC CCTGCGCGTG ATGCTCGGTG GGCTCGTGCT CGGCGGCACG GTGAGCGCGC TGTTCGCGTT GTTGGACTCG AGTACAGGCA TCGACATTGC GGCGCAGTTC CGGATTCCCG GGCTTACCAA GGGGAGCGAT TACGTCCTCA TCGAGAGCCT TGATCGCGCC GGCGTCACTC GGCCGCAGGG TAGTGCCGGG CATCCGCTGG AACTCGGCGC GGTTCTCACG GTGCTCGCGC CGCTCGCGTT GGCTCTGTAC TTCAACGCAA AGAGGCGGGC GGCGGACCCG AGAGTGGTTC GCCTGTGGCT GGCATGCACT GTGATCATTC TGACCGCCGA TCTGGCAACC GTATCCCGGT CGGCAGTTGC GGGTGGCCTC GCTGCGATCG CGGTGATGTG CTGGCGCTGG CCGGTGCAGC GGCTCGCGGC GATTCTCGGA ACCGCTACGG CCGTGGTGGT CGTCGGTGTG GTCGGCCAAT TGACGTTGGT GACAGCCCTG ATTGAGACCT TCGCCGGCTC ATCGAAGGAT CCGTCGTTGC AGTCCCGGGA GGTCGGGCGG AACTTCGTGG CGGAGAATCT CGCCGACAAC TTCTGGTTGG GCCAGGGCGT CGGCGCATAC CCGACCCTGA AACAGCCCGT GCTCGACAAC CAATACCTGT CCAGGCTGAT GGAAGCGGGA ATCTTTGGCC TGCTGAGCTT GTGCATGGCA CTCGTTGTCA CCCTGTATCT CGCCGTGCGC GCTTCGCAGT CGAAGGATGA TGCGCTGGCC GAGCTAGCCG GCGGCATCAG CGGATCCGCG GCGGCGCTGA TCGTCATCTG CCTGATTCTC GACACCTCTG GCTTCGGGCA GATTTGGTAC TTGACCTGGA TCCTGCTGGC CCTCGCCGGC GTCGTCTATC GGCTGTCGCG GGTTCGCGAG GGCGAGCTCG CCGCGCCACC GCCCTGA
|
Protein sequence | MERLGLNRIA APGALFLGVA GFFLLSMRQV LTVPGTFGYS LAQTLFMGGA LLWFATVLTG HTRLVRDWAV IAAVLGYVCA SFISYAAAAA RGILPISQSA VDRYVLTDLM LAGTVLLTLT VLRTQHALRV MLGGLVLGGT VSALFALLDS STGIDIAAQF RIPGLTKGSD YVLIESLDRA GVTRPQGSAG HPLELGAVLT VLAPLALALY FNAKRRAADP RVVRLWLACT VIILTADLAT VSRSAVAGGL AAIAVMCWRW PVQRLAAILG TATAVVVVGV VGQLTLVTAL IETFAGSSKD PSLQSREVGR NFVAENLADN FWLGQGVGAY PTLKQPVLDN QYLSRLMEAG IFGLLSLCMA LVVTLYLAVR ASQSKDDALA ELAGGISGSA AALIVICLIL DTSGFGQIWY LTWILLALAG VVYRLSRVRE GELAAPPP
|
| |