Gene Information |
Locus tag | Tpau_2271 |
Symbol | |
ID | 9156427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2364302 |
End bp | 2365663 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
Protein accession | YP_003647218 |
Protein GI | 296139975 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
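The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2364302
END_BP = 2365663
REPORTED_GENE_LENGTH_BP = 1362
REPORTED_PROTEIN_LENGTH_AA = 453


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print("gene length (bp):", length_bp)
    print("protein length (aa):", protein_length(length_bp))
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.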
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGGA ACGGACGGTC GGGCAATCGC GCTCGTCATC GCCGGCGTCA CACCTGGCTG CTCGCCGTGC TGGACGCCGT CGTCGTGGCC GGATTCCTCG GCGCGACCTA CGACCCCGGT AGCGATCCGG CGGCACACGT GACCAAGGTG CTCGCCGGGA CCGCACTGAC GATCGGGACC TTCTACCTGT TCGGGCTCTA TCGCGCCCGA ATCACGCTCA GCGCGCTGGA CACCCTGCCG CAGTTGGTGA TCGCGGCGTG GCTTCTCGCC CCGCTGGCGA TGGCGATGCA CTGGGACGAC GACCACACAT TCATCCCCCA GGTGCTGATG TGCTGGGTGG CACTACTGAT GCTACGGGTG GTGTACTACG CGGTGGTGCG CCATCGCCGG GCCGCACATC CGGAGAACGG CGCCCGGACC CTGGTGATCG GTGGCGGCAA GGTCGCCGAC GAGCTGGTGC GCGCGATGTC CCATTACCCG GTGTACGGCC TGCGTCCAGT GCTGGTGATG GATGACGATC CGCTCGACCC GACCCTCTTC CCCACGGAGG TGATCCCCCG CCGCCACGAC CTGGCAGGTC TCATCGAGGA GCGCGATATC GAGACGGTGA TCGTGGCCTT CTCCCGCGAC CGGGATTCCA CCCTGGTGGA CCCGCTGCGC GAGTGCGACA CCCTGGATTG CGAGATCTAC GTCGTGCCGC GGCTGTACGA ATTCGTGCAT CAGGACCGCG ATATGGACCG GATCCACACC ATCCCGCTCG TCCTGGTGCG CCGCCTCGCC CTGCGCACCA GCCATTGGCG GGTCAAGCGG ATCACCTCGC TGATCACCTC GAGTGTGGGA TTGCTGCTGT TCTCTCCCCT GCTGGCCCTC GTCGCGGCCG CGATCAAACT GCAGGATCCG AGGGCGCCGG TATTGTTCCG TCAGACCCGG GTGGGCCAGG ACGGTCACCT GTTCACGCTG TACAAGTTCC GCACCATGCG CCCGGTTCCC GATGAAGTGG CCGACGCCGA CTGGTCGACC GACCGGACCA CACGTCTTGG CTCCTTCCTG CGCCGGTACT CGATCGACGA GTTGCCGCAA CTGTGGAACG TGGTGTGCGG CGACATGTCG ATGGTGGGCC CGCGGCCCGA GCGCCCGCAC TTCGTGGACC GGTTCCGCGC CGACATTCCC GCGTACAAGT CCCGGCACCG TGTGCACGCC GGCCTCACCG GCTGGGCGGC GATCCACGGC CTGCGCGGCG ACACCGACAT CCGCGACCGC GCGTCGTACG ACAACTACTA CATCGAGAAC TGGAGCCTGT GGCTCGACAC CAAGATTCTC CTGATGACCG CGGCTTCGGT CCTCCTCGGT CGCGGTCGCT GA
|
Protein sequence | MSGNGRSGNR ARHRRRHTWL LAVLDAVVVA GFLGATYDPG SDPAAHVTKV LAGTALTIGT FYLFGLYRAR ITLSALDTLP QLVIAAWLLA PLAMAMHWDD DHTFIPQVLM CWVALLMLRV VYYAVVRHRR AAHPENGART LVIGGGKVAD ELVRAMSHYP VYGLRPVLVM DDDPLDPTLF PTEVIPRRHD LAGLIEERDI ETVIVAFSRD RDSTLVDPLR ECDTLDCEIY VVPRLYEFVH QDRDMDRIHT IPLVLVRRLA LRTSHWRVKR ITSLITSSVG LLLFSPLLAL VAAAIKLQDP RAPVLFRQTR VGQDGHLFTL YKFRTMRPVP DEVADADWST DRTTRLGSFL RRYSIDELPQ LWNVVCGDMS MVGPRPERPH FVDRFRADIP AYKSRHRVHA GLTGWAAIHG LRGDTDIRDR ASYDNYYIEN WSLWLDTKIL LMTAASVLLG RGR
|
| |
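The gene and protein sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that formatting, compute a GC fraction, and translate a nucleotide sequence codon by codon. It assumes the amino acid assignments of NCBI translation table 11 (as listed in the Translation table field), which are the same as those of the standard code, and it does not model alternative start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first two and last two blocks of the gene sequence are used as example input.

```python
# Codon table in the conventional NCBI ordering (bases T, C, A, G).
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(sequence: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(sequence.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(sequence)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(sequence: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(sequence)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two and last two blocks of the gene sequence shown above.
GENE_START = "ATGAGCGGGA ACGGACGGTC"
GENE_END = "CGCGGTCGCT GA"

if __name__ == "__main__":
    print(translate(GENE_START))
    print(translate(GENE_END))
    print(gc_fraction(GENE_START))
```

Applied to the full gene sequence, the same functions could be used to compare the translation with the listed protein sequence and the GC fraction with the reported GC content.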