Gene Information |
Locus tag | Tpau_0209 |
Symbol | |
ID | 9154343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 222504 |
End bp | 223862 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003645202 |
Protein GI | 296137959 |
COG category | |
COG ID | |
TIGRFAM ID | |
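The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (which is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming one trailing stop codon that is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Gene Information table for Tpau_0209.
length = gene_length_bp(222504, 223862)
print(length, protein_length_aa(length))  # 1359 452
```

Both results agree with the Gene Length (1359 bp) and Protein Length (452 aa) rows.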
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATCG ATGAGAACGT CCGGACCGTT CCCGATCGGG GCCGGCTCCG CTGGTTCCTG ATCGGATCGG TCGCGCTGAT CTCGTTCTCC GCCTTCGAGT CCCTCGCAGT GGGAACGGTG CTGCCGCGGG CGGCGGAGCA GTTCGGCGCC ACCGCCGAGT ACTCGGTGGC CTTCGGTGCG GCGTTCGCGG CGATGATCGT GGCGATCGCC TGGGTGGGGC CGTGGGTGGA CCGCTCCGGG GTGCGGCCGC CGCTCATCGC CGGTGCCCTG TTGTTCGCCG CCGGCCTGCT GATCGTGGGG TCGGCGCCGG GGATGACCGT GCTCGCCGTC GGTCGTGGCG TGCAAGGCCT GGGCAGCGGC CTGATCAGCG TGGTGCTGTA CGCGATGGTC GGCCGACTGA TCCCGCCGGC GGGACGCCCA CGGGTGTTCG CCGCGTATTC GACGGCGTGG GTGGTGCCGT CGCTGGTCGG CCCCGCGATC GCCGGCCTCG CGGCCGGGAC GGTGGGCTGG CGCTGGGTGT TCCTCGGACT GGCCGCGCCC TCCTTGCTGG CCCTGTTCGC GACATTGCGG GCAACCACGG GGCTCGATGA ATCCGCCTAC CGCAACGACG ATCCGCCGAA CCGCGGAGTG CTGTTCGCGG CACTGGCGGC GGGAACAGCC GCCGCGGTCG CGCAGTTCGC TGCGCCGCAG GGCGGCTCCG GGTTCCTGCT GCTGGCTGCG GCAGCATTCG TGATCGCGGT GATCGCGGCC CGGCGAATGC TCCCCGCCGG ATCGCTGATC GGCCGTCCCG GGATTCCGCG CCTGATGCTC GCGAACCTGC TCATGGCCGG GTCGTTCTTC GCCGCGGAGA TCTACGTGCC GCTCTATCTG GTGCACATCG ACCGGCTCTC GCCCTTCGCG GCCGGATCCG TGATGACCGG CGCCGCACTA CTGTGGGCGC TGGCCTCGCA GCTTCAGGCC CGTATCCCAC CGGACGGCGC ACTGCGCGAA CGACTTCCAC TGATCGGTTC GAGCCTGCTG ACGCTGGCGC TGGTCGCGGT GGCAACATCG TTCGCGGTCG ACGCACCGTG GCCGGTGATC TTCGCGACGT GGGCGATCGG TGGATTCGGC ATGGGCCTGA CCTATCCGAC GCTGTCGATC CTCATGCTCG GTCGCAGCGC AGACGACGAG CAGGGCACGA ACTCGAGCGC GCTCAAGCTC TCCGATGCCG TGGGCACTGC GCTCGCGATC GCCTGTACCG GAGCACTGTT CGCACAGGTC CTCGCGTGGG GTAACTCCGG GTTCGCGGTC ACCTTGCTGG TGCCGCTCGC TTCCGGGCTG GCGACGGTGG CGGTCACCTC GCGCCTCACG GCCGAATAG
|
Protein sequence | MTIDENVRTV PDRGRLRWFL IGSVALISFS AFESLAVGTV LPRAAEQFGA TAEYSVAFGA AFAAMIVAIA WVGPWVDRSG VRPPLIAGAL LFAAGLLIVG SAPGMTVLAV GRGVQGLGSG LISVVLYAMV GRLIPPAGRP RVFAAYSTAW VVPSLVGPAI AGLAAGTVGW RWVFLGLAAP SLLALFATLR ATTGLDESAY RNDDPPNRGV LFAALAAGTA AAVAQFAAPQ GGSGFLLLAA AAFVIAVIAA RRMLPAGSLI GRPGIPRLML ANLLMAGSFF AAEIYVPLYL VHIDRLSPFA AGSVMTGAAL LWALASQLQA RIPPDGALRE RLPLIGSSLL TLALVAVATS FAVDAPWPVI FATWAIGGFG MGLTYPTLSI LMLGRSADDE QGTNSSALKL SDAVGTALAI ACTGALFAQV LAWGNSGFAV TLLVPLASGL ATVAVTSRLT AE
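The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAG, so no reverse complement is applied despite the minus strand), and it uses the standard codon assignments, which translation table 11 shares. Table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. The names `translate` and `gc_fraction` are hypothetical helpers.

```python
# Compact encoding of the standard codon assignments, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACCATCG ATGAGAACGT CCGGACCGTT"))  # MTIDENVRTV
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content row.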