Gene Information |
Locus tag | Tpau_0141 |
Symbol | |
ID | 9154275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 146387 |
End bp | 147922 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003645134 |
Protein GI | 296137891 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
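The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(146387, 147922)
print(length)                              # 1536
print(expected_protein_length_aa(length))  # 511
```

Both values agree with the Gene Length (1536 bp) and Protein Length (511 aa) fields in the record.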
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGATC TCAGAACCGA TCACCCGTCC ACGATGCCCT TCGCGCGGCG GTGGGCCTCT GCGGCGGTGC TCTCGATCAG CCTGCTGGTG ATCACCGTGG ATCTCACGAT CCTCAACATC GCGCTGCCCG ACCTCTCCGC GGACCTGCGC CCCACCGCGG CGCAACAGCT GTGGATCATC GACGCGTACT CATTGGTGCT GGCCGGCCTG CTGGTCTCGA CCGCCTCGCT CGGAGACCGG TTCGGCCGCA AGCGAATGCT TCTGCTGGGC TACGCCGTCT TCGGCGTGGC CTCGCTACTC GTGCTGTGGG CGGATTCGCC CGGAGAGGTG ATCGCGCTGC GCGCGCTGCT GGGTGTGGGC GGCGCGATGA TCATGCCCAC CACGCTGTCC ATGCTGCGGG TGATCTTCAC CGATCCCGCC GAGCGGGCGA AGGCGCTCGG CCTGTGGGCC GCGGTCTCCG GCCTGGGCGC CGCGATCGGT CCGATCGCGG GCGGTGTTCT GCTGGAGAAT TTCTCGTGGC GTGCCGCCTT CCTGGTGAAC GTGCCGTTCA TGGCGGCTGT GCTGATCGCC GGCCTGCTGA TCCTGCCCGA GTCGACGGTG CCCAGCCCGG GCCGCTGGGA CTACGTGGGC GCGCTGCTCT CGATCACCGG CATGGTGGCC CTGGTGTGGT CGATCAAGCG GTTCGCCAAG GACCACACCT TCGCATCGAC GCCGGCCCTA GTGGCGCTGC TGCTCGCGGT GATCGCGTTG TCGCTCTTCG TCTACCGCTC GTTGCACCGC CCGGATCCGC TGCTCGACGT TCGGCTCTTC GAACGTCGCC AGTTCACCGC CGCGATTCTC GCCGCGCTGG GTGTCATGTT CGCCATGGCC GCCGCGCTCC TGCTACTGGC GCAGTGGATG CAACTGGTGG AGAACTACTC GCCGATCGAG ACCGGCGTCC GGCTGCTCCC GGTGGCGGTC GCGGCCACCG TCGCGTCGAT CGCCGCTCCC TGGCTCGCTC GTAAGCTGAA CGCGCGGATC GTGCTCGCGG GTGGGCTTGC TCTGGCCGGC ATCGGCATGG TGCTGATCGA CGCCGCGGAC CAGCTCACGT ACACCGCGAT GATCGCGCCA TTGGTGCTGG TGGGCATGGG CATGGGATCG ATGACGGTGG CCTCCGCGAT GGTCATGTCG GGCACTCCCG AGGAGAAGGC GGGTAATGCC GCCGCACTCG AAGAGACCTC GTACGACCTC GGCAACGTAC TGGGTGTCGC GGTTCTCGGC AGCATCGCCG CGATGCTGTA CACCGCCGAC GCCGATTTCG CGGCGATCCC CGGTGTGGAT GCGGCGACCG CCGATGCCGC CGGCGAATCG CTCGGTGCGG CAATGGCTAT CGCGCAGCAG GCGCAGCTGC CGGCGCTGGC AGAACACGCC GCTGCCGGGT TCACCGAGTC GCTGCAGACC ACGGGTCTGG TGGGCGGGGT GCTGCTACTG GCCGTGGCTG CGGGGGTCTA CCTGCTCACG CCGAAAGGCA CGGACATCAC CGTGCAGGCG CACTGA
|
Protein sequence | MTDLRTDHPS TMPFARRWAS AAVLSISLLV ITVDLTILNI ALPDLSADLR PTAAQQLWII DAYSLVLAGL LVSTASLGDR FGRKRMLLLG YAVFGVASLL VLWADSPGEV IALRALLGVG GAMIMPTTLS MLRVIFTDPA ERAKALGLWA AVSGLGAAIG PIAGGVLLEN FSWRAAFLVN VPFMAAVLIA GLLILPESTV PSPGRWDYVG ALLSITGMVA LVWSIKRFAK DHTFASTPAL VALLLAVIAL SLFVYRSLHR PDPLLDVRLF ERRQFTAAIL AALGVMFAMA AALLLLAQWM QLVENYSPIE TGVRLLPVAV AATVASIAAP WLARKLNARI VLAGGLALAG IGMVLIDAAD QLTYTAMIAP LVLVGMGMGS MTVASAMVMS GTPEEKAGNA AALEETSYDL GNVLGVAVLG SIAAMLYTAD ADFAAIPGVD AATADAAGES LGAAMAIAQQ AQLPALAEHA AAGFTESLQT TGLVGGVLLL AVAAGVYLLT PKGTDITVQA H
|
| |
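The sequences above are displayed in blocks of 10 characters separated by spaces. A minimal sketch of how such a string could be cleaned and its GC fraction computed is shown below. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(displayed: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two displayed blocks of the gene sequence, used as a small example.
example = "ATGACCGATC TCAGAACCGA"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))
```

Applying the same two functions to the complete gene sequence gives a value that can be compared with the GC content field of the record.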