Gene Information |
Locus tag | Tpau_3504 |
Symbol | |
ID | 9157679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3614737 |
End bp | 3616473 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003648422 |
Protein GI | 296141179 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
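The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3614737
END_BP = 3616473


def gene_length_bp(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_length):
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "1737 578"
```

The result matches the Gene Length (1737 bp) and Protein Length (578 aa) entries in the table.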
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCGTTCC GCGATATCGA CACCCGCGAC GGCTACAAGT GGATCGCGCT CTCCAACACC ACCCTGGGCA TGCTGATCGC CACGATCAAC AGTTCCATCG TGCTGATCGC GCTGCCGGAC ATTTTCAAGG GCATCCACCT CAACCCGCTG GAACCGGGCA ACACCAGCTA TCTGTTGTGG ATGATGATGG GTTTCCTCGT GGTCACCGCA GTGCTGGTGG TGGCCTTCGG CCGACTCGGC GACATGTTCG GCCGCGCCCG CATGTACAAC ATGGGCTTCG CCATCTTCAC CGTCTCGTCG ATATTCCTGG CGGTCACCTG GTTCGACGGC CCCGCCGCCG CACTCTGGCT GATCGCCTGG CGCGTGGTGC AGGGTGTGGG CGGCGCCTTC CTCATGGCCA ACAGTTCGGC CATCCTCACC GACGCCTTCC CGCAGAATCA GCGCGGTCTC GCGCTCGGCA TCAACGGCGT CGCGGCGATC GCCGGCTCGT TCCTCGGCCT GGTGATCGGC GGCGTCCTGG CGCCCGTCAA CTGGCACCTG GTGTTCCTGG TGTCGGTGCC GTTCGGCGTG GTCGGCACGA TCTGGGCGTA TCTCAAGCTC GAGGATCGCG GCGAACGAGT CCCCGCCACC ATGGACTGGT GGGGCAACAC CACTTTCGCG GTCGGACTGA TCGCGGTGCT GGTGGGCATC ACCTACGGCA TCCAGCCCTA CGGCGGTGAC GCGATGGGAT GGGGTTCGCC GTTCGTCCTG TCCTGCCTGA TCGGCGGTGC CGCGGTGCTG GCGGTGTTCT GCTACATCGA ACTACGCGTG CCGGCTCCAC TGTTCGACCT GCACCTGTTC CGCAGCAAGG ACTTCCTGTG GGGCAACGTC GCGAACCTGT GCGGATCGTT GGGCCGCGGC GGTCTGCAGT TCATGCTGAT CATCTGGCTG CAGGGAATCT GGCTGCCACA GCACGGTTAC GACTACACGC AGACCCCGCT GTGGGCGGGT ATCTACATGG TCCCGCTCAC CGTCGGGTTC CTCCTCTCCG CCCCCGCCGC GGGCGCCCTG TCGGACCGGA TCGGCGGACG GTCACTGAGC GCTGCGGGTC TGCTCATCAC CGCACTCACA TTCCTCGCGC TGATCGCGCT GCCGGTCGAC TTCCCGTACT GGGCGTTCGC CCTGATCCTC TTCGTCAACG GGATCGGCAT GGGGATGTTC GGCTCACCCA ACCGCGCCGT GGTGATGAAC TCGCTACCCG CCACGTCGCG CGGCTCGGGA TCGGGAATGA TGACCACGTT CCAGAACGCG GCCATGGTGC TCTCGATCGG CCTGTTCTTC TCGCTGATGA TCGGCGGGCT CGCGGGGTCG CTGCCGGGTG CCATGTCGAC GGGGCTCCAG GCGAACGGCG TGCCCGCGCA GTACGCGAGC GAGATCGCCG CACTACCTCC GGTCGCGGTG TTGTTCGCAG CCTTCCTGGG CTACAACCCG ATCCAGCAGC TGCTCGGCCC GCACCTCGCT CAACTCAACC TGACCGCCGA GCAGTCCCAG CACCTGACCG GCTTGCAGTT CTTCCCCCAC CTGATCTCCG AATCGTTCCG GTCGGGCCTC GAGCTCGCCT TCTCGTTCGC GGCGGTCGTC TGCCTGATCG GCGCAGTCGC GTCGCTGCTC ACCGGCCGGG AGAAGCCCGA CGGTGCGCCC GACGCCGAGA CCCTCGCCGC GGAGGCCGAC GAGGTCGACA CCGAGGCCCT GTACTGA
|
Protein sequence | MAFRDIDTRD GYKWIALSNT TLGMLIATIN SSIVLIALPD IFKGIHLNPL EPGNTSYLLW MMMGFLVVTA VLVVAFGRLG DMFGRARMYN MGFAIFTVSS IFLAVTWFDG PAAALWLIAW RVVQGVGGAF LMANSSAILT DAFPQNQRGL ALGINGVAAI AGSFLGLVIG GVLAPVNWHL VFLVSVPFGV VGTIWAYLKL EDRGERVPAT MDWWGNTTFA VGLIAVLVGI TYGIQPYGGD AMGWGSPFVL SCLIGGAAVL AVFCYIELRV PAPLFDLHLF RSKDFLWGNV ANLCGSLGRG GLQFMLIIWL QGIWLPQHGY DYTQTPLWAG IYMVPLTVGF LLSAPAAGAL SDRIGGRSLS AAGLLITALT FLALIALPVD FPYWAFALIL FVNGIGMGMF GSPNRAVVMN SLPATSRGSG SGMMTTFQNA AMVLSIGLFF SLMIGGLAGS LPGAMSTGLQ ANGVPAQYAS EIAALPPVAV LFAAFLGYNP IQQLLGPHLA QLNLTAEQSQ HLTGLQFFPH LISESFRSGL ELAFSFAAVV CLIGAVASLL TGREKPDGAP DAETLAAEAD EVDTEALY
|
| |
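The gene and protein sequences above are related through translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is one way to reproduce that relationship and the GC content figure, under stated assumptions: the helper names are hypothetical, the start-codon set is the one commonly listed for table 11, and only the first 30 bases of the gene sequence are used, to keep the example short. Whitespace is stripped so that the 10-base groups shown in the record can be pasted in directly.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional T, C, A, G ordering.
# Table 11 assigns the same amino acids as the standard code; it differs
# in which codons may act as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed start-codon set for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate a CDS; an alternative start codon is read as methionine."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = "TTGGCGTTCC GCGATATCGA CACCCGCGAC"
print(translate(fragment))  # prints "MAFRDIDTRD"
print(round(gc_fraction(fragment), 2))
```

The fragment begins with TTG, which would normally encode leucine but is read as methionine when it serves as the start codon. This is why the protein sequence begins with M even though the gene sequence does not begin with ATG. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content entry.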