Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0844 |
Symbol | |
ID | 9154984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 854711 |
End bp | 855922 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003645818 |
Protein GI | 296138575 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAGTG TTTTCGTGTC CCTGCGGACG CGGAACTATC GGCTCTGGGC CGGCGGACAG TCGGTCAGCC TCGTCGGCAC CTGGATGCAA CGCGTCGCCG AGGACTGGCT GGTGCTCGAT CTCGCGCACG GCCGCGCCTG GGTGCTCGGT GTGGTGATGG CCCTGCAGTT CGGCCCCACC CTGCTGCTAT CGGCGTGGGC GGGATTGCTC GCGGACCGCT ACGACAAGCG GCGGGTCCTG ATGCTCACAC AGTCGGCGGC CGCACTCTGC GCCGCGGCCC TCGCGGTGCT CACGCTCTCC GGCATCGTCG CGCTGTGGCA CGTCTTCGTG ATCGCCTTCG TCTTCGGATG TACCGCGGCG ATCGCGGGTC CCTTCCGGCA GGCGTTCACC ATCGAGATGG TGGGGCCGGA GTTGTTGCCC AATGCCATCG GGCTCAATTC GATGGTGTTC AACGCCGCCC GAATCGTGGG GCCGGCGATC GCGGGCCTGC TCATCGCCGG CGTGGGCACC GGCGCGGTGT TCGCGGTCAA CGCCGTGTTC ACCGGCGCGA TCGTGGCTGC GCTGCTCGCG ATGCGAGTGG CACAGCTGCA CCCGTCGCCA CCGGTGGAGC GCGCGAAGGG CCAGGTGCGC GAGGGCTTCC GGTATGTCCG ACACAGGCCC GAACTGCTGC TGGTGATCGT CGCGGTGTTC TTCGTCTCGA CCTTCGGCAT CAACTTCCCA CTGGCCCTGT CGATCCTGGC CAGGCAGGGC TTCGGTCTCG GCGCCGATGC CTACGGCCTC CTGTCCACCA TGCTCGCGAT CGGCACGCTC TCCGGAGCAC TCGTCGCCGC GAAACGCACC GGACGAGCAG CGTTGCGCAC GTGCCTGGTG GGCGGAACGG CGTTCGGTGT GGTCCAGGTG TGCACCGGCC TGGCTCCGTG GTTCTGGCTG GCCGCCGCGC TACTGATCGC GGTGGGATTC CTGCAGATGG CCTTCACCAC CTCGGCGATG AGCATCATGC AACTCTCCGT GGATCCCGAG TTCCGCGGCC GGGTCATGGG CATCTACATG CTGGCCTTCC TCGGCGGCAC ACCGCTGGGT GCGCCGCTCC TCGGCGCCAT CGCCGACGCG ACGACGCCGA CCGCGCCCCT CCTCGTCGGC GGGGTGGTCT CCGCTCTCAC CTGCGCCCTG TGCGGGATGT ATGCGCTGCG TGGGTCGCGG AACAGCGTCT GA
|
Protein sequence | MPSVFVSLRT RNYRLWAGGQ SVSLVGTWMQ RVAEDWLVLD LAHGRAWVLG VVMALQFGPT LLLSAWAGLL ADRYDKRRVL MLTQSAAALC AAALAVLTLS GIVALWHVFV IAFVFGCTAA IAGPFRQAFT IEMVGPELLP NAIGLNSMVF NAARIVGPAI AGLLIAGVGT GAVFAVNAVF TGAIVAALLA MRVAQLHPSP PVERAKGQVR EGFRYVRHRP ELLLVIVAVF FVSTFGINFP LALSILARQG FGLGADAYGL LSTMLAIGTL SGALVAAKRT GRAALRTCLV GGTAFGVVQV CTGLAPWFWL AAALLIAVGF LQMAFTTSAM SIMQLSVDPE FRGRVMGIYM LAFLGGTPLG APLLGAIADA TTPTAPLLVG GVVSALTCAL CGMYALRGSR NSV
|
| |