Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3923 |
Symbol | |
ID | 9158104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 4045901 |
End bp | 4047259 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003648834 |
Protein GI | 296141591 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0699267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTGACG GGGGTCACGG ATTCAGTAAT CTCTGCGCAA TGAATCGCAG GCGAATCGTC GTCGCCTCCA TGGTGGGCAC CACCATCGAG TTCTTCGACT TCTACATCTA CGCGACCGCG GCAGTGCTCG TGTTCCCCAC CTTGTTCTTC CCGAAGGGGG ACGACACCGC GGCACTGCTC GCCTCGTTCG CCACCTTCGG TCTGGCGTTT GTGGCGCGGC CTGTGGGGTC GATCCTGTTC GGCCACTTCG GTGATCGCGT GGGCCGCAAG GCCACCCTGG TGGGTTCGCT GCTCACCATG GGTATCGCGA CCTTCCTCAT CGGCGTGCTG CCCACCTTCG ACCAGGTGGG CTACTGGGCG CCGGCGCTGC TGGCGCTGAT GCGATTCGCA CAGGGCCTGG GATTGGGCGG TGAGTGGTCG GGCGCCGCGC TGCTGGCGAC CGAGACCGCC GCTCCGGGCA AGCGAGCGTG GGCCGCGATG TGGCCGCAGC TGGGTGCGCC GTTCGGGTTC TTCCTGGCCA ACGGGACGTT CCTGGTGATC ATGCAGGTGA TGGACTTCGA TTCGAAGACG TCCGCCTCCA ATCACGCGTT CATGACCTGG GGTTGGCGCA TCCCGTTCCT CGCCTCGGCC GTGATGGTGA TCGTGGGACT GTACGTGCGG CTCAAGCTCA CCGAGACCCC GGTCTTCGCC AAGGCCGTCG AGGACGGCAA GAAGGTGAAG GCCCCGCTGG GTGAGGTGCT GCGCACGGCA TGGCGGCCGC TGATCATCGG CACGTTCGTC ATGGTGGCCA CGTACACGCT GTTCTACTTG GTGACCACGT GGATCGTGTC GTACGGCACC GGCAAGGTGG TCGACAAGTC GGGTGTGAAG CTGGGCATCT CGTACATCGA CTTCTTGCAG ATGCAGCTGG TCGCCGTGCT CTTCTTCGCC GCCTTCGTCG CGGTATCCGG GTACTTCGCC GACCGCGTCG GCCGACGGCT GGTGCTGATC GGCGCCACGC TGGCCATCAT CGGTTTCGGC CTGTCGTTCA AGTGGATCCT GGATCCGTCG ACCACCACGC AGGGGTCCAT GCTGGCGGTG TTGATCGTGG GACTGACCCT GATGGGCATC ACGTTCGGCC CGATGAGCGC GGTGCTCCCG GAGCTGTTCG CCACCAACGT CCGCTACACG GGCAGCGGTA TCGCGTACAA CTTCGCCTCG ATCCTCGGTG CCGCCATCGC ACCGTTCATC GCCACCTGGC TGGTCAGTGA CTACGGCGTC GGTTGGGTCG GCGTGTATCT CGCGTTGGCC GGTGGCGCCA CGCTGATCGC CCTGCTCGCG ATGCACGAGA CCCGCGACGT GGAGCTCGAC AAGGTCTGA
|
Protein sequence | MGDGGHGFSN LCAMNRRRIV VASMVGTTIE FFDFYIYATA AVLVFPTLFF PKGDDTAALL ASFATFGLAF VARPVGSILF GHFGDRVGRK ATLVGSLLTM GIATFLIGVL PTFDQVGYWA PALLALMRFA QGLGLGGEWS GAALLATETA APGKRAWAAM WPQLGAPFGF FLANGTFLVI MQVMDFDSKT SASNHAFMTW GWRIPFLASA VMVIVGLYVR LKLTETPVFA KAVEDGKKVK APLGEVLRTA WRPLIIGTFV MVATYTLFYL VTTWIVSYGT GKVVDKSGVK LGISYIDFLQ MQLVAVLFFA AFVAVSGYFA DRVGRRLVLI GATLAIIGFG LSFKWILDPS TTTQGSMLAV LIVGLTLMGI TFGPMSAVLP ELFATNVRYT GSGIAYNFAS ILGAAIAPFI ATWLVSDYGV GWVGVYLALA GGATLIALLA MHETRDVELD KV
|
| |