Gene Information |
Locus tag | Namu_3852 |
Symbol | |
ID | 8449471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4223836 |
End bp | 4225176 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645042901 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003203137 |
Protein GI | 258653981 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00895] benzoate transport |
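The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the reported protein length excludes the stop codon. Both are assumptions about the database's conventions, not something the record states, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4223836
end_bp = 4225176

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon,
# which is assumed not to be counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length:    {gene_length} bp")
print(f"Codons:         {codon_count}")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1341 bp) and Protein Length (446 aa) fields.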
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00000421805 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.905597 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGGAT CCGCGGTCGT GAACCCGCGC GCCCAGTACT CCGGTTGGGT CTCGCCCCTG TGCTGGATTG CCGTCGCCCT CGAAGGATTT GACCTGGTCG TGCTGGGGGT GGTGTTGCCC GCGCTGCTCA AGTACGACGA CTGGGGGCTC AACCCCAATT CAGCCTCGGT GATCTCGGTC GTCGGCCTGG TCGGCGTGAT GGTCGGGGCG TTGGCCGCCG GCACGGTCAG CGACCTGATC GGCCGCCGCC GCACCATGCT GTGGACGGTG ATCAGCTTCT CCGTGCTGAC CCTGGCCTGC GCCTTCGCCC CCGACCCAGT CACCTTCGCG GTGCTGCGCT TCCTGGCCGG TCTCGGCCTG GGCGGCGTGC TGCCCACCGC GTTGGCGCTG ATCAACGAGT ACGCCCGGTC GGGTCGCGGC GGGCGGGCCA CCACCACCAT GATGACCGGC TACCACGTGG GCGCGGTGCT GACCGCGCTG CTGGGCATCC TGATCATCGA GCCCTGGGGC TGGCATGCGA TGTTCATCGT CGGCGCCCTG CCGGCCATCG TGCTGGTCCC GTTGATGATC AAGTACCTGC CCGAGTCGAA CGCCTTCCTG CAGGCCCGAG CCGGGCTCGC GCCGAGCGCC GGCAAGGCCA CCACGACGGA CCGGGCCGAC CAGGCGGCCA AGCCGGCCAA GCCGGCCAAG TCCAAGAACC CGGTCGGCAT GCTGTTCCAC CACGGTCTGG GCCGGTCCAC GGTGGCGTTC TGGGTCGCCT CGTTCATGGG CCTGCTGCTG GTGTACGGGC TGAACACCTG GCTGCCGCAG ATCATGCGCG AGGCCGGCTA CGAGCTGGGC GCCGCGCTGG CCCTGTTGCT CGTACTCAAC GTCGGCGCGG TGCTCGGCCT GCTGGTCGCC GGGCAGGTCG CCGACAAGAT CGGCACCCGT CGCTCGTCGA TCAGCTGGTT CGCCGTGGCC GCCCTGTTCC TGGCCCTGCT GTCGATCAAG CTGCCCGGCA TCGGGGTGTA CATCAGCGTG CTGCTGGCCG GCATGTTCGT GTTCAGCGCG CAGGTGCTGG TCTACGCCTA CGTCGCCCAT GTCTACCCGG CCGCCGCCCG CGGCACCGCG CTGGGCTCCG CGGCCGGCGT CGGCCGGCTG GGCGCCATCA CCGGCCCGCT GATCACCGGC GTCATGCTGA CCGCCGGGGT GGCCTACCCG TGGGGCTTCT ACCTGTTCGC GGCGGTCGCC GCGATCGGTG CCGCGGCCAT CTTCCTGGTC GATCGGAACC CGGCCCCGGC CGAGCCGCTG CCGGTCACCG AACAGCAGGC CGACCAGATC ACCCACATCC ACCCGCACTG A
|
Protein sequence | MNGSAVVNPR AQYSGWVSPL CWIAVALEGF DLVVLGVVLP ALLKYDDWGL NPNSASVISV VGLVGVMVGA LAAGTVSDLI GRRRTMLWTV ISFSVLTLAC AFAPDPVTFA VLRFLAGLGL GGVLPTALAL INEYARSGRG GRATTTMMTG YHVGAVLTAL LGILIIEPWG WHAMFIVGAL PAIVLVPLMI KYLPESNAFL QARAGLAPSA GKATTTDRAD QAAKPAKPAK SKNPVGMLFH HGLGRSTVAF WVASFMGLLL VYGLNTWLPQ IMREAGYELG AALALLLVLN VGAVLGLLVA GQVADKIGTR RSSISWFAVA ALFLALLSIK LPGIGVYISV LLAGMFVFSA QVLVYAYVAH VYPAAARGTA LGSAAGVGRL GAITGPLITG VMLTAGVAYP WGFYLFAAVA AIGAAAIFLV DRNPAPAEPL PVTEQQADQI THIHPH
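The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 shares the standard codon-to-amino-acid assignments, and that an alternative start codon such as GTG is read as methionine when it initiates the CDS. The helper names (`clean`, `gc_fraction`, `translate`) and the reduced start-codon set are hypothetical choices for this example, not part of the database; only the first 30 nucleotides of the gene sequence are used.

```python
from itertools import product

# Standard codon assignments in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a simplified subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-nucleotide blocks of the gene sequence in the record.
prefix = "GTGAACGGAT CCGCGGTCGT GAACCCGCGC"
print(translate(prefix))
```

Applied to the full gene sequence, the same functions could be used to compare the translation with the listed protein sequence and the GC fraction with the GC content field.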