Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2998 |
Symbol | |
ID | 8448611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 3289464 |
End bp | 3290702 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645042082 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003202324 |
Protein GI | 258653168 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0156539 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000637715 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGATTACC CCTCCGGTGC GGGCACCGTC CCCGCCGTCA CCGATACCCC CGCACCCACC GACCCGGCGC CGGCCCCGGC CAGGACCGGC CAGCCGATCG CCGTCTGGGT GCTGGCCTTC GCGGCCATGG TCTCGTTCAT GGGCATCGGG CTGGTCGATC CGATCCTCAA GTCGATCGCG GCCAACCTGG ACGCCACCCC CAGTGAGGTC TCGCTGCTGT TCACCAGCTA CCTGCTGGTC ACCGCGATCG CGATGCTGAT CACCTCGTTC GTCTCCAGCC GCTTCGGTGG CCGGACCACG CTGATGGCCG GGCTCGTCAT CATCATCGTG TTCACCACCC TGGCCGGAAC GTCCGATTCG GTCGCCGCGC TCGTCGGCTG GCGGGCCGGC TGGGGTCTGG GCAATGCCCT GTTCATCGCC ACCGCACTGG CCGCGATCAT CGCCGTCGCC CGGGGCGGCG CCGAGAAGGC CGTCACGCTC TACGAAGCCG CGCTGGGCGT CGGCATCTCG GTCGGGCCGC TGGTCGGCGC GCTGCTGGGC ACCGTCAACT GGCGGGCCCC GTTCTTCGGC GTGGCCGTGT TGATGGGCAT CGCCCTGCTG GCCATCTCGC TGTTCCTGAA GGACAAGGTC ACGGTCACCC ACCGAATCCG GCCGGCCGAC CCATTGCGCG CGCTGGGGCA CGGTGGCCTG CTGGTATTGG GCATTGCCGC CCTGCTCTAC AACGGCGGCT TCTTCGCGGT GCTGGCCTTC ACCCCGTTCA CGTTGCCCTA CAGCGCTTTC GGGATCGGCT TCCTGTTCTT CGGGTGGGGC GTCCTGCTGG GCCTGTGCGC CGTCTGGGGC GCACCGTGGA TGCACCGCCG GTTCGGGCTG ACCAACGCGT TCATCATCAC CCTGGGCGTG TTCACCGCGA TCCTGGTCGC ATTGGCCTTG ACCGTCGACA ACCACGTCGC CGTCACCGTG TTGGTGATCG CGTGCGGCGC CCCGCTGGGC GTGCTGAACA CGCTGTTCAC CGAGTCGGCG ATGAACGTCT CCCCCGTCCC GCGCCCGGTC GCCTCGGCCG GTTACAACTT CGTCCGGTTC CTGGGGGCGG CCGCCTCGCC GTGGATCTGC GGCAAGCTCG GCGAGGAGGT CGGCCTGTCG GCCCCGTTCT GGTTCGGTGG CGCCTGCGTC ATCGGCGGAC TGCTGATGAT CGCGGTCTTC GGCCGCCGGC ACCTGGCCGC GATCAACGCC CGGCACTGA
|
Protein sequence | MDYPSGAGTV PAVTDTPAPT DPAPAPARTG QPIAVWVLAF AAMVSFMGIG LVDPILKSIA ANLDATPSEV SLLFTSYLLV TAIAMLITSF VSSRFGGRTT LMAGLVIIIV FTTLAGTSDS VAALVGWRAG WGLGNALFIA TALAAIIAVA RGGAEKAVTL YEAALGVGIS VGPLVGALLG TVNWRAPFFG VAVLMGIALL AISLFLKDKV TVTHRIRPAD PLRALGHGGL LVLGIAALLY NGGFFAVLAF TPFTLPYSAF GIGFLFFGWG VLLGLCAVWG APWMHRRFGL TNAFIITLGV FTAILVALAL TVDNHVAVTV LVIACGAPLG VLNTLFTESA MNVSPVPRPV ASAGYNFVRF LGAAASPWIC GKLGEEVGLS APFWFGGACV IGGLLMIAVF GRRHLAAINA RH
|
| |