Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2472 |
Symbol | |
ID | 8448083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2727701 |
End bp | 2729068 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645041585 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003201829 |
Protein GI | 258652673 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00897] polyol permease family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00022569 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0159414 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAAG TTTCGATCCC GGCCCGCCCC TTCGCGCACA ATGAGGTGAT TCCTGTGTCT TCCCGACTGC CCCGCTCCTT GATCTTCGGC TATGCCGCGA TCGCGCTGTT CATGACCGGC GACGGATTCG AGCTGACATT CCTGGCCCGG TATCTGGTCG ACCTGGGCTA CACCCCGGTC GACGCCGCGC TGGCGTTCAG CGTGTACGGA TTCGTGGCCG CGATCGCGGC CTGGTGTTCG GGCGTGATCG CCGAGATGTT CGGCGCCCGG CGAGTGATGG TGGTCGCCGG AATCGCCTGG CTGGTGTTGC AGGTCGTGCT GCTCAGCGTC GGTCTGAATT CGGGCAGCCT GCCGCTGATC CTGCTGATCT ACGGCGCGCG GGCCGCGGCC TACCCGATGT TCATCTACTC CTTCGTGGTG CTGATCGCCC AGACCGTCGA CCGGTCCCGG TTGGCCACCG CGATGGGCTG GTACTGGGCC GCCTACTCCC TGGGGATCGG CGTGCTGGGC ACGTACTTGC CGAGCTGGCT GCTGCCCGTG CTGGGGGAGC AGCTGACGCT GTGGCTCGCG CTGCCCTGGG TGGCCTCCGG TGTGCTGCTG GCCGCCTGGA GCGCCCGCCG CTACGGCGGC ACCGCCCAGA CCGCCCAGAC CGCGCCGACC GGCACCGAAC CCGCGTCCCG GCTGCGCGAG TTGGCCCGGG GCGCCACCAT TCTCGTCGAG AACCGGCCGA TCCTGATCCT GGCCGTCGTC CGCGTCATCT GCAACCTCAC CCTCTTCGGC TTCCCGGTGA TCATGCCGCT GTACCTGTCG ACGACCACCT ACGACGGGGT CGGCGTGCTG GCCGTGACGC AGTGGATGCA GTTGTGGGGC CTGATGTTCG CCGTCACCAT CGTGACCAAC GTGCTGTGGG GCCGCATCGG TGACCGGTTC GGATGGATGC GCCAGATGCG CTGGTACGGC TGCGTGGGCT GCGCGGTCGC GACCCTGTCC TTCTACTACC TGCCGCAGTG GACCGGGCCG AACATGTGGG CGCTGTCGGC GGCCGCCGTC CTGCTCGCGG TGGCGGTGTC CGCGTTCGTG CCGATGGGGG CCGTCTTCCC GGCCCTGGCC CCCGGTCACA CCGGAGCCGC GGTGTCCGCG CACAACCTGG CGGCCGGGGT GTCGACCTTC CTCGGCCCGG CCATCGCCAT GGTGCTGCTG CCCATCGCCG GCATCGGCGG GGTCTGCTGG GCCTACGCAG TCCTGTACCT GATCGGTGCC GGCCTGACCG TCTTCGTCCG CCCCGATCAG CCCGGTATCA GGTCCCGGCG CCCGGTGCCG GCCACCGCCT CCGCCTCCGC CTCCGCCTCC GCCTCCGCCT CCGCATGA
|
Protein sequence | MSEVSIPARP FAHNEVIPVS SRLPRSLIFG YAAIALFMTG DGFELTFLAR YLVDLGYTPV DAALAFSVYG FVAAIAAWCS GVIAEMFGAR RVMVVAGIAW LVLQVVLLSV GLNSGSLPLI LLIYGARAAA YPMFIYSFVV LIAQTVDRSR LATAMGWYWA AYSLGIGVLG TYLPSWLLPV LGEQLTLWLA LPWVASGVLL AAWSARRYGG TAQTAQTAPT GTEPASRLRE LARGATILVE NRPILILAVV RVICNLTLFG FPVIMPLYLS TTTYDGVGVL AVTQWMQLWG LMFAVTIVTN VLWGRIGDRF GWMRQMRWYG CVGCAVATLS FYYLPQWTGP NMWALSAAAV LLAVAVSAFV PMGAVFPALA PGHTGAAVSA HNLAAGVSTF LGPAIAMVLL PIAGIGGVCW AYAVLYLIGA GLTVFVRPDQ PGIRSRRPVP ATASASASAS ASASA
|
| |