Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4303 |
Symbol | |
ID | 9248178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5119792 |
End bp | 5121093 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003682198 |
Protein GI | 297563224 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.530625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.111618 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAACCG CCGTGGGCCG TCGGCGCCTC GCGCTGTGCG TCCTGTTCCT CCTGCCCGGG CTGGGGATCT CGTCCTGGGT CACCCGTACG CCCGCCATCC GCGACGCGCT GGGCGCCTCC ACCGCCGAGA TGGGGTTCGT CCTGTTCGGC CTCTCGATCG GTTCCATGAT CGGCGTCCTC GGGTCGGGGG CGGTCGTCGC CCGCCTGGGC GCACGGCCGG TCATCGTGGC CGGGACCGCC GCGATGCTGG GGAGCCTGCC CGTCATCGGT CTTGGCGCGG GCCTCTCGTC CGCCCTCGTC GTCGCGTTCG GCCTGTTCCT CTTCGGGCTG GGCATGGGCG CTGGGGAGAT CGCGATGAAC ATCGAGGGAG CCGACGTCGA GCGGGTCATG GCCGAGCCGC TGCTGCCGCG CATGCACGGC TTCTTCAGCC TGGGGACCGT GATCGGCGCC CTCGTCGGCA TGGCGCTCAC CGCAGTCGGG TTCCCCGTGG CGTGGCACCT GGCGGCGATG GGCGTCCTGA CCCTGGCGGT GGCGGCGACG CTCTTCGGCT CCCTGCCCCC CGGCACCGGC AGGGCGCTCC CGCGCGCCTC CGGCCAGGGG AGCGCGGGCG GCGGACGCGC GCTGTGGAAG GACGCGCGCC TGGTGCTGAT CGGCCTCATC GTGCTCGCGA TGGCCCTGGC CGAGGGCACC GCCAACGACT GGCTCCCGCT GATCATGGTC GACGGCCACG GCTTCGACCC GGCCCTGGGG TCGATGGTCT ACGCCGTCTT CGCCGCGTCG ATGACGGTCG GGCGCTTCGC CGGGGGCTAC TTCCTGGCGC GCTTCGGCAG GGCCCGCGTG CTCGGGGCGA GCGCGCTGGC CGGGGTGGCG GGCATGGGGC TGGTGGCCGG TGCGGACAGC CCGGCCCTGG CGGCGGCGGC CGTGGTCCTG TGGGGACTGG GCGCCTCGCT GGGCTTCCCC GTGGCACTGT CGGCGGCCGG GGACTCCGGG CCCGACTCCG CGGCCCGGGT CTCCCTGGTG GCGACGCTCG GCTACGTCGC GTTCCTGGTC GGGCCGCCCG TGCTCGGCCT GCTGGGGGAG GCGTACGGGT TGCGTACGGC CCTGGTCGTG CCCCTGCTCC TCGTGGCGTT CGCCGGGTTC CTCAGCCCGG CGGCCCGTCC GCGCCGGGCG GCGGGCGCGC GGGCCGAAGA GGACAGTGCT GGAACCGGAA GGGGCGGTAC GGAAGCCGAG GGGAGTGGTG CGGGGACCGA TCGGGGCGGC GCGGAGGCCG GGCAGGGTGG TGCGGAGGCA GGACGGGCCT GA
|
Protein sequence | MGTAVGRRRL ALCVLFLLPG LGISSWVTRT PAIRDALGAS TAEMGFVLFG LSIGSMIGVL GSGAVVARLG ARPVIVAGTA AMLGSLPVIG LGAGLSSALV VAFGLFLFGL GMGAGEIAMN IEGADVERVM AEPLLPRMHG FFSLGTVIGA LVGMALTAVG FPVAWHLAAM GVLTLAVAAT LFGSLPPGTG RALPRASGQG SAGGGRALWK DARLVLIGLI VLAMALAEGT ANDWLPLIMV DGHGFDPALG SMVYAVFAAS MTVGRFAGGY FLARFGRARV LGASALAGVA GMGLVAGADS PALAAAAVVL WGLGASLGFP VALSAAGDSG PDSAARVSLV ATLGYVAFLV GPPVLGLLGE AYGLRTALVV PLLLVAFAGF LSPAARPRRA AGARAEEDSA GTGRGGTEAE GSGAGTDRGG AEAGQGGAEA GRA
|
| |