Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1671 |
Symbol | |
ID | 9245521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2042825 |
End bp | 2044069 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003679606 |
Protein GI | 297560632 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAATC CGTACGCGCA GATCTTCGCG GTGCGTGGAG CCAAGGGCTT CACCGTCGCC GGGCTCATCG GGCGCATGCC CGTGGCCATG ACCAACATCG GCATCATCAC GATGCTCTCC ACCACCCACG GCAGCTACGC CCTGGCGGGC GCGGTGGCCG CCGCCTTCAC CCTGTCCATG GCGCTGATCA CCCCGCAGGT CTCGCGTCTG GCCGACCGCC ACGGACAGCG CCGCGTCCTG CCCCCCGCGG CCGCCGTCAG CGTCGCGTCC CTGCTCCTGA TGCTGCTGTG CGTGCGGTTC GACGCGCCGT ACTGGACGCT CTTCGCGTTC GCGGTCCCCG CCGGGACCAT GCCGAACATG TCCGCGATGT CCCGCGCGCG CTGGACCGAG CTGCTGCGCG GCTCGCCCCG GCTGCACACC GCCTACTCCT TCGAGTCCGT GGCCGACGAA CTCACCTTCA TCACCGGCCC GGCGCTGTCG GTGGTGCTGA GCACCATGGC CTTCGCCCAG GCCGGTCCGC TGGCCGCGGC CGCCTTCCTG GCCCTGGGCG TCACCCTGTT CGTGGCCCAG CGCGGTACGG AGCCGCCGCT CCAGGCTCCC GAGGCGTCCG GGACCAAGGG CGCGGGCGCG CTCAGCGGCG CCCTCCTGGT CCTGGTGCTG ACCCTGCTCG CGGGAGGCGT CATCGTCGGC TCGGTGGACG TGGTCGCGGT GGCCTTCGCC GAGTCGCTCG GCGTGACCAG CGCCACCGGC GTCGTGCTGT CGGCCTACGC GCTCGGGTCG GCGATCTCCG GGCTGACCTT CGGGGTGCTC GACCTGCCCT GGCGGCTCCA CATGATGCTG ATCGTGGCCG TGGCCGGGAC GTTCGCGACC ACCCTCCCCT TCCTGGCGGT CGGTAGCATC TGGACCCTGT CCGTGGCCGT GTTCTTCGCG GGGATCTTCT TCGCCCCCAC GATGATCCTG GTGATGACGC TGATCGAGCG GACCGTACCG CCGTCCAAGC TGACGGAGGG CATGACCTGG GCCCTGACCG GCCTGACCAT CGGTACCGCG ATCGGCACCT TCTCCTCGGG GCTGGCGGTG GAGGAGAGCG GCACCACGGG CGGCTTCCTC GTCGCGGTCG CGGCCGGCGC CCTCGCCCTG GTCCTGACCC TGGTGTTCGC CCCCCTCCTG GCCCGGGCCC AGGCGAGGGC CGAGCGGGCG CAGGAGGAAC AGGCCGCGGA GGAGGCGGCC GGTACCGCCG GGTGA
|
Protein sequence | MPNPYAQIFA VRGAKGFTVA GLIGRMPVAM TNIGIITMLS TTHGSYALAG AVAAAFTLSM ALITPQVSRL ADRHGQRRVL PPAAAVSVAS LLLMLLCVRF DAPYWTLFAF AVPAGTMPNM SAMSRARWTE LLRGSPRLHT AYSFESVADE LTFITGPALS VVLSTMAFAQ AGPLAAAAFL ALGVTLFVAQ RGTEPPLQAP EASGTKGAGA LSGALLVLVL TLLAGGVIVG SVDVVAVAFA ESLGVTSATG VVLSAYALGS AISGLTFGVL DLPWRLHMML IVAVAGTFAT TLPFLAVGSI WTLSVAVFFA GIFFAPTMIL VMTLIERTVP PSKLTEGMTW ALTGLTIGTA IGTFSSGLAV EESGTTGGFL VAVAAGALAL VLTLVFAPLL ARAQARAERA QEEQAAEEAA GTAG
|
| |