Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1665 |
Symbol | |
ID | 9245515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2034419 |
End bp | 2035735 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003679600 |
Protein GI | 297560626 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.395823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGCG AGGTCGCGGA GCAGGTCGAC GAGAAGGACC TGCGGCGGGG TGTGTTCGCC GGGGCGGTGG GCGTCTTCGT CCACTGGTTC GACTGGGCCG TCTACGCCTA CCTGGCCACC ACCATGGCGC AGGTGTTCTT CCCCGAGCAG GACGGCACCA CCGCCCTGCT GTCGGTCTTC GCGGTCTTCG CCGTGGCGTT CTTCGTGCGT CCGCTCGGCT CGGTGCTCTT CGGCCACCTC GGCGACCGCT TCGGGCGCAA GACCACGCTG TCGATCGTCA TCATCTCGAT GGCGGCGGGC ACGCTCATGC TCGGGCTGCT GCCCAGCTAC GAGTCCGTCG GCATCCTCGC GCCGATCCTG CTGGTGGTCG CCCGCATCAT CCAGGGGCTC GCCGCGGGCG GGGAGTTCGG CTCGGCCGCC GCCTTCCTCG CGGAGTTCTC GCCGCCCAAG CGCCGCGGGT TCGGGTGCTC CTGGATCGAG TTCGGCTCGG TCGGCGGGTT CCTGTGCGCG TCGTTCGCCG TGTGGGCCCT GCACGCGTCC TTCCCGGCCG AGGTCGTCCT CGACTGGGCG TGGCGGATCC CCTTCCTGCT GACGGTGCCC ATGGCCGCGG TGGGCCTCTA CATCCGCCTG CGCATCGAGG ACACGCCCGA GTACCGCGCG CTGGAGGACA TGAACAACGT CCCGAGCCAG CCCGTCGTCG AGGTGTTCCG CTCCAACGGC AGGCAGTTCC TCCAGACGGT CGGCATCGAG ACCTTCATGA ACTCCACCTT CTACATCGTC CTGGTGTACC TGATCACCTA CCAGGAGGAG ATCGTGGGGG TGCCCGCCGA CCGGGCGGCC CTGCTCTCCG CGGTGGCCTC GGTCGTCGCC ATGGGGATCA TCCCGCTCTC GGGCAGGATC TCGGACCGCG TGGGCCGCAA GCCGGTGCTC TACACCGCCG CCGCGCTGCT GATCGCGGCC TCCGTGCCGC TGTTCTGGCT GATGCAGGTG CAGACCTCGT GGGCGGCGTT CGCCGCGACC TTCGGCCTCG CCGCGATCCT GGCGGTCATC CTGGGCACCC ACGCGTCCGC CGTGGCGGAG CTGTTCCCGA CCCGGACCCG GCAGAGCGGG CTGTCGATGG CCTACAGCGT CGCCGGGGCG TTCTTCGCGG GAACCCTGCC GTACCTGATG ACCTGGCTGA TCTCCCTCAC CGGCAGCAGC ATGGTCCCCG CCTTCACCAT GGTCGTGATC GGCGTCATCG GCGCGGTCAC ACTGCGCACC ATGCCCGAGA CCAGCGGCTC CGACCTGCTG CACGAGAGCG ACCGGGCCTC TCGCTGA
|
Protein sequence | MTSEVAEQVD EKDLRRGVFA GAVGVFVHWF DWAVYAYLAT TMAQVFFPEQ DGTTALLSVF AVFAVAFFVR PLGSVLFGHL GDRFGRKTTL SIVIISMAAG TLMLGLLPSY ESVGILAPIL LVVARIIQGL AAGGEFGSAA AFLAEFSPPK RRGFGCSWIE FGSVGGFLCA SFAVWALHAS FPAEVVLDWA WRIPFLLTVP MAAVGLYIRL RIEDTPEYRA LEDMNNVPSQ PVVEVFRSNG RQFLQTVGIE TFMNSTFYIV LVYLITYQEE IVGVPADRAA LLSAVASVVA MGIIPLSGRI SDRVGRKPVL YTAAALLIAA SVPLFWLMQV QTSWAAFAAT FGLAAILAVI LGTHASAVAE LFPTRTRQSG LSMAYSVAGA FFAGTLPYLM TWLISLTGSS MVPAFTMVVI GVIGAVTLRT MPETSGSDLL HESDRASR
|
| |