Gene Information |
Locus tag | Ndas_4495 |
Symbol | |
ID | 9248375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5331248 |
End bp | 5332477 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003682389 |
Protein GI | 297563415 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
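The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is only a consistency check, assuming 1-based inclusive coordinates and a final stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5331248
end_bp = 5332477
reported_gene_length = 1230      # bp
reported_protein_length = 409    # aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == reported_gene_length)
print("protein length matches:", computed_protein_length == reported_protein_length)
```

Under those assumptions both computed values agree with the reported 1230 bp and 409 aa.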
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.675529 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACACCTG TGTCCGGTCG GGCCTCCGGC GCGCCCGCAC CCGCCGTGGT CGGTGCGCGC CGGGGCCTGG CGGTCCTGTG CGTCACCGTC ACCACCGGCT ACGGGGTGCT GTTCTACGCC TTCCCCGTCC TGGCGCCGAG CATCACCGCC GACACCGGAT GGTCCCTGAC CGCGGTGACC GCGCTGTTCT CCGCCTCCCA GGTCATGGCG GGACTGGCGG GCATCCCGGT GGGACGCTGG GTGCAGGCCC GGGGCCCGCG CCCGGCGATG ACGGCGGCTG CCCTGGCCGC GGCTCCCGCC GTGGCGGCCC TCGCCCTGGC CCCGAACCTG TGGGGCTTCG CCGCCGCCTG GCTGGTGGCC GGAGCGGCGA TGGCCGGACT GTTCTACCCT CCGGCCTTCG CCGCCCTGAC CCAGTGGTAC GGAAGGGCGA AGGTCCGGGC CCTGACCGCG CTGACCCTGG CCGCCGGTCT GGCCAGCACC GTCTTCGCTC CCCTGACCGC GTTCCTGGAA GGAGTCTGGG GGTGGCGGAC CGCCTACCTG GTACTCGCGG CCGTGCTCCT GGTCGTGGTG GTGCCCCTGC ACGCCTTCGC CCTGCCACAG GGCTGGGTCG CCGACGGCGC CGGGCAGCAG AGGGGCCGAG GGCAGGGCGC GCGTGCCGTG GTGCGCGGTC GGGTGTTCTG GGCTCTGACG ACGGCTCTGG CCCTGGGGTC CTTCACCGTC TACGCGGTCG TGGTCAACAT CGTCCCCCTG CTGGATGAAC AGGGTTTCGG CACGGCGGAA GCGGCCTGGG CCCTGGGGGC GGGCGGTGTG GGGCAGGTGC TCGGCCGTCT GGTCTACGCG CCCCTGGAAC GGTGGACCGA CCCGGTGCCG CGCGCCGTGG CCGTGCTGGG CGCGTGTTCG GTGACCACCC TGCTTCTGGC CCTGGTGCCG GGACCCCTGG GGCCGGTCCT GGCCATCGCG GTGCTGGCGG GCATGGCACG CGGCATCCTC ACCCTCCTCC AGGCCACCGC CGTGTCCGAC CGGTGGGGGA CGGAGCACTA CGCCACCCTC AACGGCGTCA TGCACACCCC GCTCATGCTG GCCGTCGCGG TCGCGCCCTG GGCGGGCGCA GCCCTGGCCG GTCCCCTGGG CGGCTATCCG GCGGCGTTCG CGGCGCTGGG AGCCCTGGCG GCGCTCGGCG CGCTGACCGC CCTGGCCACC CGCGCCGAAC GGGTTCCCAC CCCTTCCTGA
Protein sequence | MTPVSGRASG APAPAVVGAR RGLAVLCVTV TTGYGVLFYA FPVLAPSITA DTGWSLTAVT ALFSASQVMA GLAGIPVGRW VQARGPRPAM TAAALAAAPA VAALALAPNL WGFAAAWLVA GAAMAGLFYP PAFAALTQWY GRAKVRALTA LTLAAGLAST VFAPLTAFLE GVWGWRTAYL VLAAVLLVVV VPLHAFALPQ GWVADGAGQQ RGRGQGARAV VRGRVFWALT TALALGSFTV YAVVVNIVPL LDEQGFGTAE AAWALGAGGV GQVLGRLVYA PLERWTDPVP RAVAVLGACS VTTLLLALVP GPLGPVLAIA VLAGMARGIL TLLQATAVSD RWGTEHYATL NGVMHTPLML AVAVAPWAGA ALAGPLGGYP AAFAALGALA ALGALTALAT RAERVPTPS
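The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The sketch below shows one way to strip the spacing, compute a GC fraction, and check for a start and stop codon. The helper names are hypothetical, the start-codon set lists only the common bacterial starts (an assumption, not the full translation table 11), and the example string is a short excerpt built from the first two and the last blocks of the gene sequence rather than the full gene.

```python
# Common bacterial start codons (assumption: a subset of translation table 11).
START_CODONS = {"ATG", "GTG", "TTG"}
# Standard stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def has_start_and_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the ends look like a CDS."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[:3] in START_CODONS and s[-3:] in STOP_CODONS


# Excerpt only: first two blocks and last block of the gene sequence above.
first_blocks = "GTGACACCTG TGTCCGGTCG"
last_block = "CCCTTCCTGA"
excerpt = first_blocks + " " + last_block

print("cleaned length:", len(clean(excerpt)))
print("GC fraction of excerpt:", round(gc_fraction(excerpt), 3))
print("starts and stops like a CDS:", has_start_and_stop(excerpt))
```

Passing the full gene sequence string to `gc_fraction` would give the value to compare against the reported GC content; the excerpt here only demonstrates the parsing.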