Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0780 |
Symbol | |
ID | 9244625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 959358 |
End bp | 960620 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003678730 |
Protein GI | 297559756 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.347678 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACCTT TGGACACGCT GCGACGCATG CACCACATCG CCGGGTGGCC GCTGCTCCTG AGCACCTTCC TCGCCCGCCT GCCCATCTCC ATGTCCCTGA TCGGCCTGCT CACCCTGGTC ACCACCACGA CCGGGAGCGT GGCCGCCGCC GGTGCGGTGA CCGGCGCCTT CGCGCTCGGG GAGACCGTGG GCGGGCCCGT CATCGCCCGC TTCGCCGACC GGCGCGGGCA GCGCGTACCG GTGCTGGTCA CCTCGCTCGT GGACGCCGTG CTCATCACCG TGCTCGTGAC CGCCGTCCTG GCGGAGGCCT CCGCCCCGGT CCTGGCCGTG CTCGCCGCCG CGGCGGGTCT GTGCATGCCC CAGATCGGCC CCATGGCGCG TACGCGCTGG GTGGTGCTCA TCCGTCGGGG GCCGCACCGG GGCGAGGAGC GCGAGCGGTC GGTGTCCGCG GCGATGTCGG TCGAGGGCGT GCTGGACGAG GCCGCGTTCG TCCTGGGCCC CGCCCTGGTG GGCCTGCTCA CCGTGACGCT CTCCCCGGCG GCGAGCGTGC TGGGCGCGGC CGTGCTCATC GGCGTGTTCG GGACCGTGTT CGCCCTGCAC CCCACCGCGC CGCCCGGAAC CCCGCCGGTG CGCGGCGCCG GGGGCCGCAT CGCCACGCCC GCGCTGCTGG TGCTGGCCGT CCCCATGTTC TGCCAGGGCC TGTTCTTCGG CGGCATGTCC ACCGGCGTGA CCGCCTTCTC CGCCGCCTCC GGGCACGGGG ACCTGTCCGG GCTGATGTAC GCGGTGATGG GCACCAGCAG CGCCGTCGCG GGCCTGCTGA TGGCCTCCGT CCCCGTGGGC TTCCCGCTCA CGGCGCGGGC CCGGATCGCG GCGGGCGCGC TGTTCCTGCT CACCCTGCCG CTGTACGCGG CCCACGGCGC GGCCGCCCTG GCGGTGGCCA TCTTCGTCCT GGGCGCCGCC ATCGGCCCGC ACATCGTGAG CCTGTTCGGG CTGATCGAGA GGGCCGCCCC GGCCAGCCGC CTGTCGCAGT CGATGGCCGT CATCCTGAGC TGCCTGATCC TGGGCCAGGC GCTGGGGTCG TCGGTCGCGG GCGTCCTCGC CGACGCGCAC GGCCACCAGG GGGCGTTCGT GCTGGCCACG CTGGGCGGCC TGGTCTCCTT CGCGGTGACC GTCCTGGTGA TGCGCGCCCG CTGGTACGTG CGCGGCGAAC CGTCCTCTCC GGCGACGGTG ACACGTTCCG CTCCGGGCGT CGGAGAGGGG TGA
|
Protein sequence | MSPLDTLRRM HHIAGWPLLL STFLARLPIS MSLIGLLTLV TTTTGSVAAA GAVTGAFALG ETVGGPVIAR FADRRGQRVP VLVTSLVDAV LITVLVTAVL AEASAPVLAV LAAAAGLCMP QIGPMARTRW VVLIRRGPHR GEERERSVSA AMSVEGVLDE AAFVLGPALV GLLTVTLSPA ASVLGAAVLI GVFGTVFALH PTAPPGTPPV RGAGGRIATP ALLVLAVPMF CQGLFFGGMS TGVTAFSAAS GHGDLSGLMY AVMGTSSAVA GLLMASVPVG FPLTARARIA AGALFLLTLP LYAAHGAAAL AVAIFVLGAA IGPHIVSLFG LIERAAPASR LSQSMAVILS CLILGQALGS SVAGVLADAH GHQGAFVLAT LGGLVSFAVT VLVMRARWYV RGEPSSPATV TRSAPGVGEG
|
| |