Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1014 |
Symbol | |
ID | 9244860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1240458 |
End bp | 1241792 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003678963 |
Protein GI | 297559989 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.506653 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACC TCGACCCAGA GGCGACCGAG GAGCGCCCGG CACGCGGCGG GCCGGTGGTG AGCGGTCGCG AGGCGGGCCG GATCGCTTTC GCGGCCTTCG TCGGCACCGC ACTGGAGTGG TACGACTACT TCCTCTTCGG TACGGCCGCC GCGCTCGTGT TCAACCGGCT GTACTTCACG ACGCTGGACG CCACGGCCGC CACCCTGGCC GCGTTCGCCA CCTTCGGCGT CGGCTTCGTC GCGCGTCCGC TGGGCGCGGT GCTGTTCGGC TGGATGGGCG ACCGGATCGG ACGGCGACCA GCGCTGCTGA TCACCGTGGT CTCCATCGGT GTGGCCACCG GCCTCATCGG CCTGCTGCCC GACTTCGCCT CGATCGGCAT CGCCGCTCCG CTGCTGCTGG CCCTGCTCCG GCTCGTGCAG GGCGTCGCGG TCGGCGGCGA GTGGGGCGGC GCGGTGACCA TCGCCGTCGA GCACGCCCCG CCGGAGAAGC GCGGGCGCTA CGCGGCCCTG CCGCAGATCG GGTCGCCCGT GGGCACCCTG CTCTCCTCGG GCGCGTTCTC GCTGGTCCTG CTCCTGCCCG CCGAGCAGTT CGACTCCTGG GGCTGGCGCC TGCCGTTCCT CGCGGCCTTC CCGCTGCTGC TGGTCGCGGT CTACATCCGC CAGAAGGTGG AGGAGTCGCC GGTCTTCGAG GAGATGGAGA AGCAGGAGGC GCGCTCCAAG GTCCCCGCGG TCGACGTGTT CCGCCACGCG TGGGGCAGGC TCCTCATCGC GATCGCGTCC GCCATGCTGG GCGTCGGCGG CTTCTACGTC ATGACGACGT TCGCCATCAG CTACGGCACC GACACCCTCG GACTGTCGCG CAGCCTCATG GTCAACGCCA CCCTGGTCGC CGCGGTGGTG CAGATCGGCG TGATCGTCTA CTTCGGGCGC CTGGCCGAGA AGCTCGGCCC CGGGCGCGTC ACCATGTGGG GCGGTATCGC CACGGCGGTG ATCGCCTTCC CGGTGTTCTG GCTGATCGAC ACGACCTCGC CGGTGCTGGT GGTGCTGGCG GTGGCCGGGG GCGTGGGCTT CCTGTCCATC GCCTACGCGG TCTCCGGCGC GCTGCTCACC GAGCTGTTCC CGGCGAACCT GCGCTACAGC GGCGTCGCGC TGGCCTACAA CCTGGCCGGC GCCCTGAGCG GCTTCCTGCC GTTCATCGCG ACCGCCCTGC TGGAGAGCGC GGACGGACGC TCCTGGGTCG CCTCGGTGCT GTTCCTGGGG ATCGCCCTGG TCACGGCGGT CGGCGGCTTC TACGGCGAGC GGCTGCGCGT CCGCGACGAC GTGGTGGTGC GCTGA
|
Protein sequence | MSDLDPEATE ERPARGGPVV SGREAGRIAF AAFVGTALEW YDYFLFGTAA ALVFNRLYFT TLDATAATLA AFATFGVGFV ARPLGAVLFG WMGDRIGRRP ALLITVVSIG VATGLIGLLP DFASIGIAAP LLLALLRLVQ GVAVGGEWGG AVTIAVEHAP PEKRGRYAAL PQIGSPVGTL LSSGAFSLVL LLPAEQFDSW GWRLPFLAAF PLLLVAVYIR QKVEESPVFE EMEKQEARSK VPAVDVFRHA WGRLLIAIAS AMLGVGGFYV MTTFAISYGT DTLGLSRSLM VNATLVAAVV QIGVIVYFGR LAEKLGPGRV TMWGGIATAV IAFPVFWLID TTSPVLVVLA VAGGVGFLSI AYAVSGALLT ELFPANLRYS GVALAYNLAG ALSGFLPFIA TALLESADGR SWVASVLFLG IALVTAVGGF YGERLRVRDD VVVR
|
| |