Gene Information |
Locus tag | Ndas_1048 |
Symbol | |
ID | 9244894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1292240 |
End bp | 1293673 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003678997 |
Protein GI | 297560023 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
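The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length includes the terminal stop codon; the function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 1292240
END_BP = 1293673


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1434 477
```

Both values agree with the Gene Length (1434 bp) and Protein Length (477 aa) fields in the record.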
Plasmid Coverage Information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.137816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.36117 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCCGG CACGGCAGCC CCGCAGGAAC GGACACGGGA GCGGAAGCGG GCACGGGAAG AGCGGGGACA GGGCCGGGAT CGGGAACGGG AGCGGGGCCG GGAGCGAGAG CGAGCCCGGG AACGGGGCCG GGAACAGGGC TGGGAATGAG AGCGGGAACG GGAACAGGGC CGTAGCCGGG AGCGGGAACG TCCTGTCCCG GACCACCGCC CGCGGGGCCG CGGCGCTGCG CGACGTGTGC GGCCTGCCCG CGCTGCCCCT GCTGCTGATC GGCTCGCAGT TCGCGTTCAA CGTCGGCTTC TTCGCGGTCC TGCCCTACCT CGCCGACCAC CTCGGCGGCG CTCTGGGCCT GGCCGGGTGG CTGGTCGGCC TCGTGCTGGG CCTGCGCACC TTCAGCCAGC AGGGTCTGTT CGTGGTGGGC GGCACGCTCA CCGACCGGTT CGGCGCGCGC CCGGTGGTGC TGGCGGGCTG CGCGCTGCGC GTGGCCGGGT TCTGCTGGCT CGGCTTCGCC CAGGGCACCG CGTCGGTCAT CGCCTCGGTA CTGCTCATCG GTTTCGCCGC CGCGCTGTTC TCCCCGGCCG TGGAGACCGA GATCGTGCGC CAGGCCGTGC GGCACGAGCG GAGGACCGGT GAACCCCGCA CCCGGATGCT CGCCCTGTTC TCCGTCGCGG GACAGGCGGG AACCCTCGTC GGACCGCCGC TGGGCGCCCT GCTGCTGCTC GGCGGCTTCC GCGCCGCCTG TCTGGCCGGG GCCGCGGTCT TCGCCGCCGT GCTGGCCGGG CACCTGCGGC TGATGCCCCG CACCGCGCCG GGCGCGCCCG CGTCGGCCGA CCGGGGAACG CCGCGCCTGC GGGAGGGGCT GCGGCTGCTG CTGGGCAACC GCCGGTTCCT GGCGCTGAGC CTGGCCTACA GCGGCTACCT GCTGGCCTAC AACCAGCTCT ACCTGGCCCT GCCAGCGGAG GTGGAGCGGG CGACCGGCTC CCAGACGGCG CTGGGCTGGC TGCTGGCGTG GGCGTCGCTG CTCTACATCG CCACGCAGGT GCCGATGCTG CGCTGGGCCT CCGCCCGTCT GACGCGGCGC GCGACGCTGG TGTGGGGGCT GGCGCTGATC TGCCTGGGCT TCGCCGGGGC CGCCGCGTCG ATGCCGGTCG GCGCGCTGGG CGGGCTCGTG CCCTCCGCGG TGTTCGTCAC CGCGCTGACC CTGGGCCAGG TGCTCATCGT GCCCACGGTG CGGGCCTGGA TCCCCGACCT GGTAGAGGAC CGCCACCTGG GCCTGTTCAC GGGCGCGCTG TCCTCGTTGT CGGGGCTGGT GGTGCTCCTG GGCAGTGTGC CCGCGGGGGC GCTGCTGGAC CTGGGCGGAC CGCTGCCCTG GCTCCTGCTC GCCCTGGTGC CCCTCGCCGG AGCGGTGCTG GCGCCCGGCC GGGGCCGCCC CTGA
|
Protein sequence | MSPARQPRRN GHGSGSGHGK SGDRAGIGNG SGAGSESEPG NGAGNRAGNE SGNGNRAVAG SGNVLSRTTA RGAAALRDVC GLPALPLLLI GSQFAFNVGF FAVLPYLADH LGGALGLAGW LVGLVLGLRT FSQQGLFVVG GTLTDRFGAR PVVLAGCALR VAGFCWLGFA QGTASVIASV LLIGFAAALF SPAVETEIVR QAVRHERRTG EPRTRMLALF SVAGQAGTLV GPPLGALLLL GGFRAACLAG AAVFAAVLAG HLRLMPRTAP GAPASADRGT PRLREGLRLL LGNRRFLALS LAYSGYLLAY NQLYLALPAE VERATGSQTA LGWLLAWASL LYIATQVPML RWASARLTRR ATLVWGLALI CLGFAGAAAS MPVGALGGLV PSAVFVTALT LGQVLIVPTV RAWIPDLVED RHLGLFTGAL SSLSGLVVLL GSVPAGALLD LGGPLPWLLL ALVPLAGAVL APGRGRP
|
| |
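The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene only. The codon table is deliberately partial (just the codons present in that prefix), the start-codon set is a subset of those allowed by table 11, and all names are illustrative assumptions rather than anything defined by the record.

```python
# First 30 bases (10 codons) of the gene sequence in the record.
GENE_PREFIX = "GTGAGCCCGGCACGGCAGCCCCGCAGGAAC"

# Partial codon table: only the codons that occur in GENE_PREFIX.
# A complete translator would need all 64 codons of translation table 11.
CODONS = {
    "GTG": "V", "AGC": "S", "CCG": "P", "GCA": "A", "CGG": "R",
    "CAG": "Q", "CCC": "P", "CGC": "R", "AGG": "R", "AAC": "N",
}

# Subset of table 11 initiation codons; each is read as Met at position 0.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate complete codons of dna, reading an initial start codon as M."""
    residues = []
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


prefix_protein = translate_prefix(GENE_PREFIX)
print(prefix_protein)  # MSPARQPRRN
```

The result matches the first ten residues of the protein sequence in the record.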