Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1580 |
Symbol | |
ID | 9245430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1932546 |
End bp | 1933760 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003679515 |
Protein GI | 297560541 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0196311 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCACC GACCCCTCCG CGCGCCGGGC CGGGCCCCCT CCGGCACGGA CGCGCGCAGG GCGTTGGCCG CCCTGTGCGT CACCGTCACC GCCAGCCAGG GCGTGCTGTT CTACGCCTTC CCCGTGCTGG CGCCCGCCAT CGCCGAGGAC ACCGGCTGGT CCCTGCCCGC CGTCATCGCC CTGTTCTCCG GGTCCCAGGT CGTGGCCGGA CTGGGCGGGC CCCTGGTGGC CCGCTGGCTG CGCGTGCGCG GCCCCCGGCC GGTGATGACC GCGGCGGCCC TGCTGGGCGC GGTCGCCGTC GCCGGACTGG CCCTGGCCCC GAACCTGTGG TTCTTCGGCG CGGCCTGGCA GGTGGCCGGA GCCGCCGTGG CCGGGCTCTC CTACCCGCCC GCCTTCGCCG CCCTGACCCG CTGGTACGGG CAGGGCAGGG TCCGGGCGCT CACCGCCCTC ACCCTGGTCG GCGGGCTGGC CAGCACCGTC TTCGCCCCGC TGACCGCCGC CCTGGAGGCG CAGGTCGGCT GGCGCGGCGC CTACCTCGCC CTCGCCCTGG TCCTGGCCCT GGTGGTGCTG CCGCTGCACG CCCTGGCCCT GACCCCGCCC TGGACTCCCG GCGGCACCGC CGACCGGGCC GGGCACCGCC GGGCGGTGCG CGGGGTGGTC CGCAGCGGTG CGTTCTGGGC GCTGACCACG GCGCTCGCCC TGGGCACCCT CACCGTGTAC GCGGTCGTGG TCGGTGTCGT CCCGCTCATG GAGGGGCGCG GGTTCGGCAC CGCCGAGGCC GCCTGGACGC TCAGCGCCGT GGGCGTGGGC CAGGTGCTGG GCCGCCTCGT CTACGCCCCG TTGGCGCGGT ACAGCGGGGC GGTGCACCGG ATCGCCGCGG CGCTCCTGGC CTGCGCCGGG GCGACCGGCC TGATCTCCCT GGTCAGCGGG CCGCTGTGGC TGGTCATGAC CGCGGCGGCG CTGGTGGGCG CCGCGCGGGG CGTGCTCACC CTGCTCCAGG CCACGGCCGT GGCGGACCGC TGGGGGGAGG AGCACTACAC CACGCTCAAC GGCATCATGC ACACCCCGCT CATGCTCACG ATGGCCCTGG CGCCGGGGGC GTGCGCGCTG CTGGCCGGGC CCCTGGGCGG CTACCCGGCG GTGTTCCTGC TGCTGGCGGC CCTGTCGGTG CTCGGCGCCC TGGTCGCGCT GGCCAGCGGT CCGGCCCGCC GTTAG
|
Protein sequence | MTHRPLRAPG RAPSGTDARR ALAALCVTVT ASQGVLFYAF PVLAPAIAED TGWSLPAVIA LFSGSQVVAG LGGPLVARWL RVRGPRPVMT AAALLGAVAV AGLALAPNLW FFGAAWQVAG AAVAGLSYPP AFAALTRWYG QGRVRALTAL TLVGGLASTV FAPLTAALEA QVGWRGAYLA LALVLALVVL PLHALALTPP WTPGGTADRA GHRRAVRGVV RSGAFWALTT ALALGTLTVY AVVVGVVPLM EGRGFGTAEA AWTLSAVGVG QVLGRLVYAP LARYSGAVHR IAAALLACAG ATGLISLVSG PLWLVMTAAA LVGAARGVLT LLQATAVADR WGEEHYTTLN GIMHTPLMLT MALAPGACAL LAGPLGGYPA VFLLLAALSV LGALVALASG PARR
|
| |