Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1168 |
Symbol | |
ID | 9245018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1424282 |
End bp | 1425535 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003679115 |
Protein GI | 297560141 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.262607 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGCGC AGGTGGCCAG TGTGTCGGCG ATGTTCGGCG TGCCCGTCGT GCTGCCGCAG CTGAGGGAGG CCTTCGGCGT CACCACCGCG CAGGCCGGGA TGCTGGCCGG TCTGCCCTCG CTCGGCCTGC TGCTCACGCT CCTGGTGTGG GGCCTGGTGA TCGACCGGTT CGGGGAGCGG CCGACGATGG CGCTGAGCCT GGTGCTCACG GCGTGCGCGC TCGGCCTGCT CTGGGCGGTC TCGGGGCTGT GGGGGGCCGT CGCGGTCCTG CTGCTGGTGG GCGCGGTCGG CGGCCCGGTC AACGCGGCGA GCGGACGGCT GGTGCTGTCG TGGTTCTCCG AGCGCGAACG CGGACTGGCC ATGGGCGTCC GCCAGTCGGC GCTGCCCCTG GGCGTTGGCC TCTCCGCCCT GGCGCTGCCC TGGGCCGCCG GTGCCTGGGG GTTCGTCGGC GCGATGACCC TCCCGGCGGT GCTCGCGCTC CTGGCGGTCC CGTTCGTGCT GGCGATGCCC GCCTCCGCTC CCGGGCGGCC CCCGGCACCG GGGGCCGCCG GGGACACCGC GGCCGGGGAC ACCGCGAGCG CGGATGCCGC TGACGACGCC GCGCGCACCG AGGCCCCCGC GGGCGAGGAC GCCGCTGACA CCAGCGGCGG CGCCGCGCGC CCCGCGGCCG GGTCGCCCTA CCGGCACGGG CACATCTGGC GGGTGCACGG GGTGAGCATG CTGCTCGGCG TCCCGCAGAT CGCGCTGATG ACTTACGCCG TCATCTACCT GGTGCAGGAA CAGGGGTGGA GCCCGCCCGC CGCGGGCGCG GTCGTCGCGG TGGCGCAGGT GCCCGGGGCG GCCGGGCGGC TGCTGCTGGG CGTCTGGTCC GACCGGACCG GGGACCGGAT GCGCCCGGTG CGGTGGCTGG CGGTGGTCTC GGGGGTCTGC CTGGTGCTGC TCGCGGCGCT GCCGGGGGCC TGGGCGTGGA CGGCCGTACC GCTGCTCCTG GCCTGCCTGG TCCTGACGAT GTGGCACAAC GGCCTGACCT TCACCGCCGT CGCCGAGACG GCGGGAACCC CATGGGCGGG GCGGGCCCTG GCCGCGCAGA ACTGGCTCCA GGCGGTCAGC ACGACCCTGA CGCCGGTGCT CATGGCGGCG GTGATCACCC TCGCGGGCCT GCCCTCCGTC TTCGCGGTGA GCGCGCTCTT CTCCGCCGCG GCCGTCGCCG TGGTCCCGGT GGCGCGGGGG CGTCAAGAGA AATGGGGTGT TTGA
|
Protein sequence | MVAQVASVSA MFGVPVVLPQ LREAFGVTTA QAGMLAGLPS LGLLLTLLVW GLVIDRFGER PTMALSLVLT ACALGLLWAV SGLWGAVAVL LLVGAVGGPV NAASGRLVLS WFSERERGLA MGVRQSALPL GVGLSALALP WAAGAWGFVG AMTLPAVLAL LAVPFVLAMP ASAPGRPPAP GAAGDTAAGD TASADAADDA ARTEAPAGED AADTSGGAAR PAAGSPYRHG HIWRVHGVSM LLGVPQIALM TYAVIYLVQE QGWSPPAAGA VVAVAQVPGA AGRLLLGVWS DRTGDRMRPV RWLAVVSGVC LVLLAALPGA WAWTAVPLLL ACLVLTMWHN GLTFTAVAET AGTPWAGRAL AAQNWLQAVS TTLTPVLMAA VITLAGLPSV FAVSALFSAA AVAVVPVARG RQEKWGV
|
| |