Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3276 |
Symbol | |
ID | 9247138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3911315 |
End bp | 3912697 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003681188 |
Protein GI | 297562214 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAC AGGCACCCCC CGCGCCAGCC GGACAGGGGG AGCGCAGCAC CCGGGACCTC ACCAAGGCCT CCGTGTCCGG GTGGCTGGGG ACCGCCATGG AGTTCATGGA CTTCCAGCTC TACTCCCTGG CCGCCGCCCT GGTGTTCAAC CAGATCTTCT TCCCCGACCT CAACCCCGCG GTCGGCCTGA TCGCGGCGAT GGGCACCTAC GGTGTCGGGT ACGTCGCCCG GCTCGTCGGG GCGGTCTACT TCGGCCGCAT GGGCGACCGC CTCGGCCCCA AGAAGGTCCT GTTCATCACC GTCGCCCTGA TGGGCGTCTC CACCACCCTG ATCGGCGCCC TGCCCACCTA CCAGCAGGTG GGCCTGCTCG CCCCGATCCT GCTCGTGGGC CTGCGGCTGA TCCAGGGCTT CGGCGCGGGC GCCGAGATCG CGGGCGCCAC CGTGATGCTG GCGGAGTACG CCCCGGCCAG GCGTCGGGGG TTCATCGCCT CCCTGGTGTG CCTGGGCACC AACTCCGGCA CCCTGGGCGC CTCCGCGATC TGGGCGGTCC TGGTGTTCGC GCTCTCCGAG GAGCAGCTGC TGTCCTGGGG CTGGCGCCTC CCCTTCCTGG CGAGCTTCCT CCTGCTGCTC CTGGCGCTGT GGATCCGCCT CTCCGTCAAG GAGAGCCCGG TCTTCGAGCA GCGCGAGGAC ATCGTCGACG GTGTGGCGAT GTCCCGGAGC GAACTGGCCG CGGCCGCGGT GAAGGAGGAC AGGAGCGGAC TGGAGACCGC CCTGCACCAG CGCAAGGGCC GCGCGTTCCT GCTCGCCCTC GGCCTGCGCT TCGGCCAGGC GGGCAACTCC GGCATCGTCC AGACCTTCCT CGTCGGCTAC CTCAGCGCCA ACCTGATGCT CAACGACGCG GTCGGCACGT CCGCGATCGT CTACGGCTCC CTGCTCGGCT TCGTCACCGT CCCCCTGGTC GGCGTGCTCG GTGACCGCTT CGGACGCCGT CCCGTCTACC TCTTCCTGAC CGTGGCGAGC ATGCTGTTCG CCGTGCCCAT GATGCTGATG ATCGAGACCG GCGACACCGT GCTCGTGACC GTCGCGATGG TCGTCGGGCT CAACCTGTCG GTCCTGGGCC TGTTCTCCGT GGAGAGCGTC ACCATGGCCG AGCTGTTCGG GGCGCGCACC CGGTTCACCC AGCTGGCCCT GGCCAAGGAG ATCGGCGGCG TCCTGGCCAC CGCGATCGGC CCGGTGCTGG CCGCCACGCT CACCGCCGCG ACCGGCTCCT GGTGGCCGCT GTCGGCGATG ATCATCGCCT ACTCCCTGAT CACCCTGGCC TCCGCCTACC TGTCCCCCGA GGTGCGCGGA CGCGACCTGG TCCGACTGGA GGACGCCGTA TGA
|
Protein sequence | MTEQAPPAPA GQGERSTRDL TKASVSGWLG TAMEFMDFQL YSLAAALVFN QIFFPDLNPA VGLIAAMGTY GVGYVARLVG AVYFGRMGDR LGPKKVLFIT VALMGVSTTL IGALPTYQQV GLLAPILLVG LRLIQGFGAG AEIAGATVML AEYAPARRRG FIASLVCLGT NSGTLGASAI WAVLVFALSE EQLLSWGWRL PFLASFLLLL LALWIRLSVK ESPVFEQRED IVDGVAMSRS ELAAAAVKED RSGLETALHQ RKGRAFLLAL GLRFGQAGNS GIVQTFLVGY LSANLMLNDA VGTSAIVYGS LLGFVTVPLV GVLGDRFGRR PVYLFLTVAS MLFAVPMMLM IETGDTVLVT VAMVVGLNLS VLGLFSVESV TMAELFGART RFTQLALAKE IGGVLATAIG PVLAATLTAA TGSWWPLSAM IIAYSLITLA SAYLSPEVRG RDLVRLEDAV
|
| |