Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1370 |
Symbol | |
ID | 9245220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1681330 |
End bp | 1682850 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003679308 |
Protein GI | 297560334 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.855348 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCC AGGCCCCGCA CCGCCGCGCG ACGTGGCGCG AGTGGCTCGG ACTGACGATC CTGACCCTGC CCGTGTTCAT GATGGCCAAC GACGTGTCGG TGCTCTACCT GGCGCTGCCC CGGATCGGCG CCGACCTGCT GCCCTCGGCC GCCCAGTCGC TGTGGATCCT GCACGTGGGC GAGCTGCTGG GCGCCGGGCT GGTGCTGACC ATGGGCCGCC TGGGCGACCG GGTGGGGCGG CGCCTGCTGC TCCTGACCGG CCTGGCCGTG TACGCGGCCG CCTCGGCGGC GGCAGCCTTC GCCCCCGACC CCGTGACGCT CATCGCGGCC CGCGCCGTGC TGGGCGCGTC CGTGGCGGTG ATCTCGCCGT CCGCGCTGGC GCTGCTGCGG CAGATGTTCC CCGACTCCCG GCAGTTCGCC ACCGCCGTGG CGCTGTACCT GAGCGCGTTC TCGGTGGGGA TGGCGCTGGG TCCGCCGCTG GGCGGCCTGT TGCTGGAGTT CCTCTGGTGG GGGTCGGTGT TCCTGGTGAA CGTGCCCGTG GCCCTGTTCG CGCTGGTGAC CCTGCCGTTC CTGCTGCCGG AGTTCCGCGA CCCCGGTGCG GGGCGGCTGG ACCCCGCGAG CGTCCTGCTG TCCACGGCCG CCCTGGTCCT GGTCGTGTTC GGCCTCCAGG AGGCCGTGTC GCGGGGGCCG GAGCCGCCGC TGCTGGCCGC CGTGGCCGGG GGCCTGGCGC TGGGCTGGCT GTTCTGGCGC CGCCAGCGGC GGCTGGACGA CCCGCTGCTG CCCCCGGGGC TGTTCTCCGC TCGGGGCTTC GGGGCGGCGG TGTCCCTGAC CCTGCTGATG CTGCTGGTCG CGGGCGGCCC GAACCTGTTC CTCGTGCAGT TCCTCCAGTC CGCCCTGGAG GTGCCGCCGG GCCTGGTGGG CCTGCTGCTG GTCCTTCCCG CCGTGGCGGG CCTGGCGGGC ACCATGCTCA CCCCGCTGCT GCTGCGGTGG GCGACCGCGG GGCAGGTGCT GGCGCTGTCG ATGCCCGTCG CCCTCGTCGG GCTGGTGTCC ATGGCGGCGT CCGCCGGTCC CGGGTCCCTG TGGGGGCTGG TCACCGGTAC CGTCCTGCTC TCCCTCGGCG GCGGCCCGGC GATGACGCTG GGCAGCCAGC TGGCGCTGTC CGCCGCGCCG CGGGAGCGGA CGGGGACGGC CTCGGCGGTG GTGGACGTGG CCTCGGGCAT GGGGCAGACG CTGAGCCTGG CGCTGCTGGG CGGGCTGGGG CTCGCGGTGT ACCGGGGCGT CCTGGAGGGT TCCGTGCCCG CCGGTGTCCC CTCGGACGCG GCCGAGACCG CCGGTGAGGG CGTCGGCGCC GCCGCGGCCG TCGCCGGTGA CCTGGGCGGC GCGGTGGGGG CCGCGCTGCG CGGGGCCGCC GAGACGGCAC TGGGCACCGC GCTCCAGACC GTCTCCGTGG TCGGCGCGGT GGCGCTGGCC TGCACGATCG TCGTGGTGTC GGTACGGCTG TGGCGGCACC GCCCCGACTG A
|
Protein sequence | MTIQAPHRRA TWREWLGLTI LTLPVFMMAN DVSVLYLALP RIGADLLPSA AQSLWILHVG ELLGAGLVLT MGRLGDRVGR RLLLLTGLAV YAAASAAAAF APDPVTLIAA RAVLGASVAV ISPSALALLR QMFPDSRQFA TAVALYLSAF SVGMALGPPL GGLLLEFLWW GSVFLVNVPV ALFALVTLPF LLPEFRDPGA GRLDPASVLL STAALVLVVF GLQEAVSRGP EPPLLAAVAG GLALGWLFWR RQRRLDDPLL PPGLFSARGF GAAVSLTLLM LLVAGGPNLF LVQFLQSALE VPPGLVGLLL VLPAVAGLAG TMLTPLLLRW ATAGQVLALS MPVALVGLVS MAASAGPGSL WGLVTGTVLL SLGGGPAMTL GSQLALSAAP RERTGTASAV VDVASGMGQT LSLALLGGLG LAVYRGVLEG SVPAGVPSDA AETAGEGVGA AAAVAGDLGG AVGAALRGAA ETALGTALQT VSVVGAVALA CTIVVVSVRL WRHRPD
|
| |