Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2213 |
Symbol | |
ID | 9246063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2642788 |
End bp | 2644074 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003680141 |
Protein GI | 297561167 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.175074 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000120086 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCATTGGA CGTATGAAAC CGGTCTTTCT AGCGTGGACG GTATGAGCCT GACCAGTGCC TCGCCGCCTG ATGACGACAC CCGTCCCCGT GCGGGGACCG GCGCCTACCG GCGCACGGTC ATCGCGCTGG TGGCCGCCGC CGTGGCGACC TTCGCCCAGT TGTGGGCGGT CCAGCCGATC CTGCCCGCGA TCGCGGAGGG TTTCGGCGCC TCGGCCTCCC AGGCCGCGCT CGCGGTCTCG CTGGCCACGG GCGGCCTGGC CGGGTTCACC CTGGTCTGGA GCGGGGCGGC GGACCGCTTC GGTCGTGCGC GTGTCATCGG TGTCTCCCTG CTGGCCGCGA CGCTGCTGGG CTGTGTGATC CCCTTCGTCG CCGATCTGTG GCCGCTGCTG GTGCTGCGCG CGTTGCAGGG TGCGGCTCTG GGCGGGGTGC CGGCCGCGGC GGTGGCCTAC CTGTCGGAGG AAATCCACCC GGCCGACGCC TCGCGTGCCA CGGGCCTGTA CATCGCGGGC AACCCGTTGG GCGGGATGGG CGGGCGTCTG CTGGCCGGGT TCGCCGCGGA TCTGGGCGGC TGGCAGTGGG GGATCGCCGC CAACACCCTG CTGGCCCTGG TCGCGCTGGT TGTGTTCGCG CTGGTCCTGC CCCGTCGGCC GCGGGCCGTG CGCACCGTCG CGGTGCGGGG GGAGGCGTCC CCGTCGCGGG GGTCGGGCGG GGGCGGGGTG GGCGGGCGCC TGCGTGCGGC GGTGACCACG CCCGGGCTGA TCGCCCTCTA CACCCAGGCC CTGTTGCTGA TGGGCGCCTT CATGACGGTC TACAACCTGT TGGGTTTTCG TCTGATGGCC GAGCCCTTCG GGCTCTCCCA GGCCGCGGCC TCGCTGCTCT TCCTGTCCTA CACGGCGGGG ATGCTGGGTT CGGCGGTGGC GGGGGGAGCC AGCGCGCGTT GGGGCGGGTA CGCGGTGCTC ACCACGGCGA CCGTGTTGAT GGCCGCCGGG TTGGGCGGGA TGTTCGCCAC GGCTTTGCCG GGTCTGCTGG CGGCCCTGTT GGTGATGACC TTCGGTTTCT TCTGTGCGCA CGCCACCGCC TCGGCGTGGG TGGGTACCCG CGCGGTGCGG GGGCGGGCCC AGGCGATGGC GGTCTACACG CTGGCCTACT ACCTGGGGTC GAGCCTGTTC GGCTGGTTGG GCGGTCTGGT CTACGACGCC GTGGGCTGGG GTGGGGCGGT GGTCTTCGCG TTGGGGTTGT GCTCGGTGGC CGCGGCGGCG GGTCTGCGTC TGCGCCGTCT GCGGTAG
|
Protein sequence | MHWTYETGLS SVDGMSLTSA SPPDDDTRPR AGTGAYRRTV IALVAAAVAT FAQLWAVQPI LPAIAEGFGA SASQAALAVS LATGGLAGFT LVWSGAADRF GRARVIGVSL LAATLLGCVI PFVADLWPLL VLRALQGAAL GGVPAAAVAY LSEEIHPADA SRATGLYIAG NPLGGMGGRL LAGFAADLGG WQWGIAANTL LALVALVVFA LVLPRRPRAV RTVAVRGEAS PSRGSGGGGV GGRLRAAVTT PGLIALYTQA LLLMGAFMTV YNLLGFRLMA EPFGLSQAAA SLLFLSYTAG MLGSAVAGGA SARWGGYAVL TTATVLMAAG LGGMFATALP GLLAALLVMT FGFFCAHATA SAWVGTRAVR GRAQAMAVYT LAYYLGSSLF GWLGGLVYDA VGWGGAVVFA LGLCSVAAAA GLRLRRLR
|
| |