Gene Information |
Locus tag | Ndas_5436 |
Symbol | |
ID | 9249339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 622949 |
End bp | 624184 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003683321 |
Protein GI | 297564348 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
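The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Ndas_5436.
length = gene_length_bp(622949, 624184)
print(length)                     # 1236
print(protein_length_aa(length))  # 411
```

Under those assumptions, 624184 - 622949 + 1 = 1236 bp and 1236 / 3 - 1 = 411 aa, matching the Gene Length and Protein Length fields.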
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCTCCG AACCCGCTGG CGGCGCCAGG GCCTGGCTCG TGTGGGGCGT CGCCGTGGGC GGGTACTTCC TGGCGATGCT GCACCGCAAC GGTCTGGGTG TGGCCGCCCT GGAGGCCCAG GCCCGCTTCG ACGTCGGACC GGCTCTGCTG TCCCTTCTGC CGATGCTGCA ACTCCTGGTG TACGTGGTCC TCCAGGTGCC CACCGGGTTG CTGGCCGACC GGCTCGGGCC CCGCTACACG CTCGTCATGG GCATGGCGGC GATGACGGTC GGCGCGTGCC TGTTCGCGCT CGCCCCCGGC ATCGAGGTGG CGGTGGCCGG GCGGTTCCTC ATCGGTCTCG GCGACGCCCT GGTCTTCCTC AACGTCATCC GCCTGGCGGC CCTGTGGTTC CCGCGTTCGC GCTACGCCCT CGTCAGCGGG CTCACCGGCG TGGTCGGCGG AACGGGGCAG GTGGCCAGCG CCGCGCCGAT GGCCTGGGCC CTGGAGGGCT TCGGCTGGGT GGCCGCCTTC CTGGCCACCA CGGTCCTGAC CGCGCTGATG GCCCTGCTGA TGCTGGTGGT CGTGCGCGAC CGCCCGGCCG GTGCGGCGGG CCGTTCCACG GTCGCCGACC CGATCTCGGT GTGGGCCGCG CTCAAGGAGG CGCTGCGTTC GCGAGGCCCG CAGATCGGCA TGGCCCACCA CGCGGCCGTC ATGGCCCCGT TCACGATGAT GATGGTGTTG TGGGGTTACC CGTTCCTCGT GGGCGGCCTG GGGCTGTCCG AGGACACCGC CGCCCTCACC CTCACCGCCC TGGCCGCGGG CGGGCTGTGG ATGGCTCCGC TCGCGGGCGT CGTGATCGGG CGCAGCCCCG GCGTGCGCCG GTGGCTGGGG CTCATCCTGA GCACGACGCT GAGCCTGGGC TGGCTGCTGA TGGTGGCGTG GCCCGGGGGA GTCCCGGTGG CCCTGGGGCT GACGGTGCTC GCGGCGAGCG CGATCGGGCA GACCCTGGCG CCGACGGTCT CCTTCGACTT CGCGCGCGAC GGGATCCCCG CGAGCCGGAC CGGCGTCGCC TCCGGCCTGG TGAACATGAG CGGGTTCACG ACCGCCGTGG TGTGCACGGT GGCGGCGGGC GCGCTGCTCC AGACGCTGCC GGAGGGCCCC GAGGCCTACC AGCTGGCGTT CGTGCCGATG GCCGTCGCCA CGGTGTGCGC GACCGCGGCG CTGTACTACT TCGTCCTCCG CCGCCCGCGG GCCTAG
|
Protein sequence | MASEPAGGAR AWLVWGVAVG GYFLAMLHRN GLGVAALEAQ ARFDVGPALL SLLPMLQLLV YVVLQVPTGL LADRLGPRYT LVMGMAAMTV GACLFALAPG IEVAVAGRFL IGLGDALVFL NVIRLAALWF PRSRYALVSG LTGVVGGTGQ VASAAPMAWA LEGFGWVAAF LATTVLTALM ALLMLVVVRD RPAGAAGRST VADPISVWAA LKEALRSRGP QIGMAHHAAV MAPFTMMMVL WGYPFLVGGL GLSEDTAALT LTALAAGGLW MAPLAGVVIG RSPGVRRWLG LILSTTLSLG WLLMVAWPGG VPVALGLTVL AASAIGQTLA PTVSFDFARD GIPASRTGVA SGLVNMSGFT TAVVCTVAAG ALLQTLPEGP EAYQLAFVPM AVATVCATAA LYYFVLRRPR A
|
| |
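The gene sequence begins with GTG while the protein sequence begins with M, which is what translation table 11 (bacterial) gives for an alternative start codon. A minimal sketch of how the Sequence fields could be cross-checked is shown here. It is not the database's own method: the helper names are hypothetical, the start-codon set is a simplified subset (ATG, GTG, TTG) of those table 11 permits, and the codon table is the standard one, which table 11 shares for internal codons.

```python
BASES = "TCAG"
# Standard codon table in TCAG order; table 11 uses the same internal assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplified subset of table 11 start codons (assumption for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate_table11(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
prefix = "GTGGCCTCCG AACCCGCTGG CGGCGCCAGG"
print(translate_table11(prefix))  # MASEPAGGAR
```

The output for the first 30 bases matches the first block of the protein sequence. Passing the full gene sequence to `translate_table11` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.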