Gene Information |
Locus tag | Ndas_2166 |
Symbol | |
ID | 9246016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2586109 |
End bp | 2587494 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003680094 |
Protein GI | 297561120 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
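The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2586109
END_BP = 2587494


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but encodes no residue.
    return codons - 1 if has_stop else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1386
print(protein_length(length_bp))  # 461
```

Both results match the Gene Length (1386 bp) and Protein Length (461 aa) reported in the record.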
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.203269 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0459407 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCTCAC AGACTGGCCC GCGCGCACCG CGCACCGGTA AGAGCTGGAT CTCCGCCTGG GACCCCGAGG ACGAAGGGTT CTGGAACGGC GGCGGACGAC GCGTGGCCCG CCGCAACCTG TGGGCCTCCA TCGCCTCCGA GCACATCGGC TTCTCGGTGT GGAGCATCTG GTCGGTACTC GTGCTCTTCA TGATCCCCGA GCACGGCTTC TCCACCACCC CCGAGCAGAA GTTCCTGCTC CTGTCGGTGG TCACCCTGGT CGGCGCGATC CTGCGCGTGC CCTACACCCT GGCCGTGCCC GCCCTCGGCG GACGCAACTG GACGGTCATC TCCACCCTGA CCCTGGCCGT GCCCACCGTC GCCGCCTTCT TCCTGGTCCG CGACCCCGAC ACCCCCTTCT GGCTGCTGCT GGTCCTGGCC GCCACCGCGG GCGTGGGCGG CGGCAACTTC TCCTCCTCCA TGGCCAACAT CAACTCCTAC TTCCCCGAAC GGGAGAAGGG GTGGGCGCTG GGCCTGAACG CGGGCGGCGG CAACATCGGC GTGGCCACCG TCCAACTCGT GGGCCTGGCC GTCATCGCCC TGTTCACCAC CTCCGCCGGA CACCTGGTCC CGCTGTTCTA CGCACCGCTG ATCCTGCTGG CCGCCTGGTG GGCGTACCGG GCCATGAACA ACCTGGTCCA CGTGCGCAAC GACGTCTCCG CGCAGCTGTC GGCCGTCCGC GACCGCCACT TCTGGATCAT GTCGCTGCTG TACGTGGGCA CCTTCGGCTC CTTCATCGGC ACCGGGTTCG CCTTCGGCCT GCTGCTGCAG TCCCAGTTCG GGCTGGCGCC CGTGCAGTCC GCCGCGATCG CCGTGCTCGG CCCGGTCATC GGCTCCCTGA TCCGCCCCGT GGGCGGCAGG ATGGCCGACT CCCTGGGCGG GGCCCGCGTC ACCCTGTGGG TCTTCCTGGC CATGGCCGCC TGCGCCGCCG TCCTGGTGCT CTCCGTCCAG GCCGCCCACC TGGCCCTGTT CATCGGCGCG TTCGCGGTGA TGTTCGTCCT CACCGGCCTG GGCAACGGCT CCACCTACAA GATGATCCCC TCCCTGTACG CCGCACGCGC CGAGGACGCC ATCGCCGCCG GGGAACCCCG GGAACAGGCC CTGGCCCGCA CCAAGCGCGT GGCCTCCTCC GTGCTCGGCC TCATCGGCGC GGTCGGCGCC CTGGGCGGGG TGGGAGTCAA CATCGCCTTC CGCGAGTCCT TCGCCGCCAC CGGCTCCACC GCCCCGGCCT TCGTCGTCTT CGGCGCCTTC TACCTGGTGT GCGCCGCCGT CACCTGGGCG GTCTACCTGC GCCGCCCCGC CGCCGCCCCG GTCGGCGCCG CCAGCGTGGA GAGCGCCGAC CGATGA
|
Protein sequence | MASQTGPRAP RTGKSWISAW DPEDEGFWNG GGRRVARRNL WASIASEHIG FSVWSIWSVL VLFMIPEHGF STTPEQKFLL LSVVTLVGAI LRVPYTLAVP ALGGRNWTVI STLTLAVPTV AAFFLVRDPD TPFWLLLVLA ATAGVGGGNF SSSMANINSY FPEREKGWAL GLNAGGGNIG VATVQLVGLA VIALFTTSAG HLVPLFYAPL ILLAAWWAYR AMNNLVHVRN DVSAQLSAVR DRHFWIMSLL YVGTFGSFIG TGFAFGLLLQ SQFGLAPVQS AAIAVLGPVI GSLIRPVGGR MADSLGGARV TLWVFLAMAA CAAVLVLSVQ AAHLALFIGA FAVMFVLTGL GNGSTYKMIP SLYAARAEDA IAAGEPREQA LARTKRVASS VLGLIGAVGA LGGVGVNIAF RESFAATGST APAFVVFGAF YLVCAAVTWA VYLRRPAAAP VGAASVESAD R
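The gene starts with GTG while the protein starts with M, which follows from translation table 11 (bacterial code): GTG is accepted as a start codon and is read as methionine at the first position. The following is a minimal sketch of that rule plus a GC-fraction helper, not the database's own pipeline. The helper names are hypothetical, and only the three most common start codons are listed (table 11 permits a few rarer ones that are omitted here).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 shares these
# assignments with the standard code and differs in its start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is decoded as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from the record.
print(translate("GTGGCCTCAC AGACTGGCCC GCGCGCACCG"))  # MASQTGPRAP
```

The printed residues agree with the first block of the protein sequence above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete 461 aa protein and the reported 71% GC content.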