Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4657 |
Symbol | |
ID | 9248539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5531610 |
End bp | 5532932 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003682549 |
Protein GI | 297563575 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.511231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACTCC ACCAGCAGTC GGCGTCCGGG CCCGCACCGG CGCCGGAGGC GGGCTTCCAG TCGCGGATCA AGGTCATCCG CGCGGCGGTC ATCGGCACCG TCGTCGAGTA CTACGACTTC GGCATCTACG GCTACATGGC CACGTTCGTG GCGATGCTCT TCTTCGTCTC GGAGGACCCG ACGGCGGCCC TGCTGGGCAC GTTCGCCGCG TTCGCCGTCG CGTTCTTCAT GCGCGTGCCC GGCGGCATCC TCTTCGGCCA CATCGGGGAC CGCTACGGGC GCAAGCGGGC CCTGTCCTGG ACCATCCTGC TGATGGTCCT GGCCACCGCG GCCATCGGCG TGCTGCCCAA CTACTACACG CTCGGCGTCT GGGCGACCGT CCTGCTGGTC CTGTGCCGCT GCGTCCAGGG CTTCGCCGCG GGCGGCGAAC TCGGCGGGGC CAACGCCTTC GTGGCCGAGT CCGCCCCGGC CCGCTGGCGG GCCACCCAGA CCTCCCTGGT CAACTCGGGC ACCTACTTCG GCTCGCTGTT CGCCTCGCTG GTGGCGCTCA CCTTCACCAC GGTCTTCACC GAGCAGCAGA TGCTGGACTG GGCGTGGCGC CTGCCGTTCC TGCTCAGCCT GCCCATCGGC GTCATCGGCC TCTACATCCG CAGCCACCTG GACGACACCC CGCAGTTCAA GCAGCTGGAG GACAAGGGCG AGACCGAGCG CATGCCGATC CGGACCCTGC TGGTCACCAA CTGGCGGTCC GTGCTGAAGA TCATCGGCCT GGGCGCGGTG ATCACCGGCG GCTACTACAT CGTCTCGGTG TACGCGGCCA CCTACCTCCA GACCACGGCC GGGCACTCCG CCCAGCTGGC CTTCGCCTCG ACCTCGGTCG CCATGGTCGT CGGCGCCGCC ACGCTGCCGC TCTCGGGCTA CCTGGCCGAC ACCATCGGCC GCAAGAAGGT CATCCTCATC GGCAGCGTCG GGGCCGCGCT CCTGGGCTTC CCGATGTTCA TGATGATGTC CGCCGGCCCG GCCTGGGCCG CGATCGTGGG CCAGACGGCG CTGTTCGTGT GCGTCTCGGT CGTCAACGGC GCCTCGTTCG TCACCTACGC CGAGATGCTG GGCGCCTCCA CCCGCTACAG CGGGATCGCG CTCGGCAACA ACGTCACCAA CACGCTCCTG GGCGGCACCG CGCCCTTCGT CGCGACCTGG CTGATCAGCG CCACCGGCCA GCCGCTGGCT CCCGCGGGCT ACTTCGTCCT CACCGCCCTG GTGACGCTGG TGGCCGTCCT GTTCGTCACC GAGACCCGCG GCACCGAACT GCCGATCGAC TGA
|
Protein sequence | MSLHQQSASG PAPAPEAGFQ SRIKVIRAAV IGTVVEYYDF GIYGYMATFV AMLFFVSEDP TAALLGTFAA FAVAFFMRVP GGILFGHIGD RYGRKRALSW TILLMVLATA AIGVLPNYYT LGVWATVLLV LCRCVQGFAA GGELGGANAF VAESAPARWR ATQTSLVNSG TYFGSLFASL VALTFTTVFT EQQMLDWAWR LPFLLSLPIG VIGLYIRSHL DDTPQFKQLE DKGETERMPI RTLLVTNWRS VLKIIGLGAV ITGGYYIVSV YAATYLQTTA GHSAQLAFAS TSVAMVVGAA TLPLSGYLAD TIGRKKVILI GSVGAALLGF PMFMMMSAGP AWAAIVGQTA LFVCVSVVNG ASFVTYAEML GASTRYSGIA LGNNVTNTLL GGTAPFVATW LISATGQPLA PAGYFVLTAL VTLVAVLFVT ETRGTELPID
|
| |