Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3548 |
Symbol | |
ID | 9247417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4257479 |
End bp | 4258804 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003681455 |
Protein GI | 297562481 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.119426 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.943713 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAGA ACACCGCTTC CGCGCAAGAG CCCGCACGCA CCCCGTTCCA CCGCGCGTGG CTGGTCGCCC TCGTCGCGTG CCTGACCATC GTCGCCGCGG CGGCCTTCGC CGCGATGCCC GGAGTCCTGA CCGATCCCCT GCACGCCGAG TACGGGTGGT CCCGCGGGGC GATCGGTGCG GCGGCCTCGG TCAACATGCT CGTCTACGGC CTGATCGCCC CCTTCGCCGC GGCGCTGATG GACCGGTTCG GTGTCCGCGG GGTCGCCCTG GCGGCCCTGG GCGCCGTCGT GGCGGGGGCC GGACTGACCC TCGTCATGAC CACCGCCTGG CAGCTGACCC TCTACTGGGG GCTGCTCGTC GGCGCGGGCA CGGGGTCACT GGCGATGACG TTCGCCGCCA CGGTGGCCGA CAACTGGTTC GTCCGACGCC GTGGGCTGGT CATCGGCGCC CTGACCGGGG CCAGCGCCTT CGGCCAGCTG GTGTTCCTCC CCGCGCTGGC TTGGATCGTG GACCACCGCG GATGGCGTCC GGCCATCGTG ACCCTGGTGC TGACGGCGGG CGTCATGATC GTCCTCGTCG CCCTGGTGCT GCGGAACCAC CCGGCCGACC TGGGACAGCG CCCCTACGGT TCCCCGGTCT TCGTGGACAG GCCCCGAGCG GACCGGGGCG CGGCGCGCCG GACCGTGCGG GTCCTGTCCT CCTCGGTGCG CAGTCGCCGG TTCTGGCTCC TGGGCGGAGC GTTCGCGATG TGCGGCGCGA CCACCAACGG CATCATGTGG ACCCATTTCG TACCCGCCGC CCAGGACCAG GGGATGGCCG TTACGGTCGC GGCGGCGCTG GTGTCGGCGA TCGGCGTCTT CTCCCTGGTG GGGACGGTCC TGTCCGGATG GCTCACCGAC CGGGTGGACC CGCGCCTGCT CCTGGTCGCC TACTACGCGG GTCGGGGCGT GCTCCTGGCC GCGCTGCCCG CGCTGCTGGG GCCGGACGCC GGAGCGGCGA TGGCGGCCTT CGTCGTGGTG TTCGGCCTGC TCGACGTCGC GACGGTGCCC CCGACCATCC TGCTGTGCCG CCGGCTCTTC GGCGCAGACG GGGCCATCGT CTTCGGGTGG GTCAACGCCG TCCACCAGGT CGGCGCGGGG TCCATGGCGG TCTTCGGCGG CTTCGTCCGC GACGTGGGCG GAAGCTACGG GCCGGTGTGG CTGACGGGCG CCGCCCTGTG CGCCGTCGCC GCGATGCTGG CCCTCAGGGT GCCGCGCGGC ACGGGCTCCG ACCTCGCGGA CGAGGGGTCC GGAGGGGCCC GTGACCACGA GCGCAGGAGG GTGTGA
|
Protein sequence | MAENTASAQE PARTPFHRAW LVALVACLTI VAAAAFAAMP GVLTDPLHAE YGWSRGAIGA AASVNMLVYG LIAPFAAALM DRFGVRGVAL AALGAVVAGA GLTLVMTTAW QLTLYWGLLV GAGTGSLAMT FAATVADNWF VRRRGLVIGA LTGASAFGQL VFLPALAWIV DHRGWRPAIV TLVLTAGVMI VLVALVLRNH PADLGQRPYG SPVFVDRPRA DRGAARRTVR VLSSSVRSRR FWLLGGAFAM CGATTNGIMW THFVPAAQDQ GMAVTVAAAL VSAIGVFSLV GTVLSGWLTD RVDPRLLLVA YYAGRGVLLA ALPALLGPDA GAAMAAFVVV FGLLDVATVP PTILLCRRLF GADGAIVFGW VNAVHQVGAG SMAVFGGFVR DVGGSYGPVW LTGAALCAVA AMLALRVPRG TGSDLADEGS GGARDHERRR V
|
| |