Gene Information |
Locus tag | Ndas_1563 |
Symbol | |
ID | 9245413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1913871 |
End bp | 1915145 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003679498 |
Protein GI | 297560524 |
COG category | |
COG ID | |
TIGRFAM ID | |
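
The coordinate and length fields in the table above can be cross-checked against one another. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that Protein Length excludes the terminal stop codon. The function names `gene_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1913871
END_BP = 1915145
PROTEIN_LENGTH_AA = 424


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count of an unspliced CDS, minus one for the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1275
print(expected_protein_length(length_bp))  # 424
```

Under these assumptions both derived values agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields.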
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0974748 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGCGA AGACCGACAC CCCGCGTACA GCACCGCCCG GAGGCGCGGA CCAGGGACAC GCGCCGGAGG GTTCCATCCT CCGGCAGCCG CGCGCCGTGT GGGCCGTGGC GTTCGCCTGC GTGATCGCCT TCATGGGCAT CGGCCTGGTC GACCCGATCC TGCCCGCGAT CTCCCGCAGC CTGGAGGCCA CGCAGACCCA GACCTCGCTG CTGTTCACCA GCTACCTGCT GGTCACCGGC CTGGCCATGC TCGTCACCAG CTGGGTGTCG AGCCGCCTGG GCGCCAAGCG CACCCTGCTG GTCGGCCTGG CGCTGATCGT GGTCTTCGCC GCCGCGGCCG GGGCCAGCGG CAGCGTCGAG TCCGTCATCG GCTTCCGGGC CGGGTGGGGA CTGGGCAACG CGTTCTTCAT CTCCACCGCC CTGGCCACCA TCGTCGGCGC CGCCAGCGGC GGGGCCTCCT CCGCGATCGT GCTCTACGAG GCGGCGCTGG GCCTGGGCAT CGCCGCGGGC CCGCTCCTGG GCGGACTGCT CGGCAGCGTC AGCTGGCGCG GCCCCTTCTT CGGTACCGCC GCGCTGATGG CGGTCGGGTT CGTCGCGATC GCCGTCCTGC TCAGGAGCGA CGCCGCGGAG CGGACCGCGC CGGTGCCGCT GTCGGCCCCG TTCGCGGCGC TGCGCGTCCC GGGCATCCTG GTGCTGGCCC TGGCCGCGCT GTTCTACAAC ATCGGGTTCT TCGCCCTGCT GGCCTACACG CCCTTCCCGC TGGGGCTGGA CGAGATGGGC CTGGGGTTCA CCTTCTTCGG CTGGGGCCTG GCGGTCGCCG TGACCTCGGT CTTCGTGGCT CCGGTGCTCA CCCGCCGGTG GCCGCGCACG CGGGTGCTGT GGGCGACCCT GCTCCTGCTG GCCGCGGACC TGGTCGCCGC GGGCGCGCTC ATCTCGTCCA CGGCGGGGCT GATCACCGCC ATCGTGCTGG GCGGCCTGCT GCTGGGCGTG CTCAACACGG TGCTGACCGA GTGCGTGATG GAGGCCAGCG ACCTGCCGCG TTCGGTGGCG TCCTCGTCCT ACTCGGCGGT GCGCTTCCTC GGCGGCGCCG CCGCTCCCCC GGCCGCCTCC GCGCTCGCCG CGGCCCTGTC CCCCGGCGCG CCGATGTACG CGGCGGCGGT GTCCGTGGCG CTGGCCGCGG GGATCGTGCT CCTGGGCCGC GGGGCGCTGC GCAGGGTGGA CAACGGCCCC GAGTCCGCCC GGGCCGAGGC CGAGGCGATC ACGCTGGGCG AGTGA
|
Protein sequence | MVAKTDTPRT APPGGADQGH APEGSILRQP RAVWAVAFAC VIAFMGIGLV DPILPAISRS LEATQTQTSL LFTSYLLVTG LAMLVTSWVS SRLGAKRTLL VGLALIVVFA AAAGASGSVE SVIGFRAGWG LGNAFFISTA LATIVGAASG GASSAIVLYE AALGLGIAAG PLLGGLLGSV SWRGPFFGTA ALMAVGFVAI AVLLRSDAAE RTAPVPLSAP FAALRVPGIL VLALAALFYN IGFFALLAYT PFPLGLDEMG LGFTFFGWGL AVAVTSVFVA PVLTRRWPRT RVLWATLLLL AADLVAAGAL ISSTAGLITA IVLGGLLLGV LNTVLTECVM EASDLPRSVA SSSYSAVRFL GGAAAPPAAS ALAAALSPGA PMYAAAVSVA LAAGIVLLGR GALRRVDNGP ESARAEAEAI TLGE
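
Both sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before any computation. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation and a basic completeness check; the helper names are hypothetical, and the stop codon set is the standard one for translation table 11. The example input is only the first two blocks of the gene sequence, not the full record.

```python
import re

# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return re.sub(r"\s+", "", raw).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two display blocks of the gene sequence, used here as a small example.
fragment = clean_sequence("ATGGTGGCGA AGACCGACAC")
print(len(fragment))         # 20
print(gc_content(fragment))  # 0.6
```

Applying `gc_content` to the full cleaned gene sequence would give a value to compare with the GC content field, and `looks_like_complete_cds` can be used to confirm that the record ends in a stop codon. `clean_sequence` works the same way on the protein sequence.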