Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3881 |
Symbol | |
ID | 9247752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4652473 |
End bp | 4654014 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003681784 |
Protein GI | 297562810 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCTC GACGCGCCAC CGCCCGCACC TGGGCCGGAC TGGCCCTCCT ACTGATCCCG GCACTGCTGG TGTCCATGGA CATCTCGGTG CTCTTCGTCG CCGCGCCCGC CATCACCGAG GCGCTGCGAC CCACGTCCGC GCAGTGGCTG TGGATGATGG ACGTCTACGG CTTCGTGCTC GCCGGACTGC TCGTCACCAT GGGAAGCCTG GGCGACCGGA TCGGCCGCAG GCGCCTGCTG CTCACCGGCG GGGTGCTCTT CGGCGCCGCG TCCGTGCTGC TGGCGCTGGC GCCCTCGCCG GAGCTGTTCA TCGCCGGGCG GGCCCTGCTG GGCGTGGCAG GGGCGACCCT GGCGCCCTCC ACGCTCTCCC TGGTCCGGGA CATGTTCACC GACCCCCGCC AGCGCGGCGC CGCGGTCGGG GCCTGGACCG TCGCCTTCAC CGGCGGCGCC GTCGCCGGGC CGATCCTCGG CGGACTGCTC CTGGAGTTCT TCTGGTGGGG CTCGGCCTTC CTCGTCAACC TGCCGTTCAT GGTCGTGCTG GTGGCCGCCG CACCCCTGCT CGTGCCCGAG TCGCGCGACC CGGAGGCCTC CGGCTTCGAC CTGCCGGGCG CGGGCCTCTC GCTCGCGGCC GTCCTGGGCC TGGTCTACGG CGCCAAGCGC CTGGCCGAGC ACGGGGCCGA CCCCCACGCC CTCACCGCCC TGGCGGCCGG AGCGGCGCTC CTGGCTCTGT TCGTGCTCCG GCAGCGCCGT GCCGCGCACC CGCTGATGGA CCTCTCGCTC CTCGCCCGCC CCGCTTTCAC CGCCGCGATC ATCGGCAACC TGGCCCTGTC CTTCGCCGTC GGGGGGATGG GGCTCCTGAC CTTCACCTTC CTCCAGACCG TGCACGGCCT GAGCCCGCTC CACGCCGCCC TGTGGGCGCT GCCCACGATC CTGGGCACCG TCCTGGGCGC GGTCCTGGCC GGCTCGCTCG CGCCCCGGGC CAGACCCGGC GTGCTCATGG CGGCGGGGCT GGCCCTCAGC GCGGCGGGGT TCGCGGTCGT GGGCCTGGTG GACGCCGACA CCCGCCTGGC GGTGTTCCTC GGCGGCTACA CCCTGCTGAC CCTCGGGGCC GGCGTCGTCG GAACCCTGGC CAACACCCTG GTCCTGGCCA CGGCCCCCCG GGAGCGCGCC GGGGCCGCCG CGGGGATCTC CGAGACCAGC ACCGAGTTCG GCACCGCCCT GGGCATCGCG GTCCTGGGCA CCGCCGCAGG CGCCGTCTAC CGCACCTCCG TGGCGGACGC GCTGCCCTCG GTGGACGGGG CCGCGGCCGA GACCGTCACC GGAGCCCTGG CCGCCGCCCC CCGAGCACAG GACCCCGGGG CCCTGCTCGA CGCGGCCTTC GACGCCTACA CGGCCGGGGT CAACACCGCC GCCCTCACCG GCGCGGGCGT GCTGGCCGCG GTCGCGCTCC TGGTCGCCGT CGCGCTGCGG AGGCTGCCCC CCGCGACCGG CGGGGAGCCC GGCGCACCGG CCCCGGCCGG GGGAGTCCCG GCCCGTCCGT GA
|
Protein sequence | MNPRRATART WAGLALLLIP ALLVSMDISV LFVAAPAITE ALRPTSAQWL WMMDVYGFVL AGLLVTMGSL GDRIGRRRLL LTGGVLFGAA SVLLALAPSP ELFIAGRALL GVAGATLAPS TLSLVRDMFT DPRQRGAAVG AWTVAFTGGA VAGPILGGLL LEFFWWGSAF LVNLPFMVVL VAAAPLLVPE SRDPEASGFD LPGAGLSLAA VLGLVYGAKR LAEHGADPHA LTALAAGAAL LALFVLRQRR AAHPLMDLSL LARPAFTAAI IGNLALSFAV GGMGLLTFTF LQTVHGLSPL HAALWALPTI LGTVLGAVLA GSLAPRARPG VLMAAGLALS AAGFAVVGLV DADTRLAVFL GGYTLLTLGA GVVGTLANTL VLATAPRERA GAAAGISETS TEFGTALGIA VLGTAAGAVY RTSVADALPS VDGAAAETVT GALAAAPRAQ DPGALLDAAF DAYTAGVNTA ALTGAGVLAA VALLVAVALR RLPPATGGEP GAPAPAGGVP ARP
|
| |