Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2798 |
Symbol | |
ID | 9246649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3341392 |
End bp | 3342705 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | metabolite/H+ symporter, major facilitator superfamily (MFS) |
Protein accession | YP_003680716 |
Protein GI | 297561742 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.606232 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0553764 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACAGA AAGCCTCCTT CGGGCGGGTG GTCACCGCCA GCCTCATCGG CACGACGATC GAGTGGTACG ACTTCTTCCT CTACGGGTCG GCCGCCGCGC TCGTGTTCAA CCACGTCTTC TTCCCCGAGT CCGACCCGCT GGTGGGCACC ATGCTGGCGT TCACCACCTA CGCGGTGGGG TTCGTCGCGC GTCCGCTGGG CGGCCTGGTC TTCGGTCACT TCGGCGACAG GATCGGCCGC AAACAGCTGC TGGTCATCAG CCTGCTGCTG ATGGGCGGAT CGACGTTCGC GATCGGCCTG CTGCCGACCT ACGCGGTGAT CGGTGTGGCC GCTCCGCTGC TGCTGACGCT GCTGCGGGTG GTCCAGGGCT TCGCCCTGGG CGGCGAGTGG GGCGGCGCGG TGCTGCTCGT CGCCGAGCAC GGCGAGCCGC GGCACCGGGG GTTCTGGGCG TCCTGGCCGC AGGCGGGCGC TCCGGGCGGC AACCTGCTGG CCACCGCCGT GCTGGCGGTG CTCGCCGTGG TGATGAGCGA CGCGGCCTTC CTCTCCTGGG GCTGGCGCGT GCCGTTCCTG CTCTCGGGCG TGCTGGTGCT CATCGGCCTG TGGGTGCGCC TGGCGGTCAG CGAGTCGCGG GTCTTCCGCG ACGCGCACGA GCGGGCGGCG GCCTCGGCCC GGCCCGAGCG CGCCCCGATC CTGGGCGTGC TGCGCGACCA CTGGCGCGAG GCCCTGGTGG CCATGGGCGT GCGCATGGCC GAGAACGTGT CGTACTACGT CGTCACCGCG TTCATCCTGG TCTACGCGAC GCAGGAGGCG GGGATGCCCA ACGGCCAGGT GCTCAACGCG GTGCTCGTCG CCTCGGCGGT GCACCTGGTG ACCATCCCCG CCTGGGGAGC CCTCTCGGAC CGGATCGGCC GCAGGCCGGT GACCGCGCTG GGGGCCGCGG GGGCGGGGCT GTGGGTGTTC GCCTTCTTCC CGCTCGTGGA CGCGGGGACC TTCTGGTCGG TGACCCTGGC CGTGACGGTC GGCCTGGTCC TGCACGGGGC CATGTACGGC CCGCAGGCGG CCTTCTTCTC CGAGCTGTTC AGCACCCGCG TGCGCTACTC GGGAGCGTCG GTGGGCTACC AGCTGGCGTC GATCGTGGCG GGCGGGCTGG CCCCGCTGGT CGCGACGGCC CTGCTGGCCT CGTTCGGCAG CAGCGTGCCG GTCTCGCTGT ACGTGGCCGC CATGGCCGCG GTCACCCTGG TCGCCGTCGC GGCCGCCCGC GAGACCAGGG GCCGCGACCT GCGGGAGGTG GCAGTGTCCG AGAGAAGTGG GTGA
|
Protein sequence | MRQKASFGRV VTASLIGTTI EWYDFFLYGS AAALVFNHVF FPESDPLVGT MLAFTTYAVG FVARPLGGLV FGHFGDRIGR KQLLVISLLL MGGSTFAIGL LPTYAVIGVA APLLLTLLRV VQGFALGGEW GGAVLLVAEH GEPRHRGFWA SWPQAGAPGG NLLATAVLAV LAVVMSDAAF LSWGWRVPFL LSGVLVLIGL WVRLAVSESR VFRDAHERAA ASARPERAPI LGVLRDHWRE ALVAMGVRMA ENVSYYVVTA FILVYATQEA GMPNGQVLNA VLVASAVHLV TIPAWGALSD RIGRRPVTAL GAAGAGLWVF AFFPLVDAGT FWSVTLAVTV GLVLHGAMYG PQAAFFSELF STRVRYSGAS VGYQLASIVA GGLAPLVATA LLASFGSSVP VSLYVAAMAA VTLVAVAAAR ETRGRDLREV AVSERSG
|
| |