Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2261 |
Symbol | |
ID | 9246111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2704074 |
End bp | 2705309 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003680189 |
Protein GI | 297561215 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.484467 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGCCG CCTTCTGGAA CCTGTGGACC TCCTCCGCCC TGTCCAACCT CGCCGATGGC GTCCTGAAGA CGGCGCTGCC GCTGGTGGCG CTGCGCTTCA CCGACTCCCC CGTCCTCATC GCCGGGGTGA CGTTCGCGCT GACCCTCCCG TGGCTGTTCT TCGCGCTCCC GGCGGGCGCC CTGGCCGACC GGCTCGACCG GCGCCGGACG ATGCTCGGCG CCAACCTCGC CCGGGCGCTG CTGCTCGGCG TCCTGGCGCT GTCCCTGGCC CTGGACCTGG GCTCGGTCGG GCTGCTGTAC GCGGTGGCGC TGTGCGTCGG GGTCACCGAG ACCCTCTACG ACACCTCGGC CCAGTCGATC CTGCCGCAGG TGGTCGGCCG CGACCGGCTC CCCTGGGCCA ACGGGCGGCT GCACGCCGCC GAACTGACCG CGAACCAGTT CCTCGGGCCT CCCCTGGGCG GCCTGCTGGT GGCGGCGGGC GCGGCGGCGG CCTTCACCGC CCCGGCGGCG CTGTGGCTGG TCGCGGTGGG CGCGCTGCTG CTGGTGCGCG GCCGGTTCCG GACCGCGCGG GCCGCGCCCG CCACCCTGCG CGCCGACATC GCCGAGGGGC TGCGCTTCCT GTGGCGCGAC CGGATCCTGC GCTCGTTCGC GGCGATGGTG GGCGCCAGCA ACTTCGCCAG CAACGCGGCC TTCACCGTCT TCGTGCTCTT CGCGGTGGGT CCGGACTCGC CCATGGGCCT GTCCGAGCCC GCCTACGGCC TGCTCATGAC CGCCGTCGCC GCGGGCAGCG TGGTGGGCGC CCTGTGCGCC GGGCGGATCG AGCGACTCTT GGGACGCACC CGGGCGCTGC GGACCTGCGC GCTGACCTTC GCCGTGCTCG TGGGCCTGCC CGCGGTGACC GCCGACCCCG TCCTGGTCGC GGCGGGCTTC TTCGCGGGCG GGGTGGGGAT CGCGGTGTGG AACGTGGTCA CGGTGTCGCT GCGCCAGCGG ATCACCCCCG ACCCCCTGCT CGGCCGGGTG AACAGCGCCT ACCGTCTGCT GGCATGGGGC ACCATGCCGC TGGGCGCCGC GACGGGCGGC CTCATCGCGG AGTTCCTGGG ACTGACCTGG GTGTTCGCCT CCATGGGGCT GCTGTGCCTG GGGCTGCTCG CCGGACTGGC CCGGTTGGAC GACGCGGCCC TGACCGCGGC GGAGCACCGG GCCGACGAGA GCCGGCAAGC CCCCGGGGAC CGGTGA
|
Protein sequence | MGAAFWNLWT SSALSNLADG VLKTALPLVA LRFTDSPVLI AGVTFALTLP WLFFALPAGA LADRLDRRRT MLGANLARAL LLGVLALSLA LDLGSVGLLY AVALCVGVTE TLYDTSAQSI LPQVVGRDRL PWANGRLHAA ELTANQFLGP PLGGLLVAAG AAAAFTAPAA LWLVAVGALL LVRGRFRTAR AAPATLRADI AEGLRFLWRD RILRSFAAMV GASNFASNAA FTVFVLFAVG PDSPMGLSEP AYGLLMTAVA AGSVVGALCA GRIERLLGRT RALRTCALTF AVLVGLPAVT ADPVLVAAGF FAGGVGIAVW NVVTVSLRQR ITPDPLLGRV NSAYRLLAWG TMPLGAATGG LIAEFLGLTW VFASMGLLCL GLLAGLARLD DAALTAAEHR ADESRQAPGD R
|
| |