Gene Information |
Locus tag | Ndas_2172 |
Symbol | |
ID | 9246022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2593995 |
End bp | 2595431 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003680100 |
Protein GI | 297561126 |
COG category | |
COG ID | |
TIGRFAM ID | |
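The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the terminal stop codon; the record does not state either convention explicitly. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(2593995, 2595431)
print(length)                     # 1437
print(protein_length_aa(length))  # 478
```

Both results agree with the Gene Length (1437 bp) and Protein Length (478 aa) rows.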
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.647615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000450319 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCCGCA GCGCCACGGT CTACTTCGCC TCGTACGGGC TGTCGTTGTT GGGCAAGGGC ATCGCCGGCG TGGTGATGCC GCTGCTCGTC CTGGAACGCA CCGGCGACGT GCTGGCGGCG GGCGTGCTCG CGACCGCGTC CACGGCGGCC TCCGCCCTGG CGGGGCTCGT CTGCGGTCTG ATCGTGGACC GGATCGACCG CCGCTGGGTC TCGGTGGTCT CCGACCTGCT CGCCGCCCTG TCGGTGGCGG CGCTGCCGGT CGTCGACGCC CTGTGGGGCC TGAACATGGC CTGGTTCCTG GCCCTGGCCG TCGTGGGGGC GGTGATCCGC GTGCCCGGGA TGACGGCTCA GGAGACGCTG CTGCCCGCCC TGGCCCGACT CGGAGCCGAC CGGCCGGGTG GTCGGGGTGG TCGGGGTGGC GCCGCCGGTC CGGGGAGGCT GGACCGGCTG ATCGCCACCC GCGAGACCAT GGGCAGCGTC CTGCTGCTGG CCGGTCCCGG CCTGGGCGGG CTGCTCGTGG GGCTGCTCGG GCTGTCCTCG GCGCTGCTGC TGGTGACCGC CGTCACGAGC CTGCTCGCCG CGGCCACGAC CCTCCTGCTG GACCCGCGGA CCGGTCGGAT CGACCGTCGG CGCCCCGATC CGGCCGACAC CGCTGAGGCC TCCGGCGCGG CCGCACCTGG CGGCTCGGTG CGCCGGGCGG TGACCGACCT GCTGGAGGGA TGGCGGTTCC TGGGCCGCAC CCGGCTCCTG CTCGGCACCA CTGTGATCGG CGCGGTGCTC GTCGCCGTCC TGAGTTCGCT GCAGACCACC CTCATGCCCG CCTACTTCAC CGGTCAGGGG CTTCCCGGGC TCACCGGGCT GACGCTGAGC GCGCTCGCGG CGGGCAGCAT CGCCGGGTCC GCGCTCTACG CCGCCACCGC CGGGCGGGTC CGCCGCCGCA GTTGGTTCGT CCTCGGCATG AGCGGCACCC TGGTCGGGTT CGGCGCGGTC GGCACCATGG CCTCCCCGTG GCTGGTGCTG GGCGCCGCGG CCCTGGTGGG GATGACCTTC GCACCGGCCT CGGCGGTGCT CGGCGTCCTG ACCGTCGAGG CCACCCCGGA CCGCATGCGC GGCCGGGTCC TGGGGGCGCA GAACACGGTC ATGCTCGCCG CGCCCGCCCT CACCAGCGCC CCGCTCGCGG CCGTGGCCGC CGCGGCCGGG CTGCCCGTCG CCGGAGCCGT CCTCGCCGTG CTGGCGGGCG TCACGGCGGT CGCCGCCCTG GTCGCCCCGG TGTTCCGCTC GCTGGACGAC CCGCGGGGGC GGGAGCCGGT CGGCCCCGAA GGGCGTCCGC GGCCCCGGGA ACGGACGTCC CCCGAACTCG CGCTCGCGGA CACCGTCCCC CACGCGCCGG ACGTCCGCGC GGTGGGGGAA CGCGGGGAAC GCACCGCCGA CCGGTGA
|
Protein sequence | MRRSATVYFA SYGLSLLGKG IAGVVMPLLV LERTGDVLAA GVLATASTAA SALAGLVCGL IVDRIDRRWV SVVSDLLAAL SVAALPVVDA LWGLNMAWFL ALAVVGAVIR VPGMTAQETL LPALARLGAD RPGGRGGRGG AAGPGRLDRL IATRETMGSV LLLAGPGLGG LLVGLLGLSS ALLLVTAVTS LLAAATTLLL DPRTGRIDRR RPDPADTAEA SGAAAPGGSV RRAVTDLLEG WRFLGRTRLL LGTTVIGAVL VAVLSSLQTT LMPAYFTGQG LPGLTGLTLS ALAAGSIAGS ALYAATAGRV RRRSWFVLGM SGTLVGFGAV GTMASPWLVL GAAALVGMTF APASAVLGVL TVEATPDRMR GRVLGAQNTV MLAAPALTSA PLAAVAAAAG LPVAGAVLAV LAGVTAVAAL VAPVFRSLDD PRGREPVGPE GRPRPRERTS PELALADTVP HAPDVRAVGE RGERTADR
|
| |
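The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch, assuming plain Python with no external libraries, shows how such a blocked string might be joined and how a GC fraction like the one in the GC content row could be computed. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the ten-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the Gene sequence row, used here as a short example.
prefix = clean_sequence("ATGCGCCGCA GCGCCACGGT")
print(prefix)               # ATGCGCCGCAGCGCCACGGT
print(gc_fraction(prefix))  # 0.75
```

Applying the same two functions to the complete gene sequence would give the value to compare against the reported GC content.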