Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5168 |
Symbol | |
ID | 9249061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 310241 |
End bp | 311815 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | NCS1 nucleoside transporter family |
Protein accession | YP_003683054 |
Protein GI | 297564081 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.409771 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCCACA GCCCGACCCC GGCGGAACCC GCCGCCGTCA CCCACCCCGA CGGCCGGGTG TCCCTCGCCG AGGGGACCTC CCTGCCCGAG GGCCCCTACG TCAACGCCGA CCTCCAACCC GTCCCGATGT CCCGGCGCAC CTGGGGCACC GGCAGCTTCG GCGCCCTGTG GGTGAGCATG TCGGTCAGCA TCCCGGCCTG GACCCTGGCC AGCGGCCTGA TCGCGGCGGG CATGGACTGG CGCCAGGCCA TGCTCGCCGT GGTCCTGGGC AACCTCGTCG TCCTGCTGCC CATGGTGCTG ACCGGCCACG CGGGCGCCAG GTACGGCATC CCCTTCCCCG TGTTCGCGCG CGCCTCCTTC GGCCTGCGCG GCGCCAACCT GCCCGCGCTG CTGCGCGGCG CGGTCGCCTG CGGCTGGTAC GGCATCCAGA CCTGGGTCGG CGGCCAGGGG GTGTACATCC TGCTCGGGCG GCTGCTCGGC GACGGGTGGA CGGGGTCGGC GGCGCTGGGC GGCCAGCCGT GGACGCTGTG GCTGTCCTTC GGCCTGTTCT GGGTCGCCCA GCTCGCGATC ATCCTGTGGG GCATGGAGGG CGTGCGCCGG ACCCAGGTGT GGGCCGCGCC GCTGATGATC CTCGGCGGCG TCGCGCTGCT GGCCTGGATG GCCGTGGAGG CCGGGGGCCT GGCCCCGATG CTGTCCCTGG ACTCGGGCGA GCCCCTGGAC TGGGGGCCCT CCTTCTGGGC GCTGTTCTTC CCGTCGCTGA TGGGCGTGAT CGGCTACTGG GCGACCCTGA CCCTCAACAT CAGCGACTTC ACCCGCTTCT CGTCCTCCCA GCGCGCCCAG GTGGTGGGCC AGACGCTGGG CCTGCCCACC ACGATGACGC TGTTCTCGCT GCTGGCCGTC ATGGTCACGG CGGGCACCGC GGCCGTCTAC GGCGAACCCC TGTGGAACCC GATCGACGTC GTGGCGCGGA TGGACAGCGG GATCGGCCTG CTCTTCGCGG TCTTCGTCGT GCTGCTGGCC ACGGTCTCCA CCAACATCGC GGCCAACCTG GTCGGTCCGG CCTACGACCT GTCCAACCTC AGGCCCCGGC TGATCAGCTT CCGCGCCGGG GCGATCACCA CCTGCGTGCT CAGCGTGCTG ATCATGCCGT GGCGGCTGCT GGAGAACGAG AGCGTCTACA TCTTCACCTG GCTGGGCACG GTGGGCGGCC TCCTGGGCAC CGTGGCGGGC GTCCTGCTCG CCGACTACTG GCTGCTGCGC CGCACGCGGA TGAACCTGCC CGCGCTGTAC GAGCGCGGCT CGGAGTACTG GTACCGGCAC GGGTGGAACT GGCGCGCCCT GGTGGCCTTC GGCGTCGGCT CGGTGCTGGC CGTGGGCGGT TCGCACTCCC CCGAGGGGTC GGGCCCCTTC CCGGCCGAGG GTCTGGTCCC GTTCCTGGCG CCGCTGGCGG ACTACGGGTG GCTCGTGGGG CTGGCCAGCG GCCTGCTGCT GCACTGGGGC CTGGGCGTGC TCCTGCCCCA CCGGGACGCG GGGGAGCGGG CCGCGAGGAG GACGGAGGGG GCCGCCGCCG GCTGA
|
Protein sequence | MTHSPTPAEP AAVTHPDGRV SLAEGTSLPE GPYVNADLQP VPMSRRTWGT GSFGALWVSM SVSIPAWTLA SGLIAAGMDW RQAMLAVVLG NLVVLLPMVL TGHAGARYGI PFPVFARASF GLRGANLPAL LRGAVACGWY GIQTWVGGQG VYILLGRLLG DGWTGSAALG GQPWTLWLSF GLFWVAQLAI ILWGMEGVRR TQVWAAPLMI LGGVALLAWM AVEAGGLAPM LSLDSGEPLD WGPSFWALFF PSLMGVIGYW ATLTLNISDF TRFSSSQRAQ VVGQTLGLPT TMTLFSLLAV MVTAGTAAVY GEPLWNPIDV VARMDSGIGL LFAVFVVLLA TVSTNIAANL VGPAYDLSNL RPRLISFRAG AITTCVLSVL IMPWRLLENE SVYIFTWLGT VGGLLGTVAG VLLADYWLLR RTRMNLPALY ERGSEYWYRH GWNWRALVAF GVGSVLAVGG SHSPEGSGPF PAEGLVPFLA PLADYGWLVG LASGLLLHWG LGVLLPHRDA GERAARRTEG AAAG
|
| |