Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4342 |
Symbol | |
ID | 9248217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5176640 |
End bp | 5177662 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | folate-binding protein YgfZ |
Protein accession | YP_003682237 |
Protein GI | 297563263 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.964763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.943713 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCGC CGCTGCTGAG CACACCGGGC GCCGTGAGCG CCGAGTCCCC CGACACCGGA GTCGCCGCGC ACTACGGCGA CCCCGCGCAC GAGGGACGCG CCGCCGAACG TTCGAGCGCG TGGGTCGACC GGAGCAACCG GGGCGTGGTG CGGGTGACCG GCCCCGACCG CCTGGGCTGG CTCAACGACC TCACCAGCCA GCTGACCCGG GGCCTGGCCC CGGGCACCGC CACCGAGGCG CTGGTCCTGG ACACCAAGGG ACACCTGCGG CACCACCTGT CGCTGGTGGA CGACGGCGAG GCCACCTGGA TCCACACCGA GCCGGGGGAC GGCCCGGAGC TGGCGGGGTT CCTCGACTCG ATGCGGTTCA TGCTGCGTGT CGAGGTGGAG GACCTGAGCG GTTCCCACGC GGTGCTGAGC GTGCTCGGTC CGGACCGCGC CAAGGCCGTG GAGGCCGCGT CGCTCGGTGA CGTGACCGCG CGCGCGGTCG AGGGCGAGAC CGACCTGTTC GTGCCCGCCG AACGGCTCGT CGGGGCCGCG GAGGCGCTCA CGGCGGCCGG GGCGCGCCCG GCGGGCATGT GGGCCTACGA GGCGAACCGG ATCGCGGAGC ACCGGGTGCG CGCGGGTCTG GACACCGACG ACCGCACCAT CCCGCACGAG GTGGACTGGG TGGGCCGCGC GGTGCACCTG GAGAAGGGCT GCTACCCGGG CCAGGAGACG GTGGCGCGGG TGCACAACCT GGGCCGTCCG CCGCGCCGTC TGGTCATGCT GCACCTGGAC GGCACCGCCG AGCGCCTGCC GCAGGTGGGG GCCGCCATCG AGCTGGACGG GCGCTCCGTG GGCCGGGTGG GCACGTCGGC CCGCCACCAC GAGCTGGGGC CGATCGCCCT GGGCGTGGTC AAGCGCTCGG CCCCCACCGA CGCCGACCTG GTGGTGGACG GCATCGCCGC CGGTCAGGAG GTCGTGGTCG ACCCGGACAC GGGCGCCAAC GCCAAGATCG AGCTGCGCCG CCGTCCGCGT TAA
|
Protein sequence | MTSPLLSTPG AVSAESPDTG VAAHYGDPAH EGRAAERSSA WVDRSNRGVV RVTGPDRLGW LNDLTSQLTR GLAPGTATEA LVLDTKGHLR HHLSLVDDGE ATWIHTEPGD GPELAGFLDS MRFMLRVEVE DLSGSHAVLS VLGPDRAKAV EAASLGDVTA RAVEGETDLF VPAERLVGAA EALTAAGARP AGMWAYEANR IAEHRVRAGL DTDDRTIPHE VDWVGRAVHL EKGCYPGQET VARVHNLGRP PRRLVMLHLD GTAERLPQVG AAIELDGRSV GRVGTSARHH ELGPIALGVV KRSAPTDADL VVDGIAAGQE VVVDPDTGAN AKIELRRRPR
|
| |