Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4004 |
Symbol | |
ID | 9247876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4787657 |
End bp | 4788637 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | 4-diphosphocytidyl-2C-methyl-D-erythritolkinase |
Protein accession | YP_003681907 |
Protein GI | 297562933 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.144756 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGTTA ATACCACCGT AACCGTGCGG GTCCCCGCGA AGGTGAACCT CCAGTTGGCG GTGGGGCCCA GACGTGAGGA CGGGTTCCAC GACCTCGTCA ACGTGTTCCA CGCCGTCTCG CTGTTCGATG AGGTTACCGT CCAGGCGGCT TCCTCGCGCC GACCGAGCCC GCCCGGCGGT GTCGGATCGG CCGCCGAAGC CTTCGCCGGA CTGACCGCGG GCGGCGCGCT GGGGTCCCAC CTGTCCCGGG TGCCCCTGGA CGGGTCCAAC CTGGCCGCGC GCGCGGTGAC CCTGCTGGCG GAGCGGACCG GCCGGGGCCT CCCCGTCACG GTCCACCTGG ACAAGAACAT CCCGGTGGCG GGCGGCATGG CCGGGGGCAG CGCGGACGCC GCGGCCGCCC TGCTGGCCTG CGACCGTCTG TGGGGCTGCG GCCTGCCGCT GGAGGAACTG CTCGGGTACG CGGCCGACCT GGGCAGCGAC GTCCCCTTCG CGCTGCTGGG CGCCACCGCG GTGGGCATGG GGCGCGGGGA GGTCCTGCGG CCGATGACCA GTCCGGGAAG GTTCCGCTGG GTCTTCGCCC TGTCGCCGCA CGGCCTGTCC ACGGCCAGCG TGTTCGCCGA GTACGACCGC ATCCGCCCGG ACGCCCCCGA GCCCCTGCTG GACCGGGGGC TGGCCGACGC GCTCGCCGGC GGCGACCCGG CGGCCCTGGG AGCCGCGCTG ACCAACGATC TCCAGGAGGC CGCGCTCTCG CTGCTGCCCG AGCTGGAGCG CACCCTCAAG ACCGGCGCCG GTGCGGGGGC GCTGGGCTCC CTGGTGTCGG GGTCCGGTCC GACCTGCGCC TTCCTGGTCG GCGGCGGCGC CGAGCGGGAG TCGGTCGTCG CGGAGCGGGA GGCCGCGGTG GTGGCGGCGC TGGAGGGCAG CGGGCTGTGC GAACAGGTCG TGACGGCGTA CGGGGACGTC CCCGGGGCCG CCGTCCTGTG A
|
Protein sequence | MTVNTTVTVR VPAKVNLQLA VGPRREDGFH DLVNVFHAVS LFDEVTVQAA SSRRPSPPGG VGSAAEAFAG LTAGGALGSH LSRVPLDGSN LAARAVTLLA ERTGRGLPVT VHLDKNIPVA GGMAGGSADA AAALLACDRL WGCGLPLEEL LGYAADLGSD VPFALLGATA VGMGRGEVLR PMTSPGRFRW VFALSPHGLS TASVFAEYDR IRPDAPEPLL DRGLADALAG GDPAALGAAL TNDLQEAALS LLPELERTLK TGAGAGALGS LVSGSGPTCA FLVGGGAERE SVVAEREAAV VAALEGSGLC EQVVTAYGDV PGAAVL
|
| |