Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2624 |
Symbol | |
ID | 9246475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3130122 |
End bp | 3131150 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | 4-hydroxythreonine-4-phosphate dehydrogenase |
Protein accession | YP_003680547 |
Protein GI | 297561573 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.159271 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00648281 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCGTC CCACACTGGC AGTCACCCTG GGCGACGTGG CGGGGATCGG CCCCGAGATC ACGGCCAAGG CGCTGCTGCA CCACCCCGAG GTCCGCGAGT ACGCCAAGCC GGTCGTGGTC GGTGACGCCG ACGCGCTGCG CAACGCCGTC GCCGCCGTCG GCGGGGACCC GGAGGCGGTC AACACCGTCG CCTCCCCGGC CGAGGCCGCC GACGAGCCCG GCGTGATCGA CGTCGTGCAG ACCGGCCCCT CGCTGGGCCA CGTCCCCCCG GGCGAGCTGA GCGCGGAGGC GGGCGACGGG GCGGCCCGGT TCGTCATCGC CGCCGTGGAC CTGGCCAAGC GCGGCCTGGT GGAGGGCATC GTCACCCCGC CGCTGAACAA GGCCGCGATG CACCTGGGCG GCCACGCCTG GCCCGGGCAC ACCGAGCTGC TCGCGCACGA GTTCGGGGTG AAGGACTACA GCCTGGTGCT GTCGGCGGAC GAGCTGTCCT TCTTCCACCT GACCACGCAC GTGTCGCTGC GCCAGGCCAT CGAGGGCGTC ACCCAGGAGC GCACCCTCCA GGTGCTGCGC CTGATGAGCG CCTTCGCCCG CGCCCAGGGC AGCCCGGACG AGCCCATCGG GGTGGCGGGC CTGAACCCGC ACGCGGGCGA GAACCGCCTG TTCGGCGACG AGGACGCCGA CGTCCTGGCG CCCGCGATCG CCCGCGCCCG CGAGGAGGGC ATCAACGCCC ACGGCCCGCT CCCGGCCGAC GCCCTGATCC CGGCGGCGGT CAAGGGCAAG TGGAAGCTGG TCGCGGTCTG CTACCACGAC CAGGGGCACG CGCCCTTCAA GGCGGTCTAC GGGGACGACG GGGTCAACAT CACCGCGGGC CTGCCGGTGG TGCGCGTCTC GGTCGACCAC GGCACGGCCT TCGACATCGC GGGCCGGGGC ATCGCCCGCG AGGCCAGCCT CGTGCTGGCG ATCCGCCGCG CGGCCGAGCT GGCCCCCGGC TGGGGCCACG TCTGGCAGGC CACCCGCTCC GAGGGGTAG
|
Protein sequence | MSRPTLAVTL GDVAGIGPEI TAKALLHHPE VREYAKPVVV GDADALRNAV AAVGGDPEAV NTVASPAEAA DEPGVIDVVQ TGPSLGHVPP GELSAEAGDG AARFVIAAVD LAKRGLVEGI VTPPLNKAAM HLGGHAWPGH TELLAHEFGV KDYSLVLSAD ELSFFHLTTH VSLRQAIEGV TQERTLQVLR LMSAFARAQG SPDEPIGVAG LNPHAGENRL FGDEDADVLA PAIARAREEG INAHGPLPAD ALIPAAVKGK WKLVAVCYHD QGHAPFKAVY GDDGVNITAG LPVVRVSVDH GTAFDIAGRG IAREASLVLA IRRAAELAPG WGHVWQATRS EG
|
| |