Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3024 |
Symbol | |
ID | 9246877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3612528 |
End bp | 3613742 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Thiamin pyrophosphokinase catalytic region |
Protein accession | YP_003680940 |
Protein GI | 297561966 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.468462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.163552 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGCGA TGAAGGTATC GGATGCGCTC AGCGCCACAG TCATGCGGTT CCGCCGCACC CGGGCGGACG CTCCCACAGG GCTGATCGCG CCCGTCCGCT CGGACCGGCG CACCAAGAAC CTGACCAAAC GTCTGCGACC GGGCGACATC GCGGTCATCG ACCACGTCGA CCTGGACCGC GTCAGTGCCG AGGCCCTGGT GGGCTGCGGT GTGTCCGCGG TCCTCAACGT CGCCACGAGC ATCAGCGGCC GCTATCCCAA CCAGGGGCCG GAGATGATCG TGGACGCGGG GATCCCCCTG GTCGACGACG TCGACCCCGA GGTGTTCACC CGCGTCCGCG ACGGCGAGCG GCTCCGCCTG GAGGGCGCCA CCCTGTACCG CGGCGACGAG GTCGTCGCGC GCGGCACCCT CCAGGACCTG GGGACCGTCA GCGCGGCGAT GGAGGAGGCC CGCGCGGGGC TGGCCACACA GTTGGAGGCC TTCGCGGCCA ACACCATGGC CTACCTCCAG CACGAGCGCG ACCTGCTCCT GGACGGCATC GGCATCCCCA CCATCGAGAC CGACATCGAG GGCCGCCACG TCCTCATCGT GGTCCGCGGC TACCACTACC GGGAGGACAT CGCGGCGCTG CGGCCCTACA TCCGCGAGTT CCGGCCGGTC ATCATCGCCG TGGACGGCGG GGCGGACGCG GTCATGGAGG CGGGGTACAA GCCTGACATC ATCGTCGGCG ACTTCGACTC GGTCTCGGAC CGGGCGCTGA CCAGCGGGGC CGAGCTGGTC GTGCACGCCT ACCGCGACGG CAGGGCCCCG GGCCTGCCCA GGCTCACCGA CCTGGGGCAC GGCGCGGTGG TCTTCCCCGC CACGGGCACC AGCGAGGACG TGGCGATGAT GCTCGCCGAC GGCGCGGGCG CCGCGCTCAT CGTGGCCGTG GGCACCCACG CCACCCTGGA GGAGTTCCTC GACAAGGGCC GGGCGGGGAT GGCCAGCACC TTCCTGACCC GGCTGCGGGT GGGCGGCAAG CTCGTCGACG CCAAGGGCGT CAGCCGCCTG TACCGCTCGC GCATCTCCCC CTGGGCCCTG CTCGGGCTGG TCGCGGCCTG CCTGCTCACC ATCGTGGTCG CCGCCTACAG CTCACCGGCG GGCCAGGTCT ACCTCACCTT TCTCGCGGCG CGCTGGGACG CGTTCTCCTA CTGGTTGACA GGGCTGTTGA CGTGA
|
Protein sequence | MPAMKVSDAL SATVMRFRRT RADAPTGLIA PVRSDRRTKN LTKRLRPGDI AVIDHVDLDR VSAEALVGCG VSAVLNVATS ISGRYPNQGP EMIVDAGIPL VDDVDPEVFT RVRDGERLRL EGATLYRGDE VVARGTLQDL GTVSAAMEEA RAGLATQLEA FAANTMAYLQ HERDLLLDGI GIPTIETDIE GRHVLIVVRG YHYREDIAAL RPYIREFRPV IIAVDGGADA VMEAGYKPDI IVGDFDSVSD RALTSGAELV VHAYRDGRAP GLPRLTDLGH GAVVFPATGT SEDVAMMLAD GAGAALIVAV GTHATLEEFL DKGRAGMAST FLTRLRVGGK LVDAKGVSRL YRSRISPWAL LGLVAACLLT IVVAAYSSPA GQVYLTFLAA RWDAFSYWLT GLLT
|
| |