Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0723 |
Symbol | |
ID | 9244565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 886050 |
End bp | 887765 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | thiamine pyrophosphate protein TPP binding domain protein |
Protein accession | YP_003678674 |
Protein GI | 297559700 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACAGA TGCCCGCGAT GAACGCGGTC GTGGAGGTCC TCAAGGACGA GGGCGTCGAC ACGGCCTTCG GCTGCCCCGG CGCGGCCATC CTGCCCCTCT ACAAGGCGAT GGAACAGGTC GGCGGCATCG AGCACCTGAC CGTCCGCCAC GAGGAGGGCG CCACCCACAT GGCCGACGGC TGGTCCCGGA CCACCGGCAA GGTCGGCGTG GCCATCGGCA CCTCCGGCCC CGCCGGAACC AACATGATCA CCGGCCTGTA CACGGCCATC GCGGACTCGG TCCCGATCGT GTGCATCACC GGCCAGCAGC GCACCGACCT GCTGGACAAG GAGGGCTTCC AGGCGGTCGA CATCGTCGAG ATCGCCAAGC CCGTGACCAA GTGGGCCGTC CAGATCAAGG AGGCCGCCAC CGCGCCCTGG ATCTTCCGCG AGGCGTTCCG GATCGCCCGG GAGGGACGCC CCGGCCCCGT CCTGGTGGAC ATCCCCGTGG ACGTCGCCCA GCAGCTGATC GACTACGACC CCGCCATCGA CGCCCCGCTC AAGGTCAACG CCGTCGAGCC GCACCAGCCC CGGGTGGAGC GCGCGCTGGA CATGCTGCTG GAGGCCGAGC GCCCCCTGAT CCTGGCCGGA GGCGGCGTGA TCACCGCCGA GGCCTCCGAC GACCTGCGCG CGCTGGCCGA GCACCTCCAG GTCCCGGTGC AGGTCACCCT CATGGGCAAG GGCTCCTTCG ACGAGGACTC CCCGCTGTAC TCGGGCATGA CCGGCGTGCA GACCTCCCAG CGCTACGGCA ACGCCTCCTT CCTGGAGTCG GACCTGGTCC TGGCCGTGGG CGCGCGCTTC GCCGACCGCC ACACCGGGCA GATCGACGTC TACCGGGGCG AGCGCAGATT CATCCACGTC GACATCGAGG CCACCCAGAT CGGCCGGGTC TTCGAACCCG ACCTGGGCGT GGTCTCCGAC GCGCGGCTGT TCCTGCGCGA GCTGCTCGCC GCCGCGCGCG CCCGCGGCGC CAAGGCCGAG GTGCGGCCCT GGATCCACCG CGTCGCCGAG CTCAAGGCCA CCCTGACCCG CCGCGAGGAC TTCGACACGG TCCCGGTCAA GGCGCCGCGC GTCTACAAGG AGATCAACGA GGTCTTCGGC GAGGACACCT ACTTCGTCAC CGCGATCGGC CTGTACCAGA TCTGGGGAGG CCAGCATCAG AAGGCGTACA AGCCGCGCCA CTACCAGATC TGCGGCCAGG CGGGCCCGCT CGGCTGGGAG ATCCCCGCCG CCATCGGCGT CAAGAAGGCG CTCAAGCACA CCGAGCCGGA CGCGGAGGTC GTCGGGATCG TCGGCGACTA CGGGTTCCAG TACATGGTCG AGGAACTGGC CGTGGCCGCC CAGTACGACG TGCCCTACGT CATCATCATG CTCAACAACG AGTACCTGGG CCTGATCCGC CAGGCCTCGA TCCCGTTCGA CATGAACTAC CAGGTGGACA TCCACTACGA CGAGTACGGC ACCGACAACG TCAAGCTCAT GGAGGCCTAC GGCTGCTCCG GGCGCCGCGT CGTGGAGCCC GGGGAGATCC GCGAGTCCCT GGAGTGGGCC CGCAAGCAGG CCCAGGCCAC CTCGCGGCCG GTACTCGTGG AGATCATGAT CGAGCGCGAG GCCAACACGC CGCACGGGCC CGCGATCGAC GCGGTCCGCG AGTTCGAGCC GGTCCCGGGG GCCTGA
|
Protein sequence | MPQMPAMNAV VEVLKDEGVD TAFGCPGAAI LPLYKAMEQV GGIEHLTVRH EEGATHMADG WSRTTGKVGV AIGTSGPAGT NMITGLYTAI ADSVPIVCIT GQQRTDLLDK EGFQAVDIVE IAKPVTKWAV QIKEAATAPW IFREAFRIAR EGRPGPVLVD IPVDVAQQLI DYDPAIDAPL KVNAVEPHQP RVERALDMLL EAERPLILAG GGVITAEASD DLRALAEHLQ VPVQVTLMGK GSFDEDSPLY SGMTGVQTSQ RYGNASFLES DLVLAVGARF ADRHTGQIDV YRGERRFIHV DIEATQIGRV FEPDLGVVSD ARLFLRELLA AARARGAKAE VRPWIHRVAE LKATLTRRED FDTVPVKAPR VYKEINEVFG EDTYFVTAIG LYQIWGGQHQ KAYKPRHYQI CGQAGPLGWE IPAAIGVKKA LKHTEPDAEV VGIVGDYGFQ YMVEELAVAA QYDVPYVIIM LNNEYLGLIR QASIPFDMNY QVDIHYDEYG TDNVKLMEAY GCSGRRVVEP GEIRESLEWA RKQAQATSRP VLVEIMIERE ANTPHGPAID AVREFEPVPG A
|
| |