Gene Ndas_0723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0723 
Symbol 
ID9244565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp886050 
End bp887765 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content70% 
IMG OID 
Productthiamine pyrophosphate protein TPP binding domain protein 
Protein accessionYP_003678674 
Protein GI297559700 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACAGA TGCCCGCGAT GAACGCGGTC GTGGAGGTCC TCAAGGACGA GGGCGTCGAC 
ACGGCCTTCG GCTGCCCCGG CGCGGCCATC CTGCCCCTCT ACAAGGCGAT GGAACAGGTC
GGCGGCATCG AGCACCTGAC CGTCCGCCAC GAGGAGGGCG CCACCCACAT GGCCGACGGC
TGGTCCCGGA CCACCGGCAA GGTCGGCGTG GCCATCGGCA CCTCCGGCCC CGCCGGAACC
AACATGATCA CCGGCCTGTA CACGGCCATC GCGGACTCGG TCCCGATCGT GTGCATCACC
GGCCAGCAGC GCACCGACCT GCTGGACAAG GAGGGCTTCC AGGCGGTCGA CATCGTCGAG
ATCGCCAAGC CCGTGACCAA GTGGGCCGTC CAGATCAAGG AGGCCGCCAC CGCGCCCTGG
ATCTTCCGCG AGGCGTTCCG GATCGCCCGG GAGGGACGCC CCGGCCCCGT CCTGGTGGAC
ATCCCCGTGG ACGTCGCCCA GCAGCTGATC GACTACGACC CCGCCATCGA CGCCCCGCTC
AAGGTCAACG CCGTCGAGCC GCACCAGCCC CGGGTGGAGC GCGCGCTGGA CATGCTGCTG
GAGGCCGAGC GCCCCCTGAT CCTGGCCGGA GGCGGCGTGA TCACCGCCGA GGCCTCCGAC
GACCTGCGCG CGCTGGCCGA GCACCTCCAG GTCCCGGTGC AGGTCACCCT CATGGGCAAG
GGCTCCTTCG ACGAGGACTC CCCGCTGTAC TCGGGCATGA CCGGCGTGCA GACCTCCCAG
CGCTACGGCA ACGCCTCCTT CCTGGAGTCG GACCTGGTCC TGGCCGTGGG CGCGCGCTTC
GCCGACCGCC ACACCGGGCA GATCGACGTC TACCGGGGCG AGCGCAGATT CATCCACGTC
GACATCGAGG CCACCCAGAT CGGCCGGGTC TTCGAACCCG ACCTGGGCGT GGTCTCCGAC
GCGCGGCTGT TCCTGCGCGA GCTGCTCGCC GCCGCGCGCG CCCGCGGCGC CAAGGCCGAG
GTGCGGCCCT GGATCCACCG CGTCGCCGAG CTCAAGGCCA CCCTGACCCG CCGCGAGGAC
TTCGACACGG TCCCGGTCAA GGCGCCGCGC GTCTACAAGG AGATCAACGA GGTCTTCGGC
GAGGACACCT ACTTCGTCAC CGCGATCGGC CTGTACCAGA TCTGGGGAGG CCAGCATCAG
AAGGCGTACA AGCCGCGCCA CTACCAGATC TGCGGCCAGG CGGGCCCGCT CGGCTGGGAG
ATCCCCGCCG CCATCGGCGT CAAGAAGGCG CTCAAGCACA CCGAGCCGGA CGCGGAGGTC
GTCGGGATCG TCGGCGACTA CGGGTTCCAG TACATGGTCG AGGAACTGGC CGTGGCCGCC
CAGTACGACG TGCCCTACGT CATCATCATG CTCAACAACG AGTACCTGGG CCTGATCCGC
CAGGCCTCGA TCCCGTTCGA CATGAACTAC CAGGTGGACA TCCACTACGA CGAGTACGGC
ACCGACAACG TCAAGCTCAT GGAGGCCTAC GGCTGCTCCG GGCGCCGCGT CGTGGAGCCC
GGGGAGATCC GCGAGTCCCT GGAGTGGGCC CGCAAGCAGG CCCAGGCCAC CTCGCGGCCG
GTACTCGTGG AGATCATGAT CGAGCGCGAG GCCAACACGC CGCACGGGCC CGCGATCGAC
GCGGTCCGCG AGTTCGAGCC GGTCCCGGGG GCCTGA
 
Protein sequence
MPQMPAMNAV VEVLKDEGVD TAFGCPGAAI LPLYKAMEQV GGIEHLTVRH EEGATHMADG 
WSRTTGKVGV AIGTSGPAGT NMITGLYTAI ADSVPIVCIT GQQRTDLLDK EGFQAVDIVE
IAKPVTKWAV QIKEAATAPW IFREAFRIAR EGRPGPVLVD IPVDVAQQLI DYDPAIDAPL
KVNAVEPHQP RVERALDMLL EAERPLILAG GGVITAEASD DLRALAEHLQ VPVQVTLMGK
GSFDEDSPLY SGMTGVQTSQ RYGNASFLES DLVLAVGARF ADRHTGQIDV YRGERRFIHV
DIEATQIGRV FEPDLGVVSD ARLFLRELLA AARARGAKAE VRPWIHRVAE LKATLTRRED
FDTVPVKAPR VYKEINEVFG EDTYFVTAIG LYQIWGGQHQ KAYKPRHYQI CGQAGPLGWE
IPAAIGVKKA LKHTEPDAEV VGIVGDYGFQ YMVEELAVAA QYDVPYVIIM LNNEYLGLIR
QASIPFDMNY QVDIHYDEYG TDNVKLMEAY GCSGRRVVEP GEIRESLEWA RKQAQATSRP
VLVEIMIERE ANTPHGPAID AVREFEPVPG A