Gene Ndas_5088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5088 
Symbol 
ID9248977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp231122 
End bp232213 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content71% 
IMG OID 
Productthiamine pyrophosphate protein domain protein TPP-binding protein 
Protein accessionYP_003682975 
Protein GI297564002 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.993677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.910187 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGAGA ACGTGACGGG TACCGGCGCG AACGGCCACG TCCACGGGGT CCCGGAGGCC 
CTGGGCGGGC TGCGCCTGGT GCCCAGGACC GACACCGCGT ACAAGATGAA GGACTTCAAG
TCCGACCAGG AGGTGCGCTG GTGCCCGGGC TGCGGCGACT ACGCGATCCT GGCCGCCTTC
CAGTCCTTCC TGCCCGAGCT GGGCGTGCCG CGCGAGAACG TGGTGATGGT GTCGGGTATC
GGCTGCTCCT CCCGATTCCC GTACTACCTG AGCACGTACG GCATGCACTC GATCCACGGG
CGCGCCCCGG CGATCGCGAC CGGGCTGGCC ACCAGCCGCC CGGACCTGTC GGTGTGGGTG
GTGACCGGTG ACGGCGACGG GTTGTCCATC GGCGGCAACC ACCTCGTCCA CGCGCTGCGC
CGCAACGTCA ACATCAACAT CCTGTTGTTC AACAACCGGA TCTACGGGCT GACCAAGGGT
CAGTACTCCC CCACCTCCGA GCCGGGCAAG ATCACCAAGT CCTCGCCGGT GGGGTCGCTG
GACCACCCGT TCAACCCGCT GTCGCTGGCG CTGGGCGCGG AGGCCACGTT CGTGGCCCGC
ACGATCGACT CCGACCGCAA GCACCTCACG TCGGTGCTGC GGGCGGCGGC CGACCACCCC
GGCGCGTCGT TCGTGGAGAT CTACCAGAAC TGCCCGATCT TCAACGACGA CGCGTTCGAG
CCGCTGAAGG ACCCGGCGGC GCGGGACGTC CGGCTGCTGC GCCTGGAGCA CGGCGAGCCG
CTGCGGCTGG GCCCGGACCG GGGCGTGGTC GCCGGGGAGT TCGGCGGCTT GGAGGTCGTG
GACGTGGACT CGGTGGGAGA GGACCGGCTG CTGCGGCACG ACGCGCACCG GGAGGACCCG
GGGTACGCGT TCGCGCTGTC GCGCCTGGAC CAGCCCGCGT TCGAGCACGT GCCGATCGGG
GTGCTGCGGG ACGTGCGCCG CCCGGCCTAC GACGAGCTGG TGAACGAGCA GGTGGCCGAC
GCGCGGGCCG AGCGCGGCGC CGGCGAGCTG GCCGCCCTTC TGGCCAGCGG GGACACCTGG
AGGGTGGAGT AG
 
Protein sequence
MTENVTGTGA NGHVHGVPEA LGGLRLVPRT DTAYKMKDFK SDQEVRWCPG CGDYAILAAF 
QSFLPELGVP RENVVMVSGI GCSSRFPYYL STYGMHSIHG RAPAIATGLA TSRPDLSVWV
VTGDGDGLSI GGNHLVHALR RNVNINILLF NNRIYGLTKG QYSPTSEPGK ITKSSPVGSL
DHPFNPLSLA LGAEATFVAR TIDSDRKHLT SVLRAAADHP GASFVEIYQN CPIFNDDAFE
PLKDPAARDV RLLRLEHGEP LRLGPDRGVV AGEFGGLEVV DVDSVGEDRL LRHDAHREDP
GYAFALSRLD QPAFEHVPIG VLRDVRRPAY DELVNEQVAD ARAERGAGEL AALLASGDTW
RVE