Gene Ndas_0017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0017 
Symbol 
ID9243844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp20922 
End bp22721 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content71% 
IMG OID 
Productthiamine pyrophosphate protein TPP binding domain protein 
Protein accessionYP_003677975 
Protein GI297559001 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0984309 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGAAC TGGTGTCCGA CCACGTACTG AAGCGTCTGC GCGAGTGGGG GGTCGACCGC 
GTCTTCTCCT ACGCGGGCGA CGGCATCAAC GGCCTGCTCG CCGCGTGGGA GCGGGCCGAC
GACCGGCCCC GCTTCATCCA GTCACGGCAC GAGGAACTGG CGGCCTTCGA GGCCACGGGG
TACGCCAAGT TCTCCGGCCG GGTGGGGGTG TGCGCGGCCA CGTCGGGTCC CGGCGCCATC
CACCTGCTGA ACGGCCTCTA CGACGCCAAG CTCGACCACG TGCCGGTGGT GGCCATCCTC
GGCCAGACCG CGCGCAGCGC GATGGGCGGC TCCTACCAGC AGGAGGTCGA CCTGATGTCG
CTGTACAAGG ACGTGGCCAG CGACTACCTC CAGATGGTGA CCGTCCCCGA GCAGCTGCCC
AACGTGCTGG ACCGGGCGAT CCGGATCGCC GCGAGCAGGC GCACGGTCAC AGCGGTCATC
ATCCCCGCCG ACGTCCAGGA TCTGGAGTAC TCGCCGCCCG AGCACGAGTT CAAGATGGTG
CCCTCCAGCC TCGGCCTCCC CTCCCCGCGG TCCACGCCGT CCCCGGAGGG GCTGGCCGAG
GCCGCCGAGA TCCTCAACTC CGGTGAGCGC GTCGCCATGC TGGTCGGACA GGGGGCCAGG
GGAGCGGCGG ACGCCGTCGT CGAGATGGCC GACAGGCTCG GCGCCGGGGT GGCCAAGGCG
CTGCTGGGCA AGGACGTGCT CCCCGACGAC CTGCCCTTCG TGACCGGGTC GATCGGGCTG
CTCGGCACCC GGCCCTCCTA CGAGATGATG CGGGACTGCG ACACCCTGCT CGTGGTGGGA
TCCAGCTTCC CGTACAGCCA GTTCCTGCCC GAGTTCGACC AGGCGCGCGC CGTGCAGATC
GACATCGACC CGACCATGGT CGGCATGCGC TACCCGTTCG AGTGCAACCT GGTCGGCGAC
TCCGCGCAGA CGCTGCGGAT GCTGCTGCCG CTCGTGGAGC GCAAGACCGA CCGCTCCTGG
CGGGAGAAGG TCGAGGACGG CGTCGCGCGG TGGAGGCGGG TCCTCGAACG GCGCGCCCAC
GTGGACGCCG ACCCGGTCAA CCCCGAGCGC GTCTTCCACG AGCTGTCCCC GCTGCTGCCC
GACGACGTGA TGGTGACCGC GGACTCCGGT TCGGCGGCCA ACTGGTACGC GCGCCACCTG
GTGTTCCGCG AGGGCATGCG CGGAACGCTG TCGGGCACGC TGGCCACGAT GTGCCCCGGC
GTCCCCTACG CCACGGGGGC GAAGTTCGCC CACCCCGACA GGCCGGTGGT CGCGCTCGTG
GGGGACGGCG CCATGCAGAT GAGCGGCATC AACGAGCTGA TCACCATCGG CCACTACTGG
AGGGAGTGGG AGGACCCGCG GGTGGTCGTC GCCGTCCTCA ACAACCGCGA CCTCAACCAG
GTGACCTGGG AGCTGCGCGC GATGGGCGGA GCGCCGCAGT TCCTCCCCTC GCAGCGGATC
CCCGACTTCC CCTACGCCGG GTTCGCCGAG AGCATCGGCC TGAGGGGGAT CAAGGTGGAC
GACCCCTCCG ACGTGCGCGA CGCCTGGCAG CGGGCGCTGT CGGCGGACCG GCCCTGCGTG
GTCGAGTTCG TCACCGACCC CGCCGTCCCG CCGATCCCGC CGCACGCGAC GCTCGACCAG
ATGGAGAGCG TGGCCAAGGC CCTGGCCAAG GGCGACCCCG AGGCGTGGTC CGTGGTCAAG
CGGGGCGTCG TGTCCAAGGC ACAGGAGTTC CTGCCGGGAG ACGGGCGGGG AGGCCGCTGA
 
Protein sequence
MAELVSDHVL KRLREWGVDR VFSYAGDGIN GLLAAWERAD DRPRFIQSRH EELAAFEATG 
YAKFSGRVGV CAATSGPGAI HLLNGLYDAK LDHVPVVAIL GQTARSAMGG SYQQEVDLMS
LYKDVASDYL QMVTVPEQLP NVLDRAIRIA ASRRTVTAVI IPADVQDLEY SPPEHEFKMV
PSSLGLPSPR STPSPEGLAE AAEILNSGER VAMLVGQGAR GAADAVVEMA DRLGAGVAKA
LLGKDVLPDD LPFVTGSIGL LGTRPSYEMM RDCDTLLVVG SSFPYSQFLP EFDQARAVQI
DIDPTMVGMR YPFECNLVGD SAQTLRMLLP LVERKTDRSW REKVEDGVAR WRRVLERRAH
VDADPVNPER VFHELSPLLP DDVMVTADSG SAANWYARHL VFREGMRGTL SGTLATMCPG
VPYATGAKFA HPDRPVVALV GDGAMQMSGI NELITIGHYW REWEDPRVVV AVLNNRDLNQ
VTWELRAMGG APQFLPSQRI PDFPYAGFAE SIGLRGIKVD DPSDVRDAWQ RALSADRPCV
VEFVTDPAVP PIPPHATLDQ MESVAKALAK GDPEAWSVVK RGVVSKAQEF LPGDGRGGR