Gene Ndas_0915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0915 
Symbol 
ID9244760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1120550 
End bp1122232 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content75% 
IMG OID 
Productthiamine pyrophosphate protein domain protein TPP-binding protein 
Protein accessionYP_003678865 
Protein GI297559891 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.634895 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACAG GCGCAGCAGA ACGCCCCAGC CCGCCCACCG ACACCGCCGC GCACGCGGTG 
GTGGGGGTGC TGGCCGCGGC CGGGATCAGA CGGTGCTACA CCGTCCCCGG TGAGAGCTTC
CTGGAGCTCG CCGACGCGAT CGACCAGCAC CCGCGCATGC AGCTGGTCTC CACCCGGCAC
GAGAACGGCG CCGGGTTCAT GGCCGAGGCC GAGGCCAAGC TCACCGGCGT CCCCGCCGTG
GCCGCGGCCA CCCGGGGGCC GGGAGCGTCC AACCTGGCGG TGGGCGTGCA CACCGCCATG
CAGGACTCGA CGCCGATGAT CGTGTTCCTC GGACAGGCCG AGACCGAGCA CCTGGGCAGA
GAGGCCTTCC AGGAGGTGGA CCTCACCGCC TTCTACACGC CCATCACCAA GTGGTCCACC
ACCGTGCACC GGGCCGACCG GCTCGCGGAG GTCACCGCGA AGGCGGTCCG CGTCGCCACC
ACCGGCAGGC CCGGCCCGGT CGCCATCGCG GTGCCCGGCG ACCTGTTCGG CCAGCGCGTG
GGCCCCCAGG ACCCGCCCGG ACCGCCCGTG GTCCCGCGCC CGGTCCTGGG CACGGAGGCC
AGGGACCGGC TGGCGACCTG GCTGGCCAGG GCGGTGCGCC CGGTGATCGT CGCGGGCGGT
GGCGCCCGCG CGGCCCGGGA GGACCTGATC CGGGTCGCCG AACGCTTCAA CACGGGCGTG
TACGCCGCCT GGCGGCGCCA GGACGTGTTC CCCAACGACC ACCCGCTGTA CCTGGGGCAC
CTGGGGCTGG GCTGTTCTCC GCCGGTGCTC AACGCCCTGG CGGAGGCCGA CGCGGTCCTG
GTGGTCGGCT GCCGGATGAG CGAGACCACC ACCCAGGGGT ACCGGTTGCC CGAACGCGGC
GGACGCGTCC GGGTCGCGCA GATCGACATC GATCCGGGGC AGCTCGGGGC CGGTACCGAC
CTGTGGTTCG GCGCGGTCGC CGACGCCGGG GAGGCGCTGC GCGAACTCGC CGGGGCGCCG
GTCCAGGCGC CCTACCGCGA CTGGAGTTCG GCCCGCCGGG TGTGGGTGGA CACCGCGACG
GTGCCCCCCG AGGCCGCCGG GCACACCGGT TCCCGGCTCC ATCCGTGGGC GGTGGTCGCG
GGGATGCGCG CGGCGCTGCC CGAGGACGCG CTGATCACCA ACGACGCGGG AAACTTCGCC
TCCTTCCTGC ACCGGGGCTG GTGGTTCCGG CACCCGCGCA CCCAGCTGGC GCCGACCAGC
GGCGCCATGG GCTACGCCGT GCCCGCCGCG GTGGCGGCCA AGATCGCCGC CCCGGACCGG
ACGGTGGTGG CGGTGGCCGG TGACGGCGGC GCCCTGATGA CCGGGCAGGA GCTGGAGACC
GCGGTGCGGA TGGACGCGCC GGTCACCGTG GTGGTGTTCC AGAACGGCCT GTACGGCACG
ATCGCCATGC ACCAGGCCCG GGAACTGGGG AGGATCGCCG GAACCCGGAT CAGCGGACCG
CTGGACCTGG CCGGGTACGC CCGGTCCCTG GGCGCCAGGG GCGCGACCGC GCACACGCGT
GAGGAGTTGG AGAAGGCGCT GGCGGAGGCC GTGGGGGCGG ATCTGCCCAC CCTGGTGGAC
GTGCGGACGG ACCCGGAGGT GATCAGCCCG GGCGCCACCC TGTCGGGCCT GCTGAACGGT
TGA
 
Protein sequence
MATGAAERPS PPTDTAAHAV VGVLAAAGIR RCYTVPGESF LELADAIDQH PRMQLVSTRH 
ENGAGFMAEA EAKLTGVPAV AAATRGPGAS NLAVGVHTAM QDSTPMIVFL GQAETEHLGR
EAFQEVDLTA FYTPITKWST TVHRADRLAE VTAKAVRVAT TGRPGPVAIA VPGDLFGQRV
GPQDPPGPPV VPRPVLGTEA RDRLATWLAR AVRPVIVAGG GARAAREDLI RVAERFNTGV
YAAWRRQDVF PNDHPLYLGH LGLGCSPPVL NALAEADAVL VVGCRMSETT TQGYRLPERG
GRVRVAQIDI DPGQLGAGTD LWFGAVADAG EALRELAGAP VQAPYRDWSS ARRVWVDTAT
VPPEAAGHTG SRLHPWAVVA GMRAALPEDA LITNDAGNFA SFLHRGWWFR HPRTQLAPTS
GAMGYAVPAA VAAKIAAPDR TVVAVAGDGG ALMTGQELET AVRMDAPVTV VVFQNGLYGT
IAMHQARELG RIAGTRISGP LDLAGYARSL GARGATAHTR EELEKALAEA VGADLPTLVD
VRTDPEVISP GATLSGLLNG