Gene Information |
Locus tag | Ndas_0915 |
Symbol | |
ID | 9244760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1120550 |
End bp | 1122232 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | thiamine pyrophosphate protein domain protein TPP-binding protein |
Protein accession | YP_003678865 |
Protein GI | 297559891 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
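The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Ndas_0915.
length = gene_length(1120550, 1122232)
print(length)                  # 1683
print(protein_length(length))  # 560
```

Both results agree with the Gene Length (1683 bp) and Protein Length (560 aa) fields in the record.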
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.634895 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACAG GCGCAGCAGA ACGCCCCAGC CCGCCCACCG ACACCGCCGC GCACGCGGTG GTGGGGGTGC TGGCCGCGGC CGGGATCAGA CGGTGCTACA CCGTCCCCGG TGAGAGCTTC CTGGAGCTCG CCGACGCGAT CGACCAGCAC CCGCGCATGC AGCTGGTCTC CACCCGGCAC GAGAACGGCG CCGGGTTCAT GGCCGAGGCC GAGGCCAAGC TCACCGGCGT CCCCGCCGTG GCCGCGGCCA CCCGGGGGCC GGGAGCGTCC AACCTGGCGG TGGGCGTGCA CACCGCCATG CAGGACTCGA CGCCGATGAT CGTGTTCCTC GGACAGGCCG AGACCGAGCA CCTGGGCAGA GAGGCCTTCC AGGAGGTGGA CCTCACCGCC TTCTACACGC CCATCACCAA GTGGTCCACC ACCGTGCACC GGGCCGACCG GCTCGCGGAG GTCACCGCGA AGGCGGTCCG CGTCGCCACC ACCGGCAGGC CCGGCCCGGT CGCCATCGCG GTGCCCGGCG ACCTGTTCGG CCAGCGCGTG GGCCCCCAGG ACCCGCCCGG ACCGCCCGTG GTCCCGCGCC CGGTCCTGGG CACGGAGGCC AGGGACCGGC TGGCGACCTG GCTGGCCAGG GCGGTGCGCC CGGTGATCGT CGCGGGCGGT GGCGCCCGCG CGGCCCGGGA GGACCTGATC CGGGTCGCCG AACGCTTCAA CACGGGCGTG TACGCCGCCT GGCGGCGCCA GGACGTGTTC CCCAACGACC ACCCGCTGTA CCTGGGGCAC CTGGGGCTGG GCTGTTCTCC GCCGGTGCTC AACGCCCTGG CGGAGGCCGA CGCGGTCCTG GTGGTCGGCT GCCGGATGAG CGAGACCACC ACCCAGGGGT ACCGGTTGCC CGAACGCGGC GGACGCGTCC GGGTCGCGCA GATCGACATC GATCCGGGGC AGCTCGGGGC CGGTACCGAC CTGTGGTTCG GCGCGGTCGC CGACGCCGGG GAGGCGCTGC GCGAACTCGC CGGGGCGCCG GTCCAGGCGC CCTACCGCGA CTGGAGTTCG GCCCGCCGGG TGTGGGTGGA CACCGCGACG GTGCCCCCCG AGGCCGCCGG GCACACCGGT TCCCGGCTCC ATCCGTGGGC GGTGGTCGCG GGGATGCGCG CGGCGCTGCC CGAGGACGCG CTGATCACCA ACGACGCGGG AAACTTCGCC TCCTTCCTGC ACCGGGGCTG GTGGTTCCGG CACCCGCGCA CCCAGCTGGC GCCGACCAGC GGCGCCATGG GCTACGCCGT GCCCGCCGCG GTGGCGGCCA AGATCGCCGC CCCGGACCGG ACGGTGGTGG CGGTGGCCGG TGACGGCGGC GCCCTGATGA CCGGGCAGGA GCTGGAGACC GCGGTGCGGA TGGACGCGCC GGTCACCGTG GTGGTGTTCC AGAACGGCCT GTACGGCACG ATCGCCATGC ACCAGGCCCG GGAACTGGGG AGGATCGCCG GAACCCGGAT CAGCGGACCG CTGGACCTGG CCGGGTACGC CCGGTCCCTG GGCGCCAGGG GCGCGACCGC GCACACGCGT GAGGAGTTGG AGAAGGCGCT GGCGGAGGCC GTGGGGGCGG ATCTGCCCAC CCTGGTGGAC GTGCGGACGG ACCCGGAGGT GATCAGCCCG GGCGCCACCC TGTCGGGCCT GCTGAACGGT TGA
|
Protein sequence | MATGAAERPS PPTDTAAHAV VGVLAAAGIR RCYTVPGESF LELADAIDQH PRMQLVSTRH ENGAGFMAEA EAKLTGVPAV AAATRGPGAS NLAVGVHTAM QDSTPMIVFL GQAETEHLGR EAFQEVDLTA FYTPITKWST TVHRADRLAE VTAKAVRVAT TGRPGPVAIA VPGDLFGQRV GPQDPPGPPV VPRPVLGTEA RDRLATWLAR AVRPVIVAGG GARAAREDLI RVAERFNTGV YAAWRRQDVF PNDHPLYLGH LGLGCSPPVL NALAEADAVL VVGCRMSETT TQGYRLPERG GRVRVAQIDI DPGQLGAGTD LWFGAVADAG EALRELAGAP VQAPYRDWSS ARRVWVDTAT VPPEAAGHTG SRLHPWAVVA GMRAALPEDA LITNDAGNFA SFLHRGWWFR HPRTQLAPTS GAMGYAVPAA VAAKIAAPDR TVVAVAGDGG ALMTGQELET AVRMDAPVTV VVFQNGLYGT IAMHQARELG RIAGTRISGP LDLAGYARSL GARGATAHTR EELEKALAEA VGADLPTLVD VRTDPEVISP GATLSGLLNG
|
| |
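The GC content field could be recomputed from the gene sequence along the following lines. This is a minimal sketch that assumes the sequence is pasted as shown here, in space-separated blocks of 10 bases; `gc_content` is a hypothetical helper name, and the example call uses only the first block of the sequence rather than the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above (not the whole gene).
first_block = "ATGGCCACAG"
print(gc_content(first_block))  # 60.0
```

Passing the full Gene sequence string to the same function would give the value to compare against the GC content field.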