Gene Information |
Locus tag | Tneu_0530 |
Symbol | |
ID | 6165724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 479807 |
End bp | 481138 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641667683 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001793919 |
Protein GI | 171185000 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
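
The coordinate and length fields above agree with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, but both match its numbers. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(479807, 481138)
print(length)                  # 1332
print(protein_length(length))  # 443
```

Both values match the Gene Length (1332 bp) and Protein Length (443 aa) fields.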
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0702223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.172479 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGAAAA CCTTTTTATA TGGAATCGAT CAGCTGTTTA TGGCAAAGAC GTTAATTCAG CAGGCCAGGG AGGGCAGAGC GCCTCCCGAG CTTGAGAGGG TGGCTAAGGC GGAGGACGTA AGCGTGGCTA AGCTCCGGGA CCGCCTGGCT CGGGGGCAGG CGGTGGTTTT GACCAACGCC AAGTCGCCGC CTAGGAGGCT CACCGGCGTG GGGAAGGGGC TACACACGAA GGTCAACGTC AACCTGGGGA CCTCCTCGGA GGTGGTGGAC CTCGGGGCGG AGCTGAAGAA GGTGGAGGTG GCGAATAGGT GGGGCGACAC GTTGATGGAT CTAAGCGTCG GCGGCGATCT AGACGCGGTG AGGAGGGCCG TGTTGAGCAA GGCGGAGATC CCCGTGGGCA CCGTCCCCAT ATACCAAGCC TTTATCGAGG CCTTCGAGAA GAGGGGCGGC GGGGCTTACA TGACGGAGGA CCACCTGTTT GAGGTGGTGG AGAGGCAGTT GAAAGACGGC GTGTCGTTTA TGACGATACA CGCCGCGGTC ACGAGGGACC TGGCCTTGAA GGTGCTGAAG AGCGATAGGG TGATCCCCGT CGTGTCGCGC GGCGGCGACA TGGTCATCGG CTGGATGCTC TACAACGAGT CCGAGAACCC CTACCTCAAG AACTGGGACT ACCTCCTGGA GCTCTTCGCC GAGTACGACG CCACCATCTC CATAGGCGAC GCCCTGAGGC CGGGCGCCAT CGCAGACGCC CACGACGAGT TCCAGATAGC CGAGCTCGTC GAGGCGGCTA GGCTGGCCAA GAGGGCTATC AAGGCGGGGG TCCAGGTGAT GCTTGAGGGG CCGGGGCACG TGCCGCTGAA CGAGATCGTC TGGTCTATAA AGCTGGAGAA GAAGCTCACG GGGGGCGTCC CCTACTACGT CCTGGGGCCT CTGCCGACTG ACGTGGCCGC GCCCTACGAC CACATCGCCT CTGCGGTGGG CGCCGCCCTC GCCGCCGCCG CGGGGGCCGA CCTTCTGTGC TACATCACGC CGGCGGAGCA CCTCTCCCTG CCCACCGTCA AGCAGGTGGA GGAGGGGGTG AAGGCCTACA GGGTCGCGGC CCACATAGGA GACATCGTGA AGCTTGGGCC AAAGGCCTCG GGGTGGGATA GGGAGGTGAG CGTGTACAGG GGCAGGCTCG ACTGGGCCAA CATGATAAAC AAGCTCCTCG ACCCGGAGGC CGCGTGGGCG GTGTATAGGC AGTTCGGGGA GCCCAAGGTG AAGGGCTGCA CCATGTGCGG CAAGTACTGC CCCATGATGT GGGTGAAGGA GCAAGCGAGG AAAACCTCTT GA
|
Protein sequence | MWKTFLYGID QLFMAKTLIQ QAREGRAPPE LERVAKAEDV SVAKLRDRLA RGQAVVLTNA KSPPRRLTGV GKGLHTKVNV NLGTSSEVVD LGAELKKVEV ANRWGDTLMD LSVGGDLDAV RRAVLSKAEI PVGTVPIYQA FIEAFEKRGG GAYMTEDHLF EVVERQLKDG VSFMTIHAAV TRDLALKVLK SDRVIPVVSR GGDMVIGWML YNESENPYLK NWDYLLELFA EYDATISIGD ALRPGAIADA HDEFQIAELV EAARLAKRAI KAGVQVMLEG PGHVPLNEIV WSIKLEKKLT GGVPYYVLGP LPTDVAAPYD HIASAVGAAL AAAAGADLLC YITPAEHLSL PTVKQVEEGV KAYRVAAHIG DIVKLGPKAS GWDREVSVYR GRLDWANMIN KLLDPEAAWA VYRQFGEPKV KGCTMCGKYC PMMWVKEQAR KTS
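
The sequences above are printed in space-separated blocks of 10 letters, so they usually need to be joined before further use. The sketch below is a minimal, hedged illustration of that clean-up plus a GC-fraction calculation. The helper names are hypothetical, and the demo string is only the first three blocks of the gene sequence, not the whole gene.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence in this record (short demo only).
demo = join_blocks("ATGTGGAAAA CCTTTTTATA TGGAATCGAT")
print(len(demo))  # 30
print(demo[:3])   # ATG
```

Applied to the full gene sequence, `join_blocks` should return a string of 1332 letters if the record is internally consistent, and `gc_fraction` can then be compared with the reported GC content of 63%.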