Gene Tneu_0530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0530 
Symbol 
ID6165724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp479807 
End bp481138 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content63% 
IMG OID641667683 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001793919 
Protein GI171185000 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0702223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.172479 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAAAA CCTTTTTATA TGGAATCGAT CAGCTGTTTA TGGCAAAGAC GTTAATTCAG 
CAGGCCAGGG AGGGCAGAGC GCCTCCCGAG CTTGAGAGGG TGGCTAAGGC GGAGGACGTA
AGCGTGGCTA AGCTCCGGGA CCGCCTGGCT CGGGGGCAGG CGGTGGTTTT GACCAACGCC
AAGTCGCCGC CTAGGAGGCT CACCGGCGTG GGGAAGGGGC TACACACGAA GGTCAACGTC
AACCTGGGGA CCTCCTCGGA GGTGGTGGAC CTCGGGGCGG AGCTGAAGAA GGTGGAGGTG
GCGAATAGGT GGGGCGACAC GTTGATGGAT CTAAGCGTCG GCGGCGATCT AGACGCGGTG
AGGAGGGCCG TGTTGAGCAA GGCGGAGATC CCCGTGGGCA CCGTCCCCAT ATACCAAGCC
TTTATCGAGG CCTTCGAGAA GAGGGGCGGC GGGGCTTACA TGACGGAGGA CCACCTGTTT
GAGGTGGTGG AGAGGCAGTT GAAAGACGGC GTGTCGTTTA TGACGATACA CGCCGCGGTC
ACGAGGGACC TGGCCTTGAA GGTGCTGAAG AGCGATAGGG TGATCCCCGT CGTGTCGCGC
GGCGGCGACA TGGTCATCGG CTGGATGCTC TACAACGAGT CCGAGAACCC CTACCTCAAG
AACTGGGACT ACCTCCTGGA GCTCTTCGCC GAGTACGACG CCACCATCTC CATAGGCGAC
GCCCTGAGGC CGGGCGCCAT CGCAGACGCC CACGACGAGT TCCAGATAGC CGAGCTCGTC
GAGGCGGCTA GGCTGGCCAA GAGGGCTATC AAGGCGGGGG TCCAGGTGAT GCTTGAGGGG
CCGGGGCACG TGCCGCTGAA CGAGATCGTC TGGTCTATAA AGCTGGAGAA GAAGCTCACG
GGGGGCGTCC CCTACTACGT CCTGGGGCCT CTGCCGACTG ACGTGGCCGC GCCCTACGAC
CACATCGCCT CTGCGGTGGG CGCCGCCCTC GCCGCCGCCG CGGGGGCCGA CCTTCTGTGC
TACATCACGC CGGCGGAGCA CCTCTCCCTG CCCACCGTCA AGCAGGTGGA GGAGGGGGTG
AAGGCCTACA GGGTCGCGGC CCACATAGGA GACATCGTGA AGCTTGGGCC AAAGGCCTCG
GGGTGGGATA GGGAGGTGAG CGTGTACAGG GGCAGGCTCG ACTGGGCCAA CATGATAAAC
AAGCTCCTCG ACCCGGAGGC CGCGTGGGCG GTGTATAGGC AGTTCGGGGA GCCCAAGGTG
AAGGGCTGCA CCATGTGCGG CAAGTACTGC CCCATGATGT GGGTGAAGGA GCAAGCGAGG
AAAACCTCTT GA
 
Protein sequence
MWKTFLYGID QLFMAKTLIQ QAREGRAPPE LERVAKAEDV SVAKLRDRLA RGQAVVLTNA 
KSPPRRLTGV GKGLHTKVNV NLGTSSEVVD LGAELKKVEV ANRWGDTLMD LSVGGDLDAV
RRAVLSKAEI PVGTVPIYQA FIEAFEKRGG GAYMTEDHLF EVVERQLKDG VSFMTIHAAV
TRDLALKVLK SDRVIPVVSR GGDMVIGWML YNESENPYLK NWDYLLELFA EYDATISIGD
ALRPGAIADA HDEFQIAELV EAARLAKRAI KAGVQVMLEG PGHVPLNEIV WSIKLEKKLT
GGVPYYVLGP LPTDVAAPYD HIASAVGAAL AAAAGADLLC YITPAEHLSL PTVKQVEEGV
KAYRVAAHIG DIVKLGPKAS GWDREVSVYR GRLDWANMIN KLLDPEAAWA VYRQFGEPKV
KGCTMCGKYC PMMWVKEQAR KTS