Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1633 |
Symbol | thiG |
ID | 3705697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1824490 |
End bp | 1825533 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637738106 |
Product | bifunctional sulfur carrier protein/thiazole synthase protein |
Protein accession | YP_343635 |
Protein GI | 77165110 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2022] Uncharacterized enzyme of thiazole biosynthesis [COG2104] Sulfur transfer protein involved in thiamine biosynthesis |
TIGRFAM ID | [TIGR01683] thiamine biosynthesis protein ThiS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATTT TATTAAACGG TAAAATTCAC CAGGTACCAG AAAACTGCCT GATTTCAGAG CTCATCGCCC TACTGAAACT CCAGGGAAAA CGGCTTGCCG TGGAAGTCAA TCAGGAAATC GTATCCCGTA GCGAATATAC GCAGCGGCCA CTTCAATCTG GCGATAAAGT GGAGATCGTC TACGCTATCG GTGGTGGTTC AGACTCTGGC GCTACCGCTA GTCCCTTAAC ACAAAACGCC AAAACGGAAG AGATAATGAG CACATTGGAT ACTCCCTTAG TGGTTGCGGG CAAAACCTAC CATTCTCGAC TCATGGTAGG CACCGGTAAA TACCAGGATC TGGAAGAAAC TCAGAACGCT ATCCAGGCCA GCGGCGCGGA GATCGTCACT ATTGCTATTC GTCGGAGTAA TATTGGGCAA AATCCGGGGG AGCCAAATCT ACTCGATGTC ATATCGCCGC ATTGCTATAC ACTCTTGCCC AATACCGCCG GTTGCTATAA TGCCAAGGAG GCAGTACGTA CCTGCCGCTT GGCTCGAGAG CTGCTAGATG GCCATAGCTT GGTAAAGCTG GAAGTTCTAG GAGATGAAAA AACTCTATTC CCAGATCTAG TAGAAACCTA CCAGGCCGCT GAAGTGCTTA TCAAGGAAGA CTTTCAAGTG ATGGTCTATA CTAATGACGA TCCCATTGCC GCCAAACGCT TGGAAGAGAT GGGATGCGTC GCGGTCATGC CCCTGGCGGC ACCCATTGGC TCTGGGCTAG GCATTCGAAA TCCCTACAAT ATCCTTGAAA TTGTCCAGAA TGCCACCGTG CCTATCCTGG TAGATGCGGG CGTTGGCACC GCTTCCGATG CGGCGGTAGC CATGGAACTA GGCTGCGATG GAGTACTCAT GAATACCGCC ATTGCCGGGG CTCAAAACCC TATTTTGATG GCTTCGGCAA TGAAAAAGGC GGTGGAAGCA GGTCGTGACG CCTACCTGGC CGGGCGTATC CCCCGGAGAC GCTATGCCAG CGCCTCCTCC CCCCTCGAGG GCACCTTCTT TTAA
|
Protein sequence | MEILLNGKIH QVPENCLISE LIALLKLQGK RLAVEVNQEI VSRSEYTQRP LQSGDKVEIV YAIGGGSDSG ATASPLTQNA KTEEIMSTLD TPLVVAGKTY HSRLMVGTGK YQDLEETQNA IQASGAEIVT IAIRRSNIGQ NPGEPNLLDV ISPHCYTLLP NTAGCYNAKE AVRTCRLARE LLDGHSLVKL EVLGDEKTLF PDLVETYQAA EVLIKEDFQV MVYTNDDPIA AKRLEEMGCV AVMPLAAPIG SGLGIRNPYN ILEIVQNATV PILVDAGVGT ASDAAVAMEL GCDGVLMNTA IAGAQNPILM ASAMKKAVEA GRDAYLAGRI PRRRYASASS PLEGTFF
|
| |