Gene Noc_1633 details


Gene Information

Locus tag: Noc_1633
Symbol: thiG
ID: 3705697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1824490
End bp: 1825533
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 52%
IMG OID: 637738106
Product: bifunctional sulfur carrier protein/thiazole synthase protein
Protein accession: YP_343635
Protein GI: 77165110
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG2022] Uncharacterized enzyme of thiazole biosynthesis
        [COG2104] Sulfur transfer protein involved in thiamine biosynthesis
TIGRFAM ID: [TIGR01683] thiamine biosynthesis protein ThiS
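
The coordinates and lengths listed for this gene can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the source database: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1824490, 1825533)
print(length, protein_length(length))
```

Under these assumptions the sketch gives 1044 bp and 347 aa, which agrees with the Gene Length and Protein Length values in the record.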


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAATTT TATTAAACGG TAAAATTCAC CAGGTACCAG AAAACTGCCT GATTTCAGAG 
CTCATCGCCC TACTGAAACT CCAGGGAAAA CGGCTTGCCG TGGAAGTCAA TCAGGAAATC
GTATCCCGTA GCGAATATAC GCAGCGGCCA CTTCAATCTG GCGATAAAGT GGAGATCGTC
TACGCTATCG GTGGTGGTTC AGACTCTGGC GCTACCGCTA GTCCCTTAAC ACAAAACGCC
AAAACGGAAG AGATAATGAG CACATTGGAT ACTCCCTTAG TGGTTGCGGG CAAAACCTAC
CATTCTCGAC TCATGGTAGG CACCGGTAAA TACCAGGATC TGGAAGAAAC TCAGAACGCT
ATCCAGGCCA GCGGCGCGGA GATCGTCACT ATTGCTATTC GTCGGAGTAA TATTGGGCAA
AATCCGGGGG AGCCAAATCT ACTCGATGTC ATATCGCCGC ATTGCTATAC ACTCTTGCCC
AATACCGCCG GTTGCTATAA TGCCAAGGAG GCAGTACGTA CCTGCCGCTT GGCTCGAGAG
CTGCTAGATG GCCATAGCTT GGTAAAGCTG GAAGTTCTAG GAGATGAAAA AACTCTATTC
CCAGATCTAG TAGAAACCTA CCAGGCCGCT GAAGTGCTTA TCAAGGAAGA CTTTCAAGTG
ATGGTCTATA CTAATGACGA TCCCATTGCC GCCAAACGCT TGGAAGAGAT GGGATGCGTC
GCGGTCATGC CCCTGGCGGC ACCCATTGGC TCTGGGCTAG GCATTCGAAA TCCCTACAAT
ATCCTTGAAA TTGTCCAGAA TGCCACCGTG CCTATCCTGG TAGATGCGGG CGTTGGCACC
GCTTCCGATG CGGCGGTAGC CATGGAACTA GGCTGCGATG GAGTACTCAT GAATACCGCC
ATTGCCGGGG CTCAAAACCC TATTTTGATG GCTTCGGCAA TGAAAAAGGC GGTGGAAGCA
GGTCGTGACG CCTACCTGGC CGGGCGTATC CCCCGGAGAC GCTATGCCAG CGCCTCCTCC
CCCCTCGAGG GCACCTTCTT TTAA
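
The GC content reported for this gene can in principle be recomputed from the sequence above. The sketch below is one possible way to do so in Python; `gc_percent` is a hypothetical helper, and it assumes that whitespace and any character other than A, C, G, or T should simply be ignored.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace (such as the 10-base grouping used in this record) and any
    other non-ACGT characters are skipped; this is an assumption.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first block of the gene sequence.
print(gc_percent("ATGGAAATTT"))
```

Pasting the full gene sequence into the function should give a value that can be compared with the GC content listed in the gene information.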
 
Protein sequence
MEILLNGKIH QVPENCLISE LIALLKLQGK RLAVEVNQEI VSRSEYTQRP LQSGDKVEIV 
YAIGGGSDSG ATASPLTQNA KTEEIMSTLD TPLVVAGKTY HSRLMVGTGK YQDLEETQNA
IQASGAEIVT IAIRRSNIGQ NPGEPNLLDV ISPHCYTLLP NTAGCYNAKE AVRTCRLARE
LLDGHSLVKL EVLGDEKTLF PDLVETYQAA EVLIKEDFQV MVYTNDDPIA AKRLEEMGCV
AVMPLAAPIG SGLGIRNPYN ILEIVQNATV PILVDAGVGT ASDAAVAMEL GCDGVLMNTA
IAGAQNPILM ASAMKKAVEA GRDAYLAGRI PRRRYASASS PLEGTFF
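
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following Python sketch is illustrative only: `translate` is a hypothetical helper, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()  # drop the 10-base grouping
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGAAATTT TATTAAACGG TAAAATTCAC"))
```

The first 30 bases translate to MEILLNGKIH, matching the start of the protein sequence, and the final 24 bases translate to PLEGTFF followed by a TAA stop codon, matching its end.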