Gene Information |
Locus tag | Saro_3290 |
Symbol | |
ID | 3915937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3507383 |
End bp | 3508378 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640446075 |
Product | bifunctional sulfur carrier protein/thiazole synthase protein |
Protein accession | YP_498559 |
Protein GI | 87201302 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2022] Uncharacterized enzyme of thiazole biosynthesis [COG2104] Sulfur transfer protein involved in thiamine biosynthesis |
TIGRFAM ID | [TIGR01683] thiamine biosynthesis protein ThiS |
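The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Saro_3290.
length = gene_length_bp(3507383, 3508378)
print(length)                     # 996
print(protein_length_aa(length))  # 331
```

Both results agree with the Gene Length (996 bp) and Protein Length (331 aa) rows.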
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGAC AGCTATCCCT TACCGTCAAC GGCGAACCCC GCCGCGCCGC GCCCGGATCG ATCGCCGACC TGGTGCGCAG CCTGGAACTC GATCCGGCCA AGGTCGCGGT CGAACGCAAT GGCGAGATCG TCCCGCGCTC GACCTTGGCC AGCGTGGCGA TCGCCGATGG GGACGTGCTG GAAATCGTGC ATTTCGTGGG TGGAGGACAA TCGGACGTGA CCGACAACAA CGATACCTGG ACCGTCGCCG GACGCACCTT CACCTCGCGC CTGATCGTGG GCACGGGCAA GTACAAGGAC TTCGAGCAGA ACGCCGCCGC GGTCGAAGCA TCGGGCGCGG AGATCGTCAC CGTCGCCGTG CGCAGGGTCA ACGTCTCGGA CCCCAAGGCG CCGATGCTGA CCGACTACAT CGACCCGAAG AAAATCACCT ACCTGCCCAA CACCGCCGGC TGCTTTACCG CCGAGGACGC GATCCGCACG CTGCGCCTTG CGCGCGAGGC GGGCGGCTGG GATCTGGTGA AGCTGGAAGT CCTGGGCGAG GCGCGCACGC TCTATCCCAA CATGATCGAA ACGATCCGCG CGACCGAAGT CCTGGCCAAG GAAGGCTTCC TGCCAATGGT CTATTGCGTC GACGATCCGA TCGCTGCCAA GCAGCTTGAA GACGCGGGCG CGGTCGCCGT CATGCCACTG GGCGCGCCGA TCGGTTCGGG CCTCGGCATC CAGAACAAGG TAACGGTGCG GCTGATCGTC GAAGGCGCCA AGGTGCCGGT GCTCGTCGAC GCAGGCGTGG GCACCGCTTC CGAAGCCGCC GTGGCGATGG AGCTTGGCTG CGATGGCGTG CTGATGAACA CCGCCATCGC CGAGGCCAAG GACCCGATCC GCATGGCCCG CGCAATGAAG CTGGCCGTTC AGGCCGGACG CGACGCCTAT CTCGCCGGCC GCATGCCGAC GCGCAAGTAC GCCGATCCGT CGAGCCCGCT GGCCGGGTTG ATCTGA
|
Protein sequence | MTGQLSLTVN GEPRRAAPGS IADLVRSLEL DPAKVAVERN GEIVPRSTLA SVAIADGDVL EIVHFVGGGQ SDVTDNNDTW TVAGRTFTSR LIVGTGKYKD FEQNAAAVEA SGAEIVTVAV RRVNVSDPKA PMLTDYIDPK KITYLPNTAG CFTAEDAIRT LRLAREAGGW DLVKLEVLGE ARTLYPNMIE TIRATEVLAK EGFLPMVYCV DDPIAAKQLE DAGAVAVMPL GAPIGSGLGI QNKVTVRLIV EGAKVPVLVD AGVGTASEAA VAMELGCDGV LMNTAIAEAK DPIRMARAMK LAVQAGRDAY LAGRMPTRKY ADPSSPLAGL I
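The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method: it strips the block spacing, computes GC content, and translates codons using the amino acid assignments of translation table 11, which are the same as in the standard code. All names are hypothetical. It does not handle alternative start codons such as GTG or TTG, which table 11 allows as initiators; this gene starts with ATG, so that does not matter here.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGACCGGAC AGCTATCCCT TACCGTCAAC"
print(translate(first_blocks))  # MTGQLSLTVN
print(gc_percent("ATGC GGCC"))  # 75.0 (toy input, not from the record)
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` in the same way would allow a comparison with the GC content and Protein sequence rows.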
|
| |