Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_0503 |
Symbol | |
ID | 3965617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | + |
Start bp | 615993 |
End bp | 617444 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637919566 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_525979 |
Protein GI | 90020152 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000000447545 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATTTG TTGTTCGTTT ATTCCCTGAA ATATCCATCA AAAGTGCCCC CGTGCGCAAG CGCTGGACAA AAATGCTAAC CGATAACTTA CGGTTGATGG CTAAGCGTAT CCATCCAACC GCTAGGGTGA ACAAAGAGTG GGATCGTATC GAGGTGACGG CCAAGGTGGA TGACCCCGTT GTGGAGGGGC AGTTAATCGA TATGTTGGCG CGTACACCGG GAATTGCGAA CTTTTCCCAT GTGCAAACGC ACCCTTTTGA ATCATTGCAT GATATTTACG AGTTGGTACA AGCATCATGG GGTGACCAAC TTAAGGGCAA AACCTTTTGT GTGCGAGTAA AGCGCACTGG CAACCACGAT TTTACTTCTA CCGAAGTCGA GCGCTATGTG GGTGGTGGCC TTAATCAAAA TAACCCCACT GGCGGCGTAA AACTAAAAGA CCCTGATGTG TCTATTGGTT TGGAAGTAAA AGATGATCAG GTGTATTTGG TAACCAAAAA ATATCAGGGC TTAGGCGGCT TCCCCATGGG GACGCAGGAG TCTGTGCTAT CTCTTATCTC TGGCGGCTAC GATTCTACCG TTGCCAGCTT TCAAATGATC AAGCGCGGTT TGCGCACCCA TTACTGCTTT TTTAATTTAG GGGGGCGCGA GCACGAGCTA GCAGTTAAAG AAATCGCGTT CTTTTTATGG AATCGATTCG GTTCTACTCA CCGCGTGCGT TTTATCTCGG TGCCGTTTGA AGGTGTTGTA GGGGAAATTC TGCAAAAGGT TGGCCCTTCA AATATGGGCG TAGTACTTAA GCGTATGATG TTGCGTGCAG GTGAGCGCAT TGCCGAGCGC GGCGGTATTG AGGCAATGGT GACCGGCGAG GCGGTGGCGC AGGTATCGAG CCAAACCATT CCCAACTTAT CTGTGATTGA TAGCGTTACC GATATGATGG TATTGCGACC GCTTATTGTG ATGGATAAAC GCGATATTAT TGATATCTCC CGCAAAATTG GCGCAGAAGA GTTCTCTGCT GCTGTGCCTG AGTATTGCGG TGTTATTTCG GTAAAACCCT CTGCAAAAGT GAATCGTGCC AAGCTTGAGG CGGAAGAAGA AAAGTTTGAT TTTTCTATCC TCGAACACGC GCTAGAAAAC GCGGTAGTGC AATCTATCGA TGAGGTGATG GATGATGCGC AAGAGCTTGC CGAGGTTGAA TTGGTGTCGG AATTGCCCGT CAGTGCCAAA GTTATTGATA TTCGCCATCA CACCGAGCAA GAACTGCGAC CCCTTACTGT TGAAGGGCGC GAAGTGTTGG AAATACCTTT TTACCAATTA AGCACAGCTT ATGCAGAGTT GGATAAAGCC GTTAACTATT ATTTATTTTG CGATAAGGGG GTAATGAGCG GGCTGCATGC TCGGCATTTG CTAGATGCTG GCTATACTAA TGTAGGTGTT TATCGGCCTT AG
|
Protein sequence | MRFVVRLFPE ISIKSAPVRK RWTKMLTDNL RLMAKRIHPT ARVNKEWDRI EVTAKVDDPV VEGQLIDMLA RTPGIANFSH VQTHPFESLH DIYELVQASW GDQLKGKTFC VRVKRTGNHD FTSTEVERYV GGGLNQNNPT GGVKLKDPDV SIGLEVKDDQ VYLVTKKYQG LGGFPMGTQE SVLSLISGGY DSTVASFQMI KRGLRTHYCF FNLGGREHEL AVKEIAFFLW NRFGSTHRVR FISVPFEGVV GEILQKVGPS NMGVVLKRMM LRAGERIAER GGIEAMVTGE AVAQVSSQTI PNLSVIDSVT DMMVLRPLIV MDKRDIIDIS RKIGAEEFSA AVPEYCGVIS VKPSAKVNRA KLEAEEEKFD FSILEHALEN AVVQSIDEVM DDAQELAEVE LVSELPVSAK VIDIRHHTEQ ELRPLTVEGR EVLEIPFYQL STAYAELDKA VNYYLFCDKG VMSGLHARHL LDAGYTNVGV YRP
|
| |