Gene Information |
Locus tag | Cphy_2301 |
Symbol | |
ID | 5745360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 2833335 |
End bp | 2834510 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641293391 |
Product | thiamine biosynthesis/tRNA modification protein ThiI |
Protein accession | YP_001559401 |
Protein GI | 160880433 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
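The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single untranslated stop codon. Both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2833335, 2834510)
print(length)                     # 1176
print(protein_length_aa(length))  # 391
```

Both results match the Gene Length (1176 bp) and Protein Length (391 aa) fields of the record.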
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.297058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAAAG CATTTTTAAT AAAATACGCA GAAATAGGTC TAAAAGGTAA AAATCGTCAC ATATTTGAGA ACGCTTTAAA AGATCAGATT CGTTTTAACC TAAATAAACT AGGAAACTTT GAAGTCTCTA GAGAACAAGG ACGTGTTTTT GTAGAGTGTC CAGATGATTT CGATTACGAT GAAACTGTTG CAGCATTACA AAGAGTATTT GGTATTACAG GAATAAGCCC AGTTATAGTA ATTAATTCAA CCGATTGGGA AGATATTAAA CAAGAAGTTG GTGATTATGT AGAGAAATTT TATGGACGTA AACCATTTAC GTTTAAAGTA GAAGCGAAAC GTGGTAACAA GCAGTATCCA ATTCAATCTC CAGAGATTTG TTCCAAGATG GGAGCTTATT TATTAGACCG TTTTCCAGAA CTATCAGTAG ATGTTCACAC ACCACAGGAA TATATTACCG TGGAAGTTCG TAACAAAGCT TATGTATATT CTAACACCTT AAAAGGACCA GGCGGTATGC CGGTTGGTAC AGGTGGAAAA GCGATGTTAT TACTATCTGG TGGTATTGAT AGCCCGGTAG CTGGATATAT GATTTCAAAA CGTGGTGTAA CAATAGAAGC AACTTATTTT CATGCGCCTC CTTATACCAG TGAGCGAGCG AAGCAAAAAG TAGTTGATTT AGCTAAGATA ATATCTGCTT ACACAGGACC TATTAAGCTC CATGTCGTGA ATTTTACCGA TATTCAGTTA TATATTTATG AGAAGTGTCC TCATGAAGAA TTAACAATCA TTATGCGCCG TTATATGATG AAAATTGCAG AGTCAATTGC AAATCGCAGT AAGTGTCTTG GTTTAATTAC AGGTGAGAGT ATTGGTCAGG TAGCAAGTCA GACCATGCAA TCCTTAGCTG CAACAAACGC TGTTTGTACA ATGCCAGTAT ATCGTCCGCT AATCGGTATG GATAAGCAAG AGATTATCGA TATCTCCGAG CGAATTGGGA CTTTTGAAAC ATCAGTATTG CCTTTTGAAG ATTGCTGTAC AATTTTCGTA GCAAAACATC CGGTAACAAG ACCAATCCTA TCTGTAATAG AAAAGAATGA GCTTAATCTA TCAGAAAAAA TTGATGAATT GGTTAAGACA GCGCTTGAGA CAAGAGAAGT TATTACAGTT AAGTAA
Protein sequence | MYKAFLIKYA EIGLKGKNRH IFENALKDQI RFNLNKLGNF EVSREQGRVF VECPDDFDYD ETVAALQRVF GITGISPVIV INSTDWEDIK QEVGDYVEKF YGRKPFTFKV EAKRGNKQYP IQSPEICSKM GAYLLDRFPE LSVDVHTPQE YITVEVRNKA YVYSNTLKGP GGMPVGTGGK AMLLLSGGID SPVAGYMISK RGVTIEATYF HAPPYTSERA KQKVVDLAKI ISAYTGPIKL HVVNFTDIQL YIYEKCPHEE LTIIMRRYMM KIAESIANRS KCLGLITGES IGQVASQTMQ SLAATNAVCT MPVYRPLIGM DKQEIIDISE RIGTFETSVL PFEDCCTIFV AKHPVTRPIL SVIEKNELNL SEKIDELVKT ALETREVITV K
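The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so a standard codon table is used here. `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGTATAAAG CATTTTTAAT AAAATACGCA"))  # MYKAFLIKYA
```

Passing the full gene sequence to `translate` in the same way should yield the listed protein sequence once the spacing is removed.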