Gene Cphy_2301 details


Gene Information

Locus tag: Cphy_2301
Symbol:
ID: 5745360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 2833335
End bp: 2834510
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 36%
IMG OID: 641293391
Product: thiamine biosynthesis/tRNA modification protein ThiI
Protein accession: YP_001559401
Protein GI: 160880433
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
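
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(2833335, 2834510)
    print(length)                     # 1176
    print(protein_length_aa(length))  # 391
```

Under those assumptions both results agree with the Gene Length and Protein Length fields of the record.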


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.297058
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATAAAG CATTTTTAAT AAAATACGCA GAAATAGGTC TAAAAGGTAA AAATCGTCAC 
ATATTTGAGA ACGCTTTAAA AGATCAGATT CGTTTTAACC TAAATAAACT AGGAAACTTT
GAAGTCTCTA GAGAACAAGG ACGTGTTTTT GTAGAGTGTC CAGATGATTT CGATTACGAT
GAAACTGTTG CAGCATTACA AAGAGTATTT GGTATTACAG GAATAAGCCC AGTTATAGTA
ATTAATTCAA CCGATTGGGA AGATATTAAA CAAGAAGTTG GTGATTATGT AGAGAAATTT
TATGGACGTA AACCATTTAC GTTTAAAGTA GAAGCGAAAC GTGGTAACAA GCAGTATCCA
ATTCAATCTC CAGAGATTTG TTCCAAGATG GGAGCTTATT TATTAGACCG TTTTCCAGAA
CTATCAGTAG ATGTTCACAC ACCACAGGAA TATATTACCG TGGAAGTTCG TAACAAAGCT
TATGTATATT CTAACACCTT AAAAGGACCA GGCGGTATGC CGGTTGGTAC AGGTGGAAAA
GCGATGTTAT TACTATCTGG TGGTATTGAT AGCCCGGTAG CTGGATATAT GATTTCAAAA
CGTGGTGTAA CAATAGAAGC AACTTATTTT CATGCGCCTC CTTATACCAG TGAGCGAGCG
AAGCAAAAAG TAGTTGATTT AGCTAAGATA ATATCTGCTT ACACAGGACC TATTAAGCTC
CATGTCGTGA ATTTTACCGA TATTCAGTTA TATATTTATG AGAAGTGTCC TCATGAAGAA
TTAACAATCA TTATGCGCCG TTATATGATG AAAATTGCAG AGTCAATTGC AAATCGCAGT
AAGTGTCTTG GTTTAATTAC AGGTGAGAGT ATTGGTCAGG TAGCAAGTCA GACCATGCAA
TCCTTAGCTG CAACAAACGC TGTTTGTACA ATGCCAGTAT ATCGTCCGCT AATCGGTATG
GATAAGCAAG AGATTATCGA TATCTCCGAG CGAATTGGGA CTTTTGAAAC ATCAGTATTG
CCTTTTGAAG ATTGCTGTAC AATTTTCGTA GCAAAACATC CGGTAACAAG ACCAATCCTA
TCTGTAATAG AAAAGAATGA GCTTAATCTA TCAGAAAAAA TTGATGAATT GGTTAAGACA
GCGCTTGAGA CAAGAGAAGT TATTACAGTT AAGTAA
 
Protein sequence
MYKAFLIKYA EIGLKGKNRH IFENALKDQI RFNLNKLGNF EVSREQGRVF VECPDDFDYD 
ETVAALQRVF GITGISPVIV INSTDWEDIK QEVGDYVEKF YGRKPFTFKV EAKRGNKQYP
IQSPEICSKM GAYLLDRFPE LSVDVHTPQE YITVEVRNKA YVYSNTLKGP GGMPVGTGGK
AMLLLSGGID SPVAGYMISK RGVTIEATYF HAPPYTSERA KQKVVDLAKI ISAYTGPIKL
HVVNFTDIQL YIYEKCPHEE LTIIMRRYMM KIAESIANRS KCLGLITGES IGQVASQTMQ
SLAATNAVCT MPVYRPLIGM DKQEIIDISE RIGTFETSVL PFEDCCTIFV AKHPVTRPIL
SVIEKNELNL SEKIDELVKT ALETREVITV K
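
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the method used to produce the record: it uses the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard table in which codons may act as start codons), and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid
# assignments (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence from the record above.
    first_line = "ATGTATAAAG CATTTTTAAT AAAATACGCA GAAATAGGTC TAAAAGGTAA AAATCGTCAC"
    print(translate(first_line))  # MYKAFLIKYAEIGLKGKNRH
```

The printed residues match the first 20 residues of the protein sequence listed in the record.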