Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1248 |
Symbol | |
ID | 4809753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1512396 |
End bp | 1513418 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106671 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_001037673 |
Protein GI | 125973763 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00288462 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACAT ATAAAGATGC GGGTGTTGAT GTGGAAGCCG GTTATGAAGC GGTCAGGCTT ATGAGAAATG ATGTAAAAAG GACATTCAGG CCTGAGGTGC TTACTGATAT AGGTGGCTTT GGTGGATTGT TTGGCTTAAA CAAAGATAAA TATTCGGAAC CCGTTCTTGT ATCAGGAACT GATGGCGTGG GTACAAAACT GAAAATTGCT TTCCTGCTGG ACAAGCATGA TACCGTTGGT ATTGACTGTG TGGCGATGTG TGTAAATGAT ATTGTGTGCA GTGGTGCGGA ACCTCTGTTT TTCCTCGACT ATATAGCCTT GGGCAAAAAC CGTCCCGAAA AAGTGGCTCA GATTGTGAAA GGTATAGCCG ACGGATGTGT TGAAGCAGGA TGTGCCCTAA TCGGAGGAGA AACGGCGGAA ATGCCGGGAT TTTATCCTGA GGATGAATAT GATTTGGCCG GATTTGCGGT CGGAATAGTG GAAAAAAGCA AGATTATAGA CGGCAGTAAA ATCAAGGCGG GGGACAAATT AATAGGACTT GCGTCATCAG GTATTCACAG CAATGGATAT TCCCTTGTAA GGAAGATTTT GGCGCCTACT GCGAAAAAAC TTGCGGAAGA GATTAAGATG CTTGGAACCA CTTTGGGTGA AGAGCTTATA AAGCCCACAA GACTGTATGT CAAGACGATC CTGGATTTGA AAGAAAAGTT TGAAATCAAG GGAATTGCCC ATATTACAGG CGGAGGATTC ATTGAAAACA TACCGAGAAT GCTGCCTCAA GGTTTGGGAG TCAAAGTAGT CAGGGGTAGC TGGCCTGTAC TTCCGATATT CACTCTCTTA AAAGATCTTG GAAACCTTGA CGAAATGGAT ATGTACAATA CCTTTAACAT GGGAATAGGT ATGACAATTG CCGTGGATGC TGAAATTGCA AACAGTGTTG TGGAGTATTT GAACAAGGAT AAAGAGCAGG CTTACATAAT CGGAGAAGTT GTATCAGACA AGGAAGGGCT TGAAATATGT TAA
|
Protein sequence | MTTYKDAGVD VEAGYEAVRL MRNDVKRTFR PEVLTDIGGF GGLFGLNKDK YSEPVLVSGT DGVGTKLKIA FLLDKHDTVG IDCVAMCVND IVCSGAEPLF FLDYIALGKN RPEKVAQIVK GIADGCVEAG CALIGGETAE MPGFYPEDEY DLAGFAVGIV EKSKIIDGSK IKAGDKLIGL ASSGIHSNGY SLVRKILAPT AKKLAEEIKM LGTTLGEELI KPTRLYVKTI LDLKEKFEIK GIAHITGGGF IENIPRMLPQ GLGVKVVRGS WPVLPIFTLL KDLGNLDEMD MYNTFNMGIG MTIAVDAEIA NSVVEYLNKD KEQAYIIGEV VSDKEGLEIC
|
| |