Gene Cthe_1248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1248 
Symbol 
ID4809753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1512396 
End bp1513418 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content43% 
IMG OID640106671 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_001037673 
Protein GI125973763 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00288462 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACAT ATAAAGATGC GGGTGTTGAT GTGGAAGCCG GTTATGAAGC GGTCAGGCTT 
ATGAGAAATG ATGTAAAAAG GACATTCAGG CCTGAGGTGC TTACTGATAT AGGTGGCTTT
GGTGGATTGT TTGGCTTAAA CAAAGATAAA TATTCGGAAC CCGTTCTTGT ATCAGGAACT
GATGGCGTGG GTACAAAACT GAAAATTGCT TTCCTGCTGG ACAAGCATGA TACCGTTGGT
ATTGACTGTG TGGCGATGTG TGTAAATGAT ATTGTGTGCA GTGGTGCGGA ACCTCTGTTT
TTCCTCGACT ATATAGCCTT GGGCAAAAAC CGTCCCGAAA AAGTGGCTCA GATTGTGAAA
GGTATAGCCG ACGGATGTGT TGAAGCAGGA TGTGCCCTAA TCGGAGGAGA AACGGCGGAA
ATGCCGGGAT TTTATCCTGA GGATGAATAT GATTTGGCCG GATTTGCGGT CGGAATAGTG
GAAAAAAGCA AGATTATAGA CGGCAGTAAA ATCAAGGCGG GGGACAAATT AATAGGACTT
GCGTCATCAG GTATTCACAG CAATGGATAT TCCCTTGTAA GGAAGATTTT GGCGCCTACT
GCGAAAAAAC TTGCGGAAGA GATTAAGATG CTTGGAACCA CTTTGGGTGA AGAGCTTATA
AAGCCCACAA GACTGTATGT CAAGACGATC CTGGATTTGA AAGAAAAGTT TGAAATCAAG
GGAATTGCCC ATATTACAGG CGGAGGATTC ATTGAAAACA TACCGAGAAT GCTGCCTCAA
GGTTTGGGAG TCAAAGTAGT CAGGGGTAGC TGGCCTGTAC TTCCGATATT CACTCTCTTA
AAAGATCTTG GAAACCTTGA CGAAATGGAT ATGTACAATA CCTTTAACAT GGGAATAGGT
ATGACAATTG CCGTGGATGC TGAAATTGCA AACAGTGTTG TGGAGTATTT GAACAAGGAT
AAAGAGCAGG CTTACATAAT CGGAGAAGTT GTATCAGACA AGGAAGGGCT TGAAATATGT
TAA
 
Protein sequence
MTTYKDAGVD VEAGYEAVRL MRNDVKRTFR PEVLTDIGGF GGLFGLNKDK YSEPVLVSGT 
DGVGTKLKIA FLLDKHDTVG IDCVAMCVND IVCSGAEPLF FLDYIALGKN RPEKVAQIVK
GIADGCVEAG CALIGGETAE MPGFYPEDEY DLAGFAVGIV EKSKIIDGSK IKAGDKLIGL
ASSGIHSNGY SLVRKILAPT AKKLAEEIKM LGTTLGEELI KPTRLYVKTI LDLKEKFEIK
GIAHITGGGF IENIPRMLPQ GLGVKVVRGS WPVLPIFTLL KDLGNLDEMD MYNTFNMGIG
MTIAVDAEIA NSVVEYLNKD KEQAYIIGEV VSDKEGLEIC