Gene Cthe_1249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1249 
Symbol 
ID4809754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1513511 
End bp1514977 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content44% 
IMG OID640106672 
Productamidophosphoribosyltransferase 
Protein accessionYP_001037674 
Protein GI125973764 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase 
TIGRFAM ID[TIGR01134] amidophosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.917797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGATG TAATAAGGGG AAAGCATTTT AATTACGTAC ACAATTTGTT GAAAAAAGAT 
GAGTTTGGGT TTGACAAGCC GAAAGAAGAG TGTGGAGTTT TCGGTATTTA CAGTAAAGGT
AATTTGGATA CCGCACGTCT GACTTATTAT GCCCTTTATG CGCTTCAGCA CAGAGGACAG
GAAAGTGCGG GAATTGCCGT CAACAATGGT GGGACACTTC TCTTTCACAA GGACATGGGG
CTTGTTCCCG AGATTTTCAA TGAAAAAATT TTAAACAGTC TCAAAGGCAA AATTGCAATA
GGACATGTGA GATATTCGAC TACCGGTGCG AGCAGCAGGG AAAACTCCCA GCCTATGGTT
ATAAAGTACA AGAACGGACA GATGGCCATG GCTCATAACG GCAATCTTGT TAATGCCGCA
AAGATAAGAG AAAAACTTGA GGAAGAGGGT ATTATATTCC AGTCGACTAT AGATTCCGAG
GTAATTTTGA ATTTGATTTC AAGATTCAGG CTGACCAGCA ACAATATTGA AGAGGCCATT
GTCAAGGTAA TGAAGGAGAT AAAAGGTGCG TATTCGCTGG TTATTCTCAC ACCAAACAAG
CTTATTGGTA TCAGAGACCC TCACGGTATA AGACCGCTTT GCATCGGCCG TATAGATGAT
TCCTATGTTC TTGCTTCAGA GACTTGCGCT CTTGATGCAG TAGATGCCGA ATATGTAAGA
GATGTAAATC CCGGAGAGAT TATCGTTATT GAAGAGAGCG GAATGACTTC AATACAAACG
GAAGTTCCGG AAAAGACGGC ACTTTGCATT TTTGAGTATA TTTACTTTGC AAGACCCGAC
AGCTATATTG ACGGTGTAAG TGTTCACAGA GCGAGAATTG AGGCCGGAAG AAGGCTTGCC
CGGGAGCATC CTGTGGAAGC CGACCTTGTT TTCGGAGTTC CGGATTCGGG TGTATCCGCG
GCACTGGGTT ATTCCATGGA GTCGGGAATA CCTTATGATT TGGGACTTAT AAAAAACAAA
TATATCGGAA GAACCTTTAT TCAGCCGGAA CAGGGACAGA GGGAAAGCGG AGTGAAAATT
AAGCTTAATG CTTTGAAGGA AGCCGTTAAC GGTAAAAGGG TTGTTATGAT AGATGACTCA
ATAGTCAGAG GTACTACCAG CAAGAGACTT GTTCAAATTT TAAGGGATGC CGGTGCGAAG
GAAGTTCATA TGAGAATCAG CTCTCCGCCT TATATGTATC CATGTTTCTT TGGAGTTGAC
ACATCGAGCA GGTCCCAGCT TATTGCGGCG GAATGTTCCG TTGAGGAAAT CAGAAAGATG
ACAGGTGCGG ACAGCCTTGG GTACTTAAGT CTCGAAGGGC TCTTGAAAAC GCCGGTGGGA
GCAAAATGCG GTTTTTGTAC CGGATGCTTC ACAGGCAAAT ATCCGATGGA AGTACCTAAA
GATGCCAGCA AGTATAGTTG CGGGTAA
 
Protein sequence
MFDVIRGKHF NYVHNLLKKD EFGFDKPKEE CGVFGIYSKG NLDTARLTYY ALYALQHRGQ 
ESAGIAVNNG GTLLFHKDMG LVPEIFNEKI LNSLKGKIAI GHVRYSTTGA SSRENSQPMV
IKYKNGQMAM AHNGNLVNAA KIREKLEEEG IIFQSTIDSE VILNLISRFR LTSNNIEEAI
VKVMKEIKGA YSLVILTPNK LIGIRDPHGI RPLCIGRIDD SYVLASETCA LDAVDAEYVR
DVNPGEIIVI EESGMTSIQT EVPEKTALCI FEYIYFARPD SYIDGVSVHR ARIEAGRRLA
REHPVEADLV FGVPDSGVSA ALGYSMESGI PYDLGLIKNK YIGRTFIQPE QGQRESGVKI
KLNALKEAVN GKRVVMIDDS IVRGTTSKRL VQILRDAGAK EVHMRISSPP YMYPCFFGVD
TSSRSQLIAA ECSVEEIRKM TGADSLGYLS LEGLLKTPVG AKCGFCTGCF TGKYPMEVPK
DASKYSCG