Gene Cthe_1923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1923 
SymbolpyrG 
ID4810781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2293020 
End bp2294648 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content44% 
IMG OID640107340 
ProductCTP synthetase 
Protein accessionYP_001038335 
Protein GI125974425 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0504] CTP synthase (UTP-ammonia lyase) 
TIGRFAM ID[TIGR00337] CTP synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0447943 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGTCA AGTACATTTT TGTAACCGGC GGTGTTGTTT CGGGGCTTGG AAAGGGAATA 
ACTGCCGCCT CACTGGGAAG ACTTTTGAAA GCCAGAGGTC TTCGAGTCAC TATTCAAAAA
TTTGACCCGT ATATTAATGT CGACCCGGGT ACTATGAGCC CGTATCAGCA CGGAGAAGTG
TTTGTAACAG ATGACGGGGC TGAGACGGAT TTGGACCTTG GGCATTATGA AAGATTTATA
GATGAAAACT TGAGCAAAAA CAATAATGTC ACCACCGGTA AAATATACTG GTCCATTATC
AGCAAGGAAA GAAAAGGAGA TTTTCTCGGT GGTACGGTGC AAGTTATACC ACATGTAACC
AACGAGATTA AAAATAGAAT TTATTTAGTG GGCAAAGAAG GAAAAACCGA TGTGGTGATT
ACCGAAATCG GCGGTACTGT TGGCGATATT GAAAGCTTGC CTTTTTTAGA GGCCATTCGT
CAGGTGGCAA GTGACGTAGG CAGAGAGAAT GTGGTTTACA TACATGTTAC CCTTGTCCCT
TACCTCAGCA AGTCGGGAGA GCTTAAGACA AAGCCCACCC AGCACAGTGT CAAAGAGTTA
AGAAGCCTCG GTATACAGCC GGATATAATT GTATGCCGTA CGGAAAAGAG ACTTTCAAAG
GAATTGAAGG ACAAGATAGG ACTGTTTTGC AATATACCCG GAGAATGGGT AATTCAGAAT
CTTGACGCCG AGTCCCTTTA TGAGATTCCT TTGATGCTGG AGGAAGAAGG ACTGGCCAAC
ATTGTTTGTG AGCGTTTGAA TCTTGGATGC GTTAAGCCGG ATATGACAGA ATGGTGTGAG
TTGGTAAACA GGCAGAAAAA CTTAAGTAAG TCCTTGACCA TAGCTCTTGT GGGGAAATAC
GTTGAACTTC ACGATGCGTA TCTTAGTGTT GTAGAGTCTT TAAATCACGG AGGAATATAT
AATGACGCCG AGGTAAAGAT AAAGTGGGTG AATTCCGAGG AACTTACTGA AGACAATCTT
GAGGAAACTC TCTGTGATGT GGACGGAATA CTGGTTCCCG GCGGTTTTGG AGACAGGGGT
ATAGAGGGAA AGATTCTTGC AGCCAAATAT GCCCGTGAAA ACAAAGTTCC GTACTTTGGT
ATCTGCCTTG GAATGCAGAT GGCGGTTGTG GAGTTTGCAA GGAATGTAGC CGGGCTTAAG
ATGGCCAACA GCTCGGAATT TGACGCAAAC AGTCCGCATC CTGTAATTGA TCTCATGCCG
GAACAGAAGG ATATAGATGA AAAGGGAGGT ACAATGAGGC TTGGACTGTA CCCCTGCAAG
ATTATAAAAG ATTCTTCTGC TTATAGAATA TATGAAAGCG AGCTTATATA TGAAAGGCAC
AGACACAGAT ATGAGTTCAA CAATGAATAC AGGGAGCTTT TGACCTCAAA GGGACTGATT
CTGGCAGGTT TGTCTCCAAG TGAGAAGCTT GTGGAAATTA TTGAACTGAA AGATCATCCA
TGGTTTATCG GAGTTCAGTT CCATCCGGAG TTTAAATCAA GGCCGAACAG ACCCCACCCG
TTGTTTAAGG ACTTTATAAG GGCCGCGGTG GAAAGACGTT ACAAACAGCA GGAAAATAAA
AACGATTAA
 
Protein sequence
MSVKYIFVTG GVVSGLGKGI TAASLGRLLK ARGLRVTIQK FDPYINVDPG TMSPYQHGEV 
FVTDDGAETD LDLGHYERFI DENLSKNNNV TTGKIYWSII SKERKGDFLG GTVQVIPHVT
NEIKNRIYLV GKEGKTDVVI TEIGGTVGDI ESLPFLEAIR QVASDVGREN VVYIHVTLVP
YLSKSGELKT KPTQHSVKEL RSLGIQPDII VCRTEKRLSK ELKDKIGLFC NIPGEWVIQN
LDAESLYEIP LMLEEEGLAN IVCERLNLGC VKPDMTEWCE LVNRQKNLSK SLTIALVGKY
VELHDAYLSV VESLNHGGIY NDAEVKIKWV NSEELTEDNL EETLCDVDGI LVPGGFGDRG
IEGKILAAKY ARENKVPYFG ICLGMQMAVV EFARNVAGLK MANSSEFDAN SPHPVIDLMP
EQKDIDEKGG TMRLGLYPCK IIKDSSAYRI YESELIYERH RHRYEFNNEY RELLTSKGLI
LAGLSPSEKL VEIIELKDHP WFIGVQFHPE FKSRPNRPHP LFKDFIRAAV ERRYKQQENK
ND