Gene Cthe_0953 details

Gene Information

Locus tag: Cthe_0953
Symbol: pyrB
ID: 4811246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1142483
End bp: 1143421
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 42%
IMG OID: 640106372
Product: aspartate carbamoyltransferase catalytic subunit
Protein accession: YP_001037380
Protein GI: 125973470
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0540] Aspartate carbamoyltransferase, catalytic chain
TIGRFAM ID: [TIGR00670] aspartate carbamoyltransferase
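
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below is illustrative only and assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length. The variable names are not part of the record.

```python
# Values copied from the record above.
start_bp = 1142483
end_bp = 1143421

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "939 312"
```

Both values agree with the Gene Length and Protein Length fields.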


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0935774
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGATTTTAA AATCAAAAGA CCTCTTAGGA CTTAAGGATT TAACGGCAGA AGAAATACAA 
TACATTTTAA ATACTGCAAA AACAATGAAG GTTATTCTTT TATCCAAAAA CAAGAAGGCT
CCTCATTTGC AGGGAAAATC AATAATTACC CTGTTTTATG AGAACAGCAC AAGAACAAGA
TTATCCTTCG AGCTTGCTTC AAAATACTTA AGTGCCAATG CAGCAAATAT TTCGGTGGCC
GCAAGCAGCG TTGCCAAAGG TGAAACACTT ATAGACACCG GAAAGACAAT TGACATGATG
GGTGCGGATG TAATCGTAAT AAGGCATTCC ATGTCGGGAG CGCCTCACCT TCTTGCCAGG
AATGTAAAGG CATCGGTAAT AAATGCAGGT GACGGTATGA ATGAACATCC TACCCAGGCA
CTTCTTGACA TGTTCACTAT TATTGAGAAA AAAGGAAGTC TTAAGGGGTT GAAGGTTGCC
ATAATCGGTG ATATATATCA CAGCAGAGTG GCAAGGAGCA ACATCTGGGG GATGACCAAG
CTTGGTGCAG AAGTAAGTGT TGCAGGACCC TCCACCCTTA TGCCTCCGGA ATTGGACAAG
ACCGGCGTGA AAGTCTTTAC CACTGTCCAG GAAGCTTTGA TTGACGCAGA TGTGGTTATG
GGACTTAGAA TTCAGAAGGA AAGACAAAAA AGTGGCTTGT TCCCGAGCCT GAGAGAATAT
TCGAGATTTT TCGGATTGGA TGAAAAGCGC TTGAAACTTG CAAAAGAGGA TGCTCTGATA
CTGCATCCGG GACCGGTTAA CAGAGGAGTT GAATTACCGT CGTCGGTAAT CGATTCCGAA
AGGTCGTTTA TAAACGAACA GGTTACCAAC GGAGTTGCTG TAAGGATGGC TCTTCTTTAT
CTTTTAACAA GGAGGGATAG CGGTGAGAGT GTTAATTAA
 
Protein sequence
MILKSKDLLG LKDLTAEEIQ YILNTAKTMK VILLSKNKKA PHLQGKSIIT LFYENSTRTR 
LSFELASKYL SANAANISVA ASSVAKGETL IDTGKTIDMM GADVIVIRHS MSGAPHLLAR
NVKASVINAG DGMNEHPTQA LLDMFTIIEK KGSLKGLKVA IIGDIYHSRV ARSNIWGMTK
LGAEVSVAGP STLMPPELDK TGVKVFTTVQ EALIDADVVM GLRIQKERQK SGLFPSLREY
SRFFGLDEKR LKLAKEDALI LHPGPVNRGV ELPSSVIDSE RSFINEQVTN GVAVRMALLY
LLTRRDSGES VN
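
As a rough consistency check, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal example, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGATTTTAA AATCAAAAGA CCTCTTAGGA CTTAAGGATT TAACGGCAGA AGAAATACAA"
last_line = "CTTTTAACAA GGAGGGATAG CGGTGAGAGT GTTAATTAA"

print(translate(first_line))  # prints "MILKSKDLLGLKDLTAEEIQ"
print(translate(last_line))   # prints "LLTRRDSGESVN"
```

The two outputs match the first 20 and last 12 residues of the protein sequence above. The last line starts in frame because the 15 preceding lines hold 900 bases, a multiple of three.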