Gene Cthe_1867 details


Gene Information

Locus tag: Cthe_1867
Symbol:
ID: 4809198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2213439
End bp: 2214503
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 42%
IMG OID: 640107286
Product: carbamoyl-phosphate synthase small subunit
Protein accession: YP_001038281
Protein GI: 125974371
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0505] Carbamoylphosphate synthase small subunit
TIGRFAM ID: [TIGR01368] carbamoyl-phosphate synthase, small subunit
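
The length fields in this record agree with its coordinates if the Start bp and End bp positions are read as inclusive and the protein length excludes a single terminal stop codon. Both readings are assumptions, not statements from this page, and the function names below are hypothetical. A minimal Python sketch of that arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2213439, 2214503)
print(length, protein_length(length))  # 1065 354
```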


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATCTG TTTTGGTTTT GGAAGATGGA ACTTATTTTA CGGGGGAAGC CTTTGGCAAG 
ACCGGCGAAG TAGTTGGAGA AATTGTTTTT AACACATGCA TGACAGGTTA TCAGGAAATA
CTGACCAACC CGTCTTACAA CGGTCAAATA GTTGCAATGA CCTATCCGTT GATAGGAAAC
TACGGGTTCA ATCGGTATGA CAACGAATCC GACAGAATCC ATGTTCAGGG ATTCATTGTC
AAAGAGCTTT CGGACACGCC CACCAACTGG CGTTGCGAAA TTACTCCCGA AGAATATTTC
GTTACAAACG GAATTGTGGG AATCAAAGGC ATTGACACAA GAAACCTCAC CAAACACATA
AGGAGCAAAG GCAGCATGCA CGGCATAATT TCCACCGAGT CAGGAAACAT AGATTTGCTC
CTTGAAAAAC TTATGAAAAA GAAAACGGAG AAAAAGAATG CGGTAATGGA GGTTTCCACA
AAGTCTCCAA TACACAAACC CGGCAGAGGA AAACGCGTGG TGGTAATGGA CTTCGGAGTA
AAACACAGCA TTATAAAAGC GCTGGAGAAA CTTGACTGTG ATATATACAT TCTTCCGGCT
TCATCCCCGG CAAATGAAAT TATGAGCTAC AATCCCGACG GTATACTTTT ATCCAACGGA
CCAGGGGACC CGTCCGAACT TCCTTTTGTC AAGTCCACCG TACAGGAGCT TATAGGCAAA
AAACCAATGC TCGGAATAGG ATTGGGCCAC CAGCTTTTAG GCCTTGCCCT TGGCGGCAAA
GTAAAGAAGC TTCCCTTTGG TCACCATGGA GCAAATCAGC CCGTGAGGGA TTATATCAAA
GGGAAATGTT ATGTAACTTC TCAATGCCAC AACTATGCGC TGGAAAATGA TTTTAGCGAT
GATATATTTA TTACCCACAT AAATATTAAC GACAATACTG TGGAAGGCTT TAAGCACAAA
CACCATCCTG TTTTGGGAGT TCAATACCAT CCCAAAGCCA TTTTGGGACA GGATGATTCA
TCTTACATAT TTGATGAATT TATTAAAATG ATGGACAACC TATAA
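
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation such as a GC fraction. The sketch below shows one way to do that in Python; the helper names are hypothetical, and the example runs on only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_blocks = "ATGAAATCTG TTTTGGTTTT"  # first two blocks of the gene sequence
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # 20 0.25
```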
 
Protein sequence
MKSVLVLEDG TYFTGEAFGK TGEVVGEIVF NTCMTGYQEI LTNPSYNGQI VAMTYPLIGN 
YGFNRYDNES DRIHVQGFIV KELSDTPTNW RCEITPEEYF VTNGIVGIKG IDTRNLTKHI
RSKGSMHGII STESGNIDLL LEKLMKKKTE KKNAVMEVST KSPIHKPGRG KRVVVMDFGV
KHSIIKALEK LDCDIYILPA SSPANEIMSY NPDGILLSNG PGDPSELPFV KSTVQELIGK
KPMLGIGLGH QLLGLALGGK VKKLPFGHHG ANQPVRDYIK GKCYVTSQCH NYALENDFSD
DIFITHININ DNTVEGFKHK HHPVLGVQYH PKAILGQDDS SYIFDEFIKM MDNL
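
The protein sequence can be related back to the gene sequence by codon translation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in which start codons are permitted), so the sketch below builds the standard table. It is an illustration with a hypothetical `translate` helper, not the method this database uses, and it is shown on the first three blocks of the gene sequence only.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAATCTG TTTTGGTTTT GGAAGATGGA"))  # MKSVLVLEDG
```

The printed residues match the first 10 residues of the protein sequence listed above.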