Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1867 |
Symbol | |
ID | 4809198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2213439 |
End bp | 2214503 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107286 |
Product | carbamoyl-phosphate synthase small subunit |
Protein accession | YP_001038281 |
Protein GI | 125974371 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0505] Carbamoylphosphate synthase small subunit |
TIGRFAM ID | [TIGR01368] carbamoyl-phosphate synthase, small subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCTG TTTTGGTTTT GGAAGATGGA ACTTATTTTA CGGGGGAAGC CTTTGGCAAG ACCGGCGAAG TAGTTGGAGA AATTGTTTTT AACACATGCA TGACAGGTTA TCAGGAAATA CTGACCAACC CGTCTTACAA CGGTCAAATA GTTGCAATGA CCTATCCGTT GATAGGAAAC TACGGGTTCA ATCGGTATGA CAACGAATCC GACAGAATCC ATGTTCAGGG ATTCATTGTC AAAGAGCTTT CGGACACGCC CACCAACTGG CGTTGCGAAA TTACTCCCGA AGAATATTTC GTTACAAACG GAATTGTGGG AATCAAAGGC ATTGACACAA GAAACCTCAC CAAACACATA AGGAGCAAAG GCAGCATGCA CGGCATAATT TCCACCGAGT CAGGAAACAT AGATTTGCTC CTTGAAAAAC TTATGAAAAA GAAAACGGAG AAAAAGAATG CGGTAATGGA GGTTTCCACA AAGTCTCCAA TACACAAACC CGGCAGAGGA AAACGCGTGG TGGTAATGGA CTTCGGAGTA AAACACAGCA TTATAAAAGC GCTGGAGAAA CTTGACTGTG ATATATACAT TCTTCCGGCT TCATCCCCGG CAAATGAAAT TATGAGCTAC AATCCCGACG GTATACTTTT ATCCAACGGA CCAGGGGACC CGTCCGAACT TCCTTTTGTC AAGTCCACCG TACAGGAGCT TATAGGCAAA AAACCAATGC TCGGAATAGG ATTGGGCCAC CAGCTTTTAG GCCTTGCCCT TGGCGGCAAA GTAAAGAAGC TTCCCTTTGG TCACCATGGA GCAAATCAGC CCGTGAGGGA TTATATCAAA GGGAAATGTT ATGTAACTTC TCAATGCCAC AACTATGCGC TGGAAAATGA TTTTAGCGAT GATATATTTA TTACCCACAT AAATATTAAC GACAATACTG TGGAAGGCTT TAAGCACAAA CACCATCCTG TTTTGGGAGT TCAATACCAT CCCAAAGCCA TTTTGGGACA GGATGATTCA TCTTACATAT TTGATGAATT TATTAAAATG ATGGACAACC TATAA
|
Protein sequence | MKSVLVLEDG TYFTGEAFGK TGEVVGEIVF NTCMTGYQEI LTNPSYNGQI VAMTYPLIGN YGFNRYDNES DRIHVQGFIV KELSDTPTNW RCEITPEEYF VTNGIVGIKG IDTRNLTKHI RSKGSMHGII STESGNIDLL LEKLMKKKTE KKNAVMEVST KSPIHKPGRG KRVVVMDFGV KHSIIKALEK LDCDIYILPA SSPANEIMSY NPDGILLSNG PGDPSELPFV KSTVQELIGK KPMLGIGLGH QLLGLALGGK VKKLPFGHHG ANQPVRDYIK GKCYVTSQCH NYALENDFSD DIFITHININ DNTVEGFKHK HHPVLGVQYH PKAILGQDDS SYIFDEFIKM MDNL
|
| |