Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0950 |
Symbol | |
ID | 4811243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1139034 |
End bp | 1140107 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106369 |
Product | carbamoyl-phosphate synthase small subunit |
Protein accession | YP_001037377 |
Protein GI | 125973467 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0505] Carbamoylphosphate synthase small subunit |
TIGRFAM ID | [TIGR01368] carbamoyl-phosphate synthase, small subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCTA TTTTAGCTCT TGAAGACGGC ACAATTTTTC ACGGCGAAAG TTTCGGAGCT CAGGGAGAAG TAATCGGAGA GATTGTTTTC AACACAGGTA TGACGGGCTA TCAGGAGGTT TTGACAGATC CGTCTTACTG CGGACAGATT GTTGCAATGA CCTATCCTTT GATAGGCAAT TACGGTGTAA ACAGTGAAGA TATAGAGTCG GAAAAGCCAC AAGTAAAGGG TTTTATTGTA AGGGAGCTTT GTCAAAACCC AAGCAACTGG AGAGCCGAGG AGACGTTAAA CAACTATCTG AAAAGAAATA ACATAATAGG AATTGAGAAA ATTGATACCA GGGCTCTTAC GAGGATTTTG AGGGAAAAAG GAACGATGAA GGGAATGATT TCAACGGATC CGAATTTCAA TCTTGATGAC AAGATTGACG AAATAAAAGC TTATGTTATA AAGGATCCGG TTATGTGTGT CACAACAAAA GAAGTTTTGC ATTATAAAGG TGACGGATTT AAAGTTGCAT TGATAGATTT GGGCTTAAAG AAAAATATTG TGCGCTCCCT TTTAAAAAGA GGATGTGACG TGCATGTTTT CCCTGCCAAT TCCAAAGCGG AGGACATCCT TGCGATTAAT CCCGACGGAA TAATGCTTTC AAACGGACCG GGGGATCCGA AGGATTGTGT TGAGACAATT GAGACCATAA AGAAGCTTAT GGGCAAAAAA CCCATGTTTG GCATCTGCCT TGGGCATCAG CTTACAGCCC TTGCCAACGG TGCCGATACC GAAAAACTCA AATACGGCCA CAGGGGAGCA AACCATCCGG TGAAGGACCT CGAAAAGGAC CTGACATATA TTACTTCCCA AAACCATGGC TACACTATTG TTGAGTCATC CATGGACAAA TCAAGGATGA CGGTAAGCCA CAGAAACATG AACGACGGCA CTGTCGAAGG CGTAAGGTAC AAGGATATGC CGGTGTTTAC CGTGCAGTTT CATCCGGAAG CCTCACCGGG GCCTGAGGAC ACGGCTTATC TGTTTGACGA GTTTATTGAT ATGATGAAAA AATATTCGCG TTAA
|
Protein sequence | MKAILALEDG TIFHGESFGA QGEVIGEIVF NTGMTGYQEV LTDPSYCGQI VAMTYPLIGN YGVNSEDIES EKPQVKGFIV RELCQNPSNW RAEETLNNYL KRNNIIGIEK IDTRALTRIL REKGTMKGMI STDPNFNLDD KIDEIKAYVI KDPVMCVTTK EVLHYKGDGF KVALIDLGLK KNIVRSLLKR GCDVHVFPAN SKAEDILAIN PDGIMLSNGP GDPKDCVETI ETIKKLMGKK PMFGICLGHQ LTALANGADT EKLKYGHRGA NHPVKDLEKD LTYITSQNHG YTIVESSMDK SRMTVSHRNM NDGTVEGVRY KDMPVFTVQF HPEASPGPED TAYLFDEFID MMKKYSR
|
| |