Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0179 |
Symbol | |
ID | 4808667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 215545 |
End bp | 216762 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105590 |
Product | argininosuccinate synthase |
Protein accession | YP_001036613 |
Protein GI | 125972703 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0137] Argininosuccinate synthase |
TIGRFAM ID | [TIGR00032] argininosuccinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAAA AAGAAAAAGT AATCCTTGCC TATTCCGGCG GTCTGGATAC TTCAATAATC ATTCCCTGGC TTAAAGAAAA TTATGACTAT GAAGTCATTG CAATGGCAGC TGACGTTGGA CAGGGCGAAG AGCTTGAACC CTTAAGAGAG AAAGCTATAA AAACCGGGGC AAGCAAAATA TATATTGAAG ACTTGAAAGA AGAATTTGTG ACAGACTTTA TTTTCCCCAC ATTAAAAGCA GGTGCCGTAT ATGAAGGAAA ATATCTTTTG GGTACATCCT TTGCAAGACC TCTTATAGCA AAGAGAATGG TTGAAATTGC CTTAAAAGAA GGCGCTACAG CAGTCGCTCA CGGAGCAACG GGAAAAGGAA ACGACCAGGT CCGTTTCGAG CTGACCGTCA AAGCTCTTGC TCCCCATCTT AAAATCATTG CTCCGTGGAG AATATGGGAC ATTAAATCCA GAGAAGATGA AATCGAATAC GCCCAGGCAA GAAACATCCC CATTCCTGTA TCAAAAGAAG ACAACTACAG CATGGACAGA AACCTCTGGC ACCTCAGCCA TGAAGGCTTG GATTTGGAAG ACCCATGGAA CGAACCGCAG TATGACAAAA TATTAAAGCT CATGGTTCCG CCGGAAAAAG CTCCGGACAA ACCCACCTAT GTTGAAATAT ATTTTGAAAA GGGTATACCT AAAAAAGTTA ACGGTGTGGA ATACGGACCT GTGGAGCTTA TTGAAGTTCT TAACAAAATC GGCGGTGAAA ACGGTATAGG AATCGTTGAC ATAGTGGAAA ACAGATTGGT CGGAATGAAA TCCAGAGGCG TTTATGAAAC TCCGGGTGGA ACCATTCTTT ATGCGGCTCA CAGAGAGCTT GAGCTTTTGT GTCTTGACCG TGACACTCTC CATTACAAAG ATCTTGTGGC TCAAAGATTT GCAGAGCTTG TTTATTACGG ACAGTGGTAT ACTCCTCTGC GTGAGGCTAT TTCAGCCTTT GTTGATGTTA CTCAGGAGAC GGTTACCGGT ACAGTAAGAT TAAAACTTTA CAAAGGAAAT ATAATAAGCG CGGGAGCCAA ATCCGATTAT TCCCTCTACA GTGAAGAACT TTCCACGTTC GGAGAAGACA ATGTATACAA TCAAAAGGAT GCCGAAGGAT TTATAAATCT CTTTGGTCTT CCGATGAAGG TTCAGGCTCT TATGAAAGAA AAGAATAAAG GCAAATAA
|
Protein sequence | MSQKEKVILA YSGGLDTSII IPWLKENYDY EVIAMAADVG QGEELEPLRE KAIKTGASKI YIEDLKEEFV TDFIFPTLKA GAVYEGKYLL GTSFARPLIA KRMVEIALKE GATAVAHGAT GKGNDQVRFE LTVKALAPHL KIIAPWRIWD IKSREDEIEY AQARNIPIPV SKEDNYSMDR NLWHLSHEGL DLEDPWNEPQ YDKILKLMVP PEKAPDKPTY VEIYFEKGIP KKVNGVEYGP VELIEVLNKI GGENGIGIVD IVENRLVGMK SRGVYETPGG TILYAAHREL ELLCLDRDTL HYKDLVAQRF AELVYYGQWY TPLREAISAF VDVTQETVTG TVRLKLYKGN IISAGAKSDY SLYSEELSTF GEDNVYNQKD AEGFINLFGL PMKVQALMKE KNKGK
|
| |