Gene Information |
Locus tag | Athe_1540 |
Symbol | |
ID | 7409048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1627341 |
End bp | 1628411 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643715912 |
Product | carbamoyl-phosphate synthase, small subunit |
Protein accession | YP_002573411 |
Protein GI | 222529529 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0505] Carbamoylphosphate synthase small subunit |
TIGRFAM ID | [TIGR01368] carbamoyl-phosphate synthase, small subunit |
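The start and end coordinates, gene length, and protein length listed above are mutually consistent, and a short calculation makes the relationship explicit. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Athe_1540).
length_bp = gene_length(1627341, 1628411)
print(length_bp, protein_length(length_bp))  # prints: 1071 356
```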
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG CGGTACTTGT TTTAGAAGAC GGGCTGACAT TTGAAGGAAT TTCACTTGGT AGTGAAGGTG AAACAATTGG TGAAATTGTA TTCAATACAT GCATGACAGG ATACCAAGAG GTACTGACTG ACCCTTCCTA CAATGGTCAG ATTGTCACAA TGACCTACCC TCTTATAGGC AATTACGGAA TAAATGGTGA AGATGTGGAG TCATACAAAC CTCATGTTGA AGGTTTTATT GTAAGAGAGG CATGTAAAAC ACCTTCAAAT TTCAGGGCTC AGAAGACATT GCATGAATAT TTAAAAGAGA ATAATATTGT GGCAATTGAG GGTGTTGATA CACGAGCTAT AACTGAACAC ATACGAAATA AAGGCTCAAT GCTTGGTATT ATCTCAACAG AGACCGACAA TAAAGAGATA CTTTTGAGCA AAATATTTGA GTATAAAAAA CAAAAGCCAT CTTTGGTAAA AGAAGTTTCG ACCACTCAGG TATACAGAAT TGAAGGCAGT GGCAAGAAGG TTGCTGTTTT AGACTTTGGA ATAAAGCAAA ACATCTTGAG AGAACTTTCA AAAAGAGGGC TTGATTTATA TGTGTTTCCG TATAATAGCT CTCTTGACCA AATCATGAAT ATTAATCCAG AAGGGTTTGT GTTCTCAAAC GGACCGGGTG ATCCTACTGA CTTAGAAGAA TTCTTCCCCA CTTTGAAACA AATAATAGAG CTAAAAAAAC CTATCCTTGG AATCTGTCTT GGCCATCAGC TTTTAGGTTT GTGTTTAGGA CTTAAGACAT ACAAGCTGAA ATTTGGTCAC CATGGTGGAA ACCACCCGGT TAAAGATTTA ATTTCCGGCA AGGTTTATAT AACCTCACAG AATCATAACT ATGCAATTGA ATATAAAGAA TATGAAAAAA TAAAAATCAC ACATGTAAAT GTAAATGATA AAACTGTTGA GGGGTTTGCT CATCTTGATT TGCCTATAGT ATCTGTTCAA TACCATCCTG AGGCAAGCCC GGGTCCACAT GACTCAAAAT ATATATTTGA CCAATTTACT AAGCTTTTGA ATGGAGTGTA G
|
Protein sequence | MKKAVLVLED GLTFEGISLG SEGETIGEIV FNTCMTGYQE VLTDPSYNGQ IVTMTYPLIG NYGINGEDVE SYKPHVEGFI VREACKTPSN FRAQKTLHEY LKENNIVAIE GVDTRAITEH IRNKGSMLGI ISTETDNKEI LLSKIFEYKK QKPSLVKEVS TTQVYRIEGS GKKVAVLDFG IKQNILRELS KRGLDLYVFP YNSSLDQIMN INPEGFVFSN GPGDPTDLEE FFPTLKQIIE LKKPILGICL GHQLLGLCLG LKTYKLKFGH HGGNHPVKDL ISGKVYITSQ NHNYAIEYKE YEKIKITHVN VNDKTVEGFA HLDLPIVSVQ YHPEASPGPH DSKYIFDQFT KLLNGV
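The protein sequence can be reproduced from the gene sequence, and the GC content recomputed, with a few lines of code. The sketch below is a hedged illustration rather than the database's own method: it assumes the sequence is given on the coding strand, strips the spaces used to group bases in blocks of ten, and uses the standard codon assignments, which translation table 11 shares (table 11 differs in the alternative start codons it permits). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base grouping) and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAAAAAG CGGTACTTGT TTTAGAAGAC"))  # prints: MKKAVLVLED
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and the reported GC content.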