Gene Information |
Locus tag | Msed_1989 |
Symbol | |
ID | 5103376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1923508 |
End bp | 1924626 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507877 |
Product | carbamoyl phosphate synthase small subunit |
Protein accession | YP_001192053 |
Protein GI | 146304737 |
COG category | [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism |
COG ID | [COG0505] Carbamoylphosphate synthase small subunit |
TIGRFAM ID | [TIGR01368] carbamoyl-phosphate synthase, small subunit |
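The coordinate, gene length, and protein length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (an assumption, not stated in the record) and that the final codon of the CDS is a stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1923508
end_bp = 1924626
protein_length_aa = 372

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp)       # 1119
print(remainder)            # 0
print(expected_protein_aa)  # 372
```

Under these assumptions the computed values agree with the listed Gene Length (1119 bp) and Protein Length (372 aa).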
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATACT GCAAGAGGGG AACTGAAGGG TTAATCTACC TAGAGGACGG AACCCTGTTG AGGGGTTGTG GCTTCGGCGC TAAGGGAGTG AGGTACGGGG AAGTGGTTTT CACTACGGCC ATGAACGGGT ATCCAGAGTC CATGACCGAT CCGTCTTACA GGGGTCAGAT ACTTATCATA ACGCATCCCC TCGTGGGAAA TTACGGTGTC CCAAACCCCA TTGTGAGAAA TGGGATTCTC CAGAACTTTG AGTCAGAGCA GATCCAGATC GAGGGGCTCG TGGTTACGGA GGAGACTGAT CCCTCAAAGT GGAACTCTTC CAAGAGCCTT CACCAGTGGA TGGCTGAACA GGGTATTCCA GGAGTCTCCT CCGTGGACAC TAGACTCCTG GTTAAGAAAG TAAGAACCCT GGGCTCCATG ATGGGGGTAA TTGCCTCCGG GGAACATGTG GAGGATCCTA GGAAATACAT AGAGATGAGG TATGACGAGA TAGACTTCAC TAAGTTTACC TCCCCCAAGT CCCCTATCAT CCACCAGAAC AACTCCCCAG ATATTATTGT GTTAGTGGAC TGCGGAATAA AGCACGGCAT ACTAGAGGAG CTATACAAGA CTGGCTTCAC CATAGTGAGG GTTCCGTGCA AGTCCAGTGC CGATGAGATC ATGAACTATT CTCCAAAGGG CATAGTCTTT GGAAACGGAC CTGGAAATCC TAACATCCTG AAGGATCTAG TGAAGAACTT CTCCGCAGTC ATGGAGTATA AATTGCCCAC CCTTGGAATA TGCCTTGGGC ATCAAGTGGC AACTCTAGCC TTGGGTGGCA ATGTAAGAAA GATGAAGTTT GGACATAGGG CAATAAATAA GCCCGTGACG GATATCTCAA ATAACAAGTG CTACATATCT ACACACAATC ACGGTTATGG AGTATACAAG GAGGACATTC CGCCTGACAC GCAGATCTGG TTCGTGAATC CAGATGACGG GGTAGTCGAG GGGTTAATAC ACAAGAGACT TCCCCTGATC ACAACTCAAT TTCACCCGGA AGCTAGGCCG GGACCCAATG ATACGACTTG GGTTTTCCAG AAGTTTAAGA AGATGGTGAT AAAGGATGAA GGGAATTAA
|
Protein sequence | MTYCKRGTEG LIYLEDGTLL RGCGFGAKGV RYGEVVFTTA MNGYPESMTD PSYRGQILII THPLVGNYGV PNPIVRNGIL QNFESEQIQI EGLVVTEETD PSKWNSSKSL HQWMAEQGIP GVSSVDTRLL VKKVRTLGSM MGVIASGEHV EDPRKYIEMR YDEIDFTKFT SPKSPIIHQN NSPDIIVLVD CGIKHGILEE LYKTGFTIVR VPCKSSADEI MNYSPKGIVF GNGPGNPNIL KDLVKNFSAV MEYKLPTLGI CLGHQVATLA LGGNVRKMKF GHRAINKPVT DISNNKCYIS THNHGYGVYK EDIPPDTQIW FVNPDDGVVE GLIHKRLPLI TTQFHPEARP GPNDTTWVFQ KFKKMVIKDE GN
|
| |
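The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is a hedged illustration, not the database's own method: the function names `translate` and `gc_content` are hypothetical, and the codon table is the standard one, which is assumed to be adequate here because translation table 11 differs from the standard code only in its permitted start codons, not in its codon-to-amino-acid assignments.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: sufficient for table 11, which differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the whitespace that separates the 10-base blocks and uppercase."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
fragment = "ATGACATACT GCAAGAGGGG AACTGAAGGG"
print(translate(fragment))  # MTYCKRGTEG
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.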