Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1988 |
Symbol | |
ID | 5103375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1922187 |
End bp | 1923533 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507876 |
Product | argininosuccinate lyase |
Protein accession | YP_001192052 |
Protein GI | 146304736 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0165] Argininosuccinate lyase |
TIGRFAM ID | [TIGR00838] argininosuccinate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.415578 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTATACA GGAAGTGGGG TTCAAAGGAG GACAAGGTAG TTACTTACAC TTCCTCATCG AGGGAGGATG TTAGGATACT GGATAAGGTA AAGGAAGTCA TGAAGGCACA TGTAATAGAG CTTTTCCTTT CAGGTAATCT TTCCAAGGAG GATGCGAGGA AGTTACTCCA AGGTATAAAC TCCTTCACCA AGGTGGATCC CTCCTATGAG GACGTACATG AGGCACTCGA GGATCACCTA ATTAAGACGG CGGGTGAGGC TGGAGGCTCA ATTGGCCTAG GGAGGAGCAG GAACGATCAT GTGGCCGCTG CCCTGAGGCT GGAGATCAGG GAAGAACTAA TGGACCTTCT GGAACAACTC CTAGACTTCA GGAAGATGAT CCTTAAGAGA GCCGAGGAGA ACGTCAATAC GCCCTTTGTA GTTTACACTC ACTTTCAACC TGCTCAGCCA ACTACCTTTG GACACTACCT ACTTTACGTG GAGGAGGAAG TTGCCTCCAG ATGGAGCTCG ATCTTCAGAA CTCTGGACCT TGTTAATCGG TCTCCCCTTG GGAGCGGGGC CATAGTGGGC ACTTCAGTTA AACTCAATAG GGACAGGGAA TCCAACCTTC TAGGTTTCAG GGAGGTTCTG GTTAACACGA TTTCAGCCAC GTCCTCCAGG GCAGACCTGA TCTCTGCCGT TATGGAGGTT GTTAACCTTA TGCTCGCCCT AAGCAGGGTG GTTGAGGACA TGATCCTGTT GTCCTCCAAG TTTGTGGGAA TACTGGAATT ACCGGACACC CACGTGAGCA CGAGCTCGCT TATGCCACAG AAGAGGAACG CAGTAACCAT GGAGGTGCTC AGGGCAAGGG TCTCAAGGGT ACTGGGTTAC TTGACCTCGA TTGCCTCGAC TTACAAATCA CTTCCATCGG GATATAACCT AGATCTTCAG GAGATCAACC CACTCTTCTG GGAGATAATT GACGAGACTA GAACGGGAAT AGAGGTACTT CATGACCTCC TCTCCAAGGT GAAGGTGAAC GAATTCAAGA TAGATAAGGA GGTACTCTCC ACCGATGAGG CAGAGATACT TGCCGAGAAG GGAATGAGGT ACAGGGATGC CTACTTCGCC GTGGCCAAGG CGGTGAGGGA GGGCAAGTTC TCCCCCACGA TTACGCCTGA ACAGTCAATT TATAGAAAAG CAGTCAAGGG AAGTCCTAGC CCAGAGAGGG TCAGGGAAAG TATTTCCCTC GCTAGGGAAA GGTTAAATGG TGACGAGAAT TTGTTAAGAG AATATAAAAA CTGGACTCTC AAAGGAGAAG GGGAATTGAG GTTGATGGAA AATGACATAC TGCAAGAGGG GAACTGA
|
Protein sequence | MLYRKWGSKE DKVVTYTSSS REDVRILDKV KEVMKAHVIE LFLSGNLSKE DARKLLQGIN SFTKVDPSYE DVHEALEDHL IKTAGEAGGS IGLGRSRNDH VAAALRLEIR EELMDLLEQL LDFRKMILKR AEENVNTPFV VYTHFQPAQP TTFGHYLLYV EEEVASRWSS IFRTLDLVNR SPLGSGAIVG TSVKLNRDRE SNLLGFREVL VNTISATSSR ADLISAVMEV VNLMLALSRV VEDMILLSSK FVGILELPDT HVSTSSLMPQ KRNAVTMEVL RARVSRVLGY LTSIASTYKS LPSGYNLDLQ EINPLFWEII DETRTGIEVL HDLLSKVKVN EFKIDKEVLS TDEAEILAEK GMRYRDAYFA VAKAVREGKF SPTITPEQSI YRKAVKGSPS PERVRESISL ARERLNGDEN LLREYKNWTL KGEGELRLME NDILQEGN
|
| |