Gene Msed_1988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1988 
Symbol 
ID5103375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1922187 
End bp1923533 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content49% 
IMG OID640507876 
Productargininosuccinate lyase 
Protein accessionYP_001192052 
Protein GI146304736 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.415578 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTATACA GGAAGTGGGG TTCAAAGGAG GACAAGGTAG TTACTTACAC TTCCTCATCG 
AGGGAGGATG TTAGGATACT GGATAAGGTA AAGGAAGTCA TGAAGGCACA TGTAATAGAG
CTTTTCCTTT CAGGTAATCT TTCCAAGGAG GATGCGAGGA AGTTACTCCA AGGTATAAAC
TCCTTCACCA AGGTGGATCC CTCCTATGAG GACGTACATG AGGCACTCGA GGATCACCTA
ATTAAGACGG CGGGTGAGGC TGGAGGCTCA ATTGGCCTAG GGAGGAGCAG GAACGATCAT
GTGGCCGCTG CCCTGAGGCT GGAGATCAGG GAAGAACTAA TGGACCTTCT GGAACAACTC
CTAGACTTCA GGAAGATGAT CCTTAAGAGA GCCGAGGAGA ACGTCAATAC GCCCTTTGTA
GTTTACACTC ACTTTCAACC TGCTCAGCCA ACTACCTTTG GACACTACCT ACTTTACGTG
GAGGAGGAAG TTGCCTCCAG ATGGAGCTCG ATCTTCAGAA CTCTGGACCT TGTTAATCGG
TCTCCCCTTG GGAGCGGGGC CATAGTGGGC ACTTCAGTTA AACTCAATAG GGACAGGGAA
TCCAACCTTC TAGGTTTCAG GGAGGTTCTG GTTAACACGA TTTCAGCCAC GTCCTCCAGG
GCAGACCTGA TCTCTGCCGT TATGGAGGTT GTTAACCTTA TGCTCGCCCT AAGCAGGGTG
GTTGAGGACA TGATCCTGTT GTCCTCCAAG TTTGTGGGAA TACTGGAATT ACCGGACACC
CACGTGAGCA CGAGCTCGCT TATGCCACAG AAGAGGAACG CAGTAACCAT GGAGGTGCTC
AGGGCAAGGG TCTCAAGGGT ACTGGGTTAC TTGACCTCGA TTGCCTCGAC TTACAAATCA
CTTCCATCGG GATATAACCT AGATCTTCAG GAGATCAACC CACTCTTCTG GGAGATAATT
GACGAGACTA GAACGGGAAT AGAGGTACTT CATGACCTCC TCTCCAAGGT GAAGGTGAAC
GAATTCAAGA TAGATAAGGA GGTACTCTCC ACCGATGAGG CAGAGATACT TGCCGAGAAG
GGAATGAGGT ACAGGGATGC CTACTTCGCC GTGGCCAAGG CGGTGAGGGA GGGCAAGTTC
TCCCCCACGA TTACGCCTGA ACAGTCAATT TATAGAAAAG CAGTCAAGGG AAGTCCTAGC
CCAGAGAGGG TCAGGGAAAG TATTTCCCTC GCTAGGGAAA GGTTAAATGG TGACGAGAAT
TTGTTAAGAG AATATAAAAA CTGGACTCTC AAAGGAGAAG GGGAATTGAG GTTGATGGAA
AATGACATAC TGCAAGAGGG GAACTGA
 
Protein sequence
MLYRKWGSKE DKVVTYTSSS REDVRILDKV KEVMKAHVIE LFLSGNLSKE DARKLLQGIN 
SFTKVDPSYE DVHEALEDHL IKTAGEAGGS IGLGRSRNDH VAAALRLEIR EELMDLLEQL
LDFRKMILKR AEENVNTPFV VYTHFQPAQP TTFGHYLLYV EEEVASRWSS IFRTLDLVNR
SPLGSGAIVG TSVKLNRDRE SNLLGFREVL VNTISATSSR ADLISAVMEV VNLMLALSRV
VEDMILLSSK FVGILELPDT HVSTSSLMPQ KRNAVTMEVL RARVSRVLGY LTSIASTYKS
LPSGYNLDLQ EINPLFWEII DETRTGIEVL HDLLSKVKVN EFKIDKEVLS TDEAEILAEK
GMRYRDAYFA VAKAVREGKF SPTITPEQSI YRKAVKGSPS PERVRESISL ARERLNGDEN
LLREYKNWTL KGEGELRLME NDILQEGN