Gene Msed_1987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1987 
Symbol 
ID5103374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1921012 
End bp1922187 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content50% 
IMG OID640507875 
Productargininosuccinate synthase 
Protein accessionYP_001192051 
Protein GI146304735 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.118414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAG TCCTAGCCTA TTCAGGAGGT TTAGATACAA CTGTTGCGAT AAAGTGGTTA 
AGTGAGACCT TTCACGCCGA AGTGATAAGC GTAAGCGTCG ATGTAGGACA GAAGGACGAC
TTTAAGAAAA TTGAGGAAAG AGCGTACAAG GCTGGAAGTG CTAAGCATTA CCTAGTGGAT
GCCAAGAGGG AGTTTGCTGA AAACTTCGCT CTGAAGGACA TTAAGATGAA CGGCCTTTAT
GAGGAGGTGT ACCCGCTTGC CACTGCGCTC GCGAGACCAC TCATAGCCGA GAAGGTCGCT
GAGGTTGCGA AAAAGGAAGG CACAGAATAT GTTGCGCACG GGTCTACATC CAAGGGGAAT
GACCAGGTTA GGTTTGACCT GGCCCTTAAG ACAGCGTTAG ATAACGTCAA GATAATAGCT
CCAGCCAGGA TCTGGAAGAT GACAAGGGAG GATGAAATAG CCTACGCCAG GGAAAGGGGA
ATTCCCATAA AGACCGAGAG CAGTAAGTAC AGTATTGATG AAAACCTTTG GGGGAGAAGC
ATAGAGGGGG ACATAATCTC GGATCCCGCG TCAGAGGTTC CAGAGGACGC ATTTGAGTGG
ACTGCTGTGA GGAAACAAGA CAAACTGAAG TTGAGCGTGG AGTTCGAGAA AGGAGTTCCC
GTTAGAGTTA ACGGCGAGAA GCTTGATCCG GTTAAGCTCA TTTCCCTGTT GAACGAGGAG
GTAGGATCCA GGGGATTCGG AAGGGTAGAA CACCTTGAGA ACAGGGTAGT TGGTTTCAAG
TCAAGGGAGG TGTATGAGGC ACCCGCAGCT CTAGCCCTCA TAGCGGCGCA TAAGGATCTG
GAAAAAACTG TCCTCACTCC CTTGGAGCTC AGGTTCAAGA GACACCTTGA CTCCTTGTGG
TCTGATCTAG TGTACCAGGG ACTCTGGTAT GAACCGCTGA GGAATACCCT TGAGCTCGCA
GGAGATGAGA TAAACAAGTG GGTCTCCGGA GAGGTTAAGC TAGAAGTGGA CCTGAAGAGT
CTCAGGGTAG TGGGTAGGAC CTCTCCTTAC TCGCCATACT CAGAAAAAAT ATCCTCCTAC
AACAAGGGAT GGTATCCCTC GGATGAGGAG GCCAGAGGGT TCATTGAGAT CTGGGGAATG
CACTCCCTAC TAACAAGGAA GGCGAGGTAT GGCTAA
 
Protein sequence
MKIVLAYSGG LDTTVAIKWL SETFHAEVIS VSVDVGQKDD FKKIEERAYK AGSAKHYLVD 
AKREFAENFA LKDIKMNGLY EEVYPLATAL ARPLIAEKVA EVAKKEGTEY VAHGSTSKGN
DQVRFDLALK TALDNVKIIA PARIWKMTRE DEIAYARERG IPIKTESSKY SIDENLWGRS
IEGDIISDPA SEVPEDAFEW TAVRKQDKLK LSVEFEKGVP VRVNGEKLDP VKLISLLNEE
VGSRGFGRVE HLENRVVGFK SREVYEAPAA LALIAAHKDL EKTVLTPLEL RFKRHLDSLW
SDLVYQGLWY EPLRNTLELA GDEINKWVSG EVKLEVDLKS LRVVGRTSPY SPYSEKISSY
NKGWYPSDEE ARGFIEIWGM HSLLTRKARY G