Gene Smed_4987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4987 
Symbol 
ID5318800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1500441 
End bp1501907 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content64% 
IMG OID640776769 
Productargininosuccinate lyase 
Protein accessionYP_001313701 
Protein GI150377105 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00718333 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGAAC CCACTCAGCT CTGGGGTGGA CGATTCAAGT CCGGACCGTC CGAAGCGCTT 
GCAAATCTGT CACGCGCGCC GAGCTCTTAC TTTCGCCTTT ACAGAGAGGA TATTGCGGGG
TCGCGCGCTC ATGCTTCGGA ATTGAAGCGC GCCGGCGTCC TTGACGAGGG CGAATTCTCC
GCGATACGAG CAGCCCTGGA AGCGATCGAA ACCGATGTCG GTGCCGGCCA TGAGAAGCCG
ATTGCCGCTG ACGAGGATCT GCATACCTTT CTCGAACGCC TGCTGATGGC GCGCCTCGGC
GCCCTTGGCG GCAAGCTTCG CGCCGGACGT TCCCGCAACG ATCAGACTGC GAACAATACG
CGCCTTTATC TACGGCGTAT GGCGCGGGAG CTTTCCCGAG GCATTATCGC CGTCGAGGAA
GCGCTGACGG AGCAGGCATC CCGGCATACG GAAACGGTAA TGCCCGGCTT TACTCATCTC
CAGCCGGCCC AGCCGGTCGT GCTCGGGCAC CACCTCATGG CGCATGCGCA GTCGCTGCTG
CGCGACCTTC AGCGTTTCGC GGATTGGGAC CGCCGATTCG ATCGGTCGCC GCTTGGCGCG
GCCGCGCTAG CGGGATCGGG CATTGCCCGC CGTCCCGACC TTTCCGCCGT CGATCTCGGC
TATTCGGCCG CGTGCGAGAA CTCCATCGAT GCTGTCGCAG CGCGCGACCA TGTCGCGGAG
TTTCTCTTCA TCTGCTCGCT GGTGGCGGTG GATCTCTCCC GGCTTGCGGA GGAAATCTGC
CTTTGGAGCT CCAAACAGTT CAGCTGGGTG CGGCTCGATG ATGCCTATTC CACAGGTTCC
TCGATCATGC CGCAGAAGAA GAATCCCGAC GTCGCCGAAC TGACGCGCGG CATGTCCGGC
ACGCTGATCG GCAACATTGC CGGGTTCCTG GCGACCATGA AGGCGATGCC GCTCGCCTAT
AATCGCGACC TTGCCGAAGA CAAGCGCAGC CTGTTCGAGA CGATCGACAT TCTCGACCTG
GTCCTGCCGG CCTTTGCCGG CATGGTGGGG ACGCTGGAAT TCGACGTGGA GAAACTGCGG
GAGGAAGCGC CGAAGGGCTT CACCCTGGCG ACCGAAGTCG CCGACTGGCT GGTCGGACGG
GACGTGCCCT TTGCGGAAGC GCACGAGATT ACCGGGGCCG TGGTCCGCTA CTGCGAAGAG
CGCGGTCATG ATCTTGCCGG GCTGACCCCC GACGACCTGG CGAAGATCGA TCCGCGTCTT
CACGCCGGGA TGCTCGCAGC GCTCACACTC GACAAGGCGC TTGCGAGCCG CACCGGATAC
GGCGCAACCG CGCCGGAAAG GGTTCGCGAG CAGATCGCCC GTTTCGAAAC GGCACTTGCC
GAATGCCGGG CCTTTGCTGC CGCCCCATCC GGCGGGGCGG CCTTTGCGGG TCCGAAGAGC
GACGTAGAGG AGGAGCGACG TCGATGA
 
Protein sequence
MAEPTQLWGG RFKSGPSEAL ANLSRAPSSY FRLYREDIAG SRAHASELKR AGVLDEGEFS 
AIRAALEAIE TDVGAGHEKP IAADEDLHTF LERLLMARLG ALGGKLRAGR SRNDQTANNT
RLYLRRMARE LSRGIIAVEE ALTEQASRHT ETVMPGFTHL QPAQPVVLGH HLMAHAQSLL
RDLQRFADWD RRFDRSPLGA AALAGSGIAR RPDLSAVDLG YSAACENSID AVAARDHVAE
FLFICSLVAV DLSRLAEEIC LWSSKQFSWV RLDDAYSTGS SIMPQKKNPD VAELTRGMSG
TLIGNIAGFL ATMKAMPLAY NRDLAEDKRS LFETIDILDL VLPAFAGMVG TLEFDVEKLR
EEAPKGFTLA TEVADWLVGR DVPFAEAHEI TGAVVRYCEE RGHDLAGLTP DDLAKIDPRL
HAGMLAALTL DKALASRTGY GATAPERVRE QIARFETALA ECRAFAAAPS GGAAFAGPKS
DVEEERRR