Gene GM21_3748 details


Gene Information

Locus tag: GM21_3748
Symbol:
ID: 8139122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4317755
End bp: 4319131
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 62%
IMG OID: 644871367
Product: argininosuccinate lyase
Protein accession: YP_003023525
Protein GI: 253702336
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
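
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a stop codon that is not counted in Protein Length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 4317755
END_BP = 4319131
GENE_LENGTH_BP = 1377
PROTEIN_LENGTH_AA = 458


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4319131 - 4317755 + 1 gives 1377 bp, and 1377 / 3 - 1 gives 458 residues, which agrees with the Gene Length and Protein Length fields.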


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 126
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAAAG ACAAGCTGTG GGGCGGGCGC TTCACCCAAC CCACCGACAA GTTCGTAGAA 
GAATTCACCG CCTCCATCAA TTTCGACAAG CGCCTGTACC ATCAGGACAT CCGCGGCTCC
ATCGCCCACG CAACCATGCT GGGCAAGCAG GGGATCATCC CGATAGCCGA CGTCGAGAAC
ATCGTATCGG GACTGAAGGC TATCCTGGAG CAGATCGAGG CGGGCAAGTT CGACTTCTCG
GTCTCCTTGG AAGATATCCA CATGAACATC GAGGCGCGGC TCTCCGAGAA GATCGGCGAC
GCCGGCAAGA GGCTCCACAC CGGCCGCTCC AGAAACGACC AGGTGGCGCT CGACATCAGG
CTCTACCTGC GGGACGAGCT GGTGGAGGTC TCGGCGTACA TCGACCTCTT GATCGACTCC
ATCATCCACC AGGCCGAGGA GAACCTCGGC GTCATCATGC CGGGCTTCAC CCACCTGCAG
ACCGCCCAGC CGATCCTCTT CTCGCACCAC ATGATGGCCT ACCACGAGAT GCTCAAGCGT
GACAAGGCCC GCATGGAGGA CTGCCTGAAA AGGACCAACG TACTTCCCTT GGGCGCGGGG
GCGCTGGCCG GGACCACCTT CCCCATCGAC CGGGAGTACG TCGCGGAGCT TCTCGACTTC
GCCGAGGTCA CCCGCAACTC GCTCGACTCG GTCTCGGACC GCGACTTCGC CATGGAGTTC
TGCGCCGCCT CGTCGATCCT GATGGTGCAC CTCTCCCGCT TCTCGGAGGA ACTGATCCTC
TGGTCCACCA GCGAGTTCAA GTTCGTGGAA CTGTCCGACT CTTTCTGCAC CGGCTCCTCC
ATCATGCCGC AGAAGAAGAA CCCGGACGTC CCGGAACTGG TGCGCGGCAA GACAGGCCGC
GTGAACGGCA ACCTGGTGGC CCTCTTGACC CTGATGAAAT CGCTTCCGCT TGCCTACAAC
AAGGACATGC AGGAGGACAA GGAGCCGCTG TTCGACACCA TAGACACCGT GAAAGGGTGC
CTCAAGGTCT TCGCCGACAT GGTGCGCGAG ATGAAGATCA ACCCGGAGCG GATGGAGGTG
GCCGCGGCCG CGGGTTTCTC CACCGCGACC GACGTGGCCG ACTACCTGGT GCGCAAGGGA
ATCCCCTTCC GCGACGCCCA CGAGATCGTG GGGAAGACGG TGCGCTACTG CATCGAGAAC
GAGATAGACA TCCCCGAGCT TTCGCTTGCC GAGTGGCAGC TCTTCTCAGG GCGCATCGAG
GAGGACATCT TCGAATCGAT CACCCTGGAG GCCTCGGTCA ACGCCCGTCG CGCGACCGGC
GGGACCGCGC TGGAACGGGT GCGCGCCGAG ATCGCCCGGG CCAAGGAAGG TAGGTAA
 
Protein sequence
MSKDKLWGGR FTQPTDKFVE EFTASINFDK RLYHQDIRGS IAHATMLGKQ GIIPIADVEN 
IVSGLKAILE QIEAGKFDFS VSLEDIHMNI EARLSEKIGD AGKRLHTGRS RNDQVALDIR
LYLRDELVEV SAYIDLLIDS IIHQAEENLG VIMPGFTHLQ TAQPILFSHH MMAYHEMLKR
DKARMEDCLK RTNVLPLGAG ALAGTTFPID REYVAELLDF AEVTRNSLDS VSDRDFAMEF
CAASSILMVH LSRFSEELIL WSTSEFKFVE LSDSFCTGSS IMPQKKNPDV PELVRGKTGR
VNGNLVALLT LMKSLPLAYN KDMQEDKEPL FDTIDTVKGC LKVFADMVRE MKINPERMEV
AAAAGFSTAT DVADYLVRKG IPFRDAHEIV GKTVRYCIEN EIDIPELSLA EWQLFSGRIE
EDIFESITLE ASVNARRATG GTALERVRAE IARAKEGR
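
The GC content field can in principle be recomputed from the gene sequence listed above. The following is a minimal Python sketch, assuming GC content is the percentage of G and C bases over the full gene sequence; how the source database rounds the figure is not stated. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First 10-base block of the gene sequence, used as a small example.
    print(gc_percent("ATGTCCAAAG"))
```

To check the record, paste the whole gene sequence block into `gc_percent` as a single string; the grouping spaces and line breaks are removed before counting.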