Gene GSU0156 details


Gene Information

Locus tag: GSU0156
Symbol: argH
ID: 2687872
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 173833
End bp: 175209
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 64%
IMG OID: 637124823
Product: argininosuccinate lyase
Protein accession: NP_951218
Protein GI: 39995267
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
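
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the reading under which the listed numbers agree) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(173833, 175209)
print(length, protein_length(length))  # prints: 1377 458
```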


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACACG AAAAACTGTG GGGCGGCCGC TTTTCCGAGC CGACCGACCA ATTCGTCGAG 
GAATTCACCG CTTCCATCGA CTTCGACAAG CGGCTCTACC ACCAGGACAT CCGGGGCTCC
ATCGCCCACG CGCGGATGCT GGGCAAGCAA GGCATCCTCC CCATGGCTGA GGTCGAAAAG
ATCGTGGCGG GGCTCCAGGA GGTCCTTGCC CGGATCGAGG CCGGGAAATT TGACTTCTCC
GTGGCCCTGG AAGACATCCA CATGAACATC GAGGCGCGTC TCACGGAGAA GATCGGCGAG
GCCGGCAAGC GGCTCCACAC CGGCCGTTCC CGCAACGACC AGGTTGCCCT CGACATCCGC
CTCTACCTGC GGGACGAGAT CGTGGAGATC TCCGCCTACC TGGATATGTT GGTGGACTCC
CTCATTTACC AGGCGGAAGC CAACCTGGGC GTCATCATGC CCGGCTACAC CCACCTGCAG
ACCGCCCAGC CCATCCTCTT CTCCCACCAC ATGATGGCCT ACGTGGAGAT GTTCACCAGG
GACAAGGGGC GGATGGAGGA CTGCCTGCGG CGGATGAACG TCCTTCCGCT CGGCGCCGGC
GCCCTGGCCG GGACCACCTT CCCCATCGAC CGGGAACACG TGGCCGAGCT GCTGGACTTC
CCCGGCGTGA CCCGCAATTC CCTCGATTCG GTATCGGACC GGGACTTCGC CCTGGAGTTC
ATGGGGGCCT CCTCGATCCT GATGATGCAC CTCTCCCGTT TCTCCGAGGA GCTGATCCTC
TGGTCCACCA GCGAGTTCAA GTTCGTGGAG CTGACCGATT CCTTCTGCAC CGGCTCCTCG
ATCATGCCCC AGAAGAAGAA CCCGGACGTG CCGGAACTGG TCCGCGGCAA AACCGGCCGG
GTCTACGGCA ACCTCATGGC GCTCCTCACG GTCATGAAGG CCCTGCCCCT GGCTTACAAC
AAGGACATGC AGGAAGACAA GGAGCCTCTC TTCGACACCA TCGACACCGT GAAGGGGAGC
CTCAAGATTT TCGCCGACAT GGTGCGGGAG ATGCGGATCA ACGCCGGGAA CATGCGGGCC
GCGGCCGCCA AGGGTTTCTC CACCGCCACC GACGTGGCCG ACTACCTGGT CCGCCAGGGG
ATGCCCTTCC GCGACGCCCA CGAGGTGGTG GGGAAGACCG TGGCCTACTG TATCGCCAAC
GGCAAGGATC TCCCGGATCT GACCATGGAT GAATGGCAGG GGTTCTCGGA CAAGATCGGC
GAGGACATCT TCGACGCGAT TACCCTGGAA GCGTCGGTCA ACGCCCGCGT CGCCACCGGC
GGCACGGCCC TGGAGCGCGT CAAGGCGGAG ATCGAGCGGG CCAAGGTGGG GAGATAG
 
Protein sequence
MAHEKLWGGR FSEPTDQFVE EFTASIDFDK RLYHQDIRGS IAHARMLGKQ GILPMAEVEK 
IVAGLQEVLA RIEAGKFDFS VALEDIHMNI EARLTEKIGE AGKRLHTGRS RNDQVALDIR
LYLRDEIVEI SAYLDMLVDS LIYQAEANLG VIMPGYTHLQ TAQPILFSHH MMAYVEMFTR
DKGRMEDCLR RMNVLPLGAG ALAGTTFPID REHVAELLDF PGVTRNSLDS VSDRDFALEF
MGASSILMMH LSRFSEELIL WSTSEFKFVE LTDSFCTGSS IMPQKKNPDV PELVRGKTGR
VYGNLMALLT VMKALPLAYN KDMQEDKEPL FDTIDTVKGS LKIFADMVRE MRINAGNMRA
AAAKGFSTAT DVADYLVRQG MPFRDAHEVV GKTVAYCIAN GKDLPDLTMD EWQGFSDKIG
EDIFDAITLE ASVNARVATG GTALERVKAE IERAKVGR
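
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own pipeline. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons it permits; this gene starts with ATG). `CODON_TABLE` and `translate` are hypothetical names introduced here for illustration.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCACACG AAAAACTGTG GGGCGGCCGC"))  # prints: MAHEKLWGGR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should give the full 458-residue protein, because the trailing TAG codon ends translation.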