Gene GSU0153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0153 
SymbolargG 
ID2687910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp170807 
End bp172027 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content63% 
IMG OID637124820 
Productargininosuccinate synthase 
Protein accessionNP_951215 
Protein GI39995264 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAAG CGCACAAAGA CGTCAAGAAG ATCGTCCTTG CCTATTCGGG CGGACTCGAC 
ACATCCATCA TCCTCAAGTG GCTCAAGAAC GAGTACGGCT GCGAGGTCAT CGCCTTCTCC
GCCGATCTGG GCCAAGGCGA CGAACTGGCG CCGATCCGCG ACAAGGCCAT CGCAACCGGC
GCCGACAAGG TCTACATCGA CGACCTGAAG GAAGAATTCG TCAAGGACTT CGTATTCCCC
ATGTTCCGCG CCAACGCCAT TTACGAGGGG CACTACCTCC TGGGCACCTC CATCGCCCGC
CCCCTCATCG CCAAGCGTCA GATGGAGATC GCGAAGATCG AGGGTGCCGA CGCCGTTTCC
CACGGCGCCA CCGGCAAGGG GAACGACCAG GTCCGCTTCG AGCTGGCCTA CTACCACTTT
GACCCGGCCA TCACCGTGGT TGCCCCCTGG CGCGAGTGGA AGCTCAACAG CCGCCAGGCC
CTGGTGAACT ACGCCAGGAA GAACGGGATC CCGATCCCGG TCACCAAGAA GCGTCCCTGG
TCCTCGGACC GCAACCTCCT CCACATCTCC TTCGAGGGCG GCATCCTGGA GGATACCTGG
GCCGAGCCCC CCGAGAACAT GTACGTGCTG ACCAAGGCGC CGGAAAAGGC CCCCAACAAG
CCCCAGTTCG TGGAGATCGA ATTCAAAAAC GGCAATGCCG TGGCCGTTGA CGGCGAGAAG
ATGAGCCCGG CCCAGCTCCT GGCTCACCTG AACTACATCG GCGGCGAGCA CGGTATCGGC
CGGGTCGATC TCCTGGAGAA CCGCTCCGTG GGCATGAAGT CCCGGGGCGT GTACGAGACC
CCCGGCGGCA CCATCCTGCG CGAGGCCCAC TCGGCGGTGG AGCAGATCAC CATGGACCGC
GAGGTCATGC GGATCCGCGA CTCCCTCATC CCCGAGTACG CCCGCCAGGT CTATGCCGGC
TACTGGTTCT CGCCGGAGCG GGAGATGCTC CAGACCCTGA TCGACGATTC CCAGAAGTGC
GTGAACGGCG TGGCCCGGGT GAAACTCTAC AAGGGGCACT GCCGTACCGT GGGGCGCAAG
TCCGAGACCA ACTCTCTCTT CAACCTGGAC TTCGCCACCT TCGAAAAGGA TCAGGTCTTC
AACCAGGCCG ACGCCACCGG CTTCATCAAG ATCAACTCCC TGCGGCTGCG GATCAGGTCG
CTCATGCAGG GGAAGAAGTA G
 
Protein sequence
MAKAHKDVKK IVLAYSGGLD TSIILKWLKN EYGCEVIAFS ADLGQGDELA PIRDKAIATG 
ADKVYIDDLK EEFVKDFVFP MFRANAIYEG HYLLGTSIAR PLIAKRQMEI AKIEGADAVS
HGATGKGNDQ VRFELAYYHF DPAITVVAPW REWKLNSRQA LVNYARKNGI PIPVTKKRPW
SSDRNLLHIS FEGGILEDTW AEPPENMYVL TKAPEKAPNK PQFVEIEFKN GNAVAVDGEK
MSPAQLLAHL NYIGGEHGIG RVDLLENRSV GMKSRGVYET PGGTILREAH SAVEQITMDR
EVMRIRDSLI PEYARQVYAG YWFSPEREML QTLIDDSQKC VNGVARVKLY KGHCRTVGRK
SETNSLFNLD FATFEKDQVF NQADATGFIK INSLRLRIRS LMQGKK