Gene GM21_3745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3745 
Symbol 
ID8139119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4314436 
End bp4315656 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content62% 
IMG OID644871364 
Productargininosuccinate synthase 
Protein accessionYP_003023522 
Protein GI253702333 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones130 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGA AAGAAGTGAA AAAGATCGTC CTCGCCTACT CCGGCGGGCT TGACACCTCC 
ATCATCCTGA AATGGCTCAA AAACGAGTAC GGCTGCGAAG TGGTCACCTT CTCCGCGGAC
CTGGGACAGG GGGACGAGCT GGAGCCGGTC CGCGAGAAGG CGTTCAAGAC GGGCGCCGAC
AAGGTCTACA TCGACGACCT GCGCGAAGAG TTCGTGCGCG ACTTCGTGTT CCCGATGTTC
CGCGCCAACG CGATCTACGA AGGGTCGTAC CTGCTCGGCA CCTCCATCGC GCGCCCGCTG
ATCGCGAAAC GCCAGATGGA AATCGCCCAG ATCGAGGGTT GCGACGCGGT CTCCCACGGC
GCCACCGGCA AGGGTAACGA CCAGGTCCGC TTCGAGCTCG CCTACTACCA CTTCAACCCC
GGCATCACCG TCGTGGCACC GTGGAGGGAA TGGAAGCTCA ACTCCCGCCA GGCGCTGATC
AACTACGCGA AGAGAAACGA CATCCCGATC CCGATCACCA AGAAGCGCCC CTGGTCTTCC
GACAGGAACC TGCTGCACAT CTCCTTCGAG GGCGGCATCC TGGAGGACAC CTGGCTGGAG
CCCCCCGAGA ACATGTTCGT GCTGACCAAG CCGCCCGAAA AGGCGCCGAA CAAGCCGCAG
TACGTCGAGA TCGAGTTCGA GAAGGGTAAC GCTGTGGCTG TCGACGGCGT GCGCATGTCC
CCGGCTGAGC TTCTGGCTCA CCTGAACACC ATCGGCGGCG AGCACGGCAT CGGCCGCGTC
GACCTCCTGG AGAACCGCTC GGTCGGCATG AAGTCCCGCG GCGTCTACGA GACCCCGGGC
GGCACCATCC TGCGCGAGGC GCACATGGCC GTCGAGCAGA TCACCATGGA CCGCGAAGTC
ATGCATCTGC GGGACTCCCT GATCCCGCGC TACGCCGAGA TGATCTACAA CGGCTACTGG
TTCTCGCCGG AGCGCGAGAT GATGCAGTGC ATGATCGACG AGTCCCAGAA GACGGTGAAC
GGCGTGGCGA GGCTGAAGCT CTACAAGGGT CACTGCCGCA CCGTGGGCAG GAAGTCCGAG
AGCGACTCGC TCTTCAACCT CGACTTCGCC ACCTTCGAGA AGGATCAGGT CTACAACCAG
GCGGACGCCG AGGGCTTCAT CAAGCTGAAC TCCCTGAGGC TCAGGATCCG TTCGCTCATG
CTGGCCAACA AGAACAAGTA A
 
Protein sequence
MAKKEVKKIV LAYSGGLDTS IILKWLKNEY GCEVVTFSAD LGQGDELEPV REKAFKTGAD 
KVYIDDLREE FVRDFVFPMF RANAIYEGSY LLGTSIARPL IAKRQMEIAQ IEGCDAVSHG
ATGKGNDQVR FELAYYHFNP GITVVAPWRE WKLNSRQALI NYAKRNDIPI PITKKRPWSS
DRNLLHISFE GGILEDTWLE PPENMFVLTK PPEKAPNKPQ YVEIEFEKGN AVAVDGVRMS
PAELLAHLNT IGGEHGIGRV DLLENRSVGM KSRGVYETPG GTILREAHMA VEQITMDREV
MHLRDSLIPR YAEMIYNGYW FSPEREMMQC MIDESQKTVN GVARLKLYKG HCRTVGRKSE
SDSLFNLDFA TFEKDQVYNQ ADAEGFIKLN SLRLRIRSLM LANKNK