Gene ECH74115_4494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4494 
SymbolargG 
ID6971335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4162902 
End bp4164245 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content53% 
IMG OID643388207 
Productargininosuccinate synthase 
Protein accessionYP_002272643 
Protein GI209396632 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.976918 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGA TTCTCAAGCA TCTCCCGGTA GGTCAACGTA TTGGTATCGC TTTTTCCGGC 
GGTCTGGACA CCAGTGCCGC ACTGCTGTGG ATGCGACAAA AGGGAGCGGT TCCTTATGCA
TATACTGCAA ACCTGGGCCA GCCAGACGAA GAGGATTATG ATGCGATCCC TCGTCGTGCC
ATGGAATACG GCGCGGAGAA CGCACGTCTG ATCGACTGCC GCAAACAACT GGTGGCCGAA
GGTATTGCCG CTATTCAGTG TGGCGCATTT CATAACACCA CCGGCGGCCT GACCTATTTC
AACACGACGC CGCTGGGCCG CGCCGTGACT GGTACCATGC TGGTTGCTGC GATGAAAGAA
GATGGCGTGA ATATCTGGGG TGACGGTAGC ACCTATAAAG GAAACGATAT CGAACGTTTC
TATCGTTATG GTCTGCTGAC CAATGCTGAA CTGCAGATTT ACAAACCGTG GCTTGATACT
GACTTTATTG ATGAACTGGG CGGTCGTCAT GAAATGTCTG AATTTATGAT TGCCTGCGGT
TTCGACTACA AAATGTCTGT CGAAAAAGCT TACTCCACGG ACTCCAACAT GCTTGGTGCA
ACGCATGAAG CGAAGGATCT GGAATACCTC AACTCCAGCG TCAAAATCGT CAACCCGATT
ATGGGCGTGA AATTCTGGGA TGAGAGCGTG AAGATCCCGG CAGAAGAAGT CACAGTACGC
TTTGAGCAAG GTCATCCGGT GGCGTTGAAC GGTAAAACCT TTAGCGACGA CGTAGAAATG
ATGCTGGAAG CTAACCGCAT CGGCGGTCGT CACGGCCTGG GCATGAGCGA CCAGATTGAA
AACCGTATCA TCGAAGCGAA AAGCCGTGGT ATTTACGAAG CTCCGGGGAT GGCACTGCTG
CACATTGCGT ATGAACGCCT GTTGACCGGT ATTCACAACG AAGACACCAT TGAGCAGTAT
CACGCGCATG GCCGTCAGTT GGGCCGTCTG CTGTACCAGG GGCGTTGGTT TGATTCCCAG
GCGCTGATGC TGCGTGACTC TCTGCAACGC TGGGTTGCCA GCCAGATCAC TGGTGAAGTT
ACCCTGGAGC TGCGCCGTGG GAACGATTAT TCAATCCTGA ATACCGTCTC AGAGAACCTG
ACCTACAAGC CAGAGCGTCT GACGATGGAA AAAGGCGACT CGGTGTTCTC GCCAGATGAT
CGTATCGGTC AACTGACCAT GCGCAACCTG GATATCACCG ATACCCGCGA GAAACTTTTC
GGTTATGCCA AAACTGGCCT GCTTTCCTCC TCTGCCGCTT CAGGCGTGCC GCAGATGGAG
AATCTGGAAA ACAAAGGCCA GTAA
 
Protein sequence
MTTILKHLPV GQRIGIAFSG GLDTSAALLW MRQKGAVPYA YTANLGQPDE EDYDAIPRRA 
MEYGAENARL IDCRKQLVAE GIAAIQCGAF HNTTGGLTYF NTTPLGRAVT GTMLVAAMKE
DGVNIWGDGS TYKGNDIERF YRYGLLTNAE LQIYKPWLDT DFIDELGGRH EMSEFMIACG
FDYKMSVEKA YSTDSNMLGA THEAKDLEYL NSSVKIVNPI MGVKFWDESV KIPAEEVTVR
FEQGHPVALN GKTFSDDVEM MLEANRIGGR HGLGMSDQIE NRIIEAKSRG IYEAPGMALL
HIAYERLLTG IHNEDTIEQY HAHGRQLGRL LYQGRWFDSQ ALMLRDSLQR WVASQITGEV
TLELRRGNDY SILNTVSENL TYKPERLTME KGDSVFSPDD RIGQLTMRNL DITDTREKLF
GYAKTGLLSS SAASGVPQME NLENKGQ