Gene EcSMS35_3468 details

Gene Information

Locus tag: EcSMS35_3468
Symbol: argG
ID: 6143631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3545393
End bp: 3546736
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 53%
IMG OID: 641618297
Product: argininosuccinate synthase
Protein accession: YP_001745445
Protein GI: 170681371
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0137] Argininosuccinate synthase
TIGRFAM ID: [TIGR00032] argininosuccinate synthase
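
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. This is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3545393
end_bp = 3546736

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1344
print(protein_length)  # 447
```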


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.202724
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.808331
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGA TTCTCAAGCA TCTCCCGGTA GGTCAACGTA TTGGTATCGC TTTTTCCGGC 
GGTCTGGACA CCAGTGCCGC ACTGCTGTGG ATGCGACAAA AGGGAGCGGT TCCTTATGCA
TATACTGCAA ACCTGGGCCA GCCAGACGAA GAGGATTATG ATGCGATCCC TCGTCGTGCC
ATGGAATACG GCGCGGAGAA CGCACGTCTG ATCGACTGCC GCAAACAACT GGTGGCCGAA
GGTATTGCCG CTATTCAGTG TGGCGCATTT CATAACACCA CCGGCGGCCT GACCTATTTC
AACACGACGC CGCTGGGCCG CGCCGTGACC GGCACCATGC TGGTTGCTGC TATGAAAGAA
GATGGCGTGA ATATCTGGGG TGACGGCAGC ACCTATAAAG GAAACGATAT CGAACGTTTC
TACCGTTACG GTCTGCTGAC CAATGCTGAA CTGCAGATTT ACAAACCGTG GCTTGATACT
GACTTTATTG ATGAACTGGG CGGCCGTCAT GAGATGTCTG AATTTATGAT TGCCTGCGGT
TTCGACTACA AAATGTCTGT CGAAAAAGCC TACTCCACGG ACTCCAACAT GCTTGGTGCA
ACGCATGAAG CGAAGGATCT GGAATACCTC AACTCCAGCG TCAAAATCGT CAATCCAATT
ATGGGCGTGA AGTTTTGGGA TGAGAGCGTG AAAATCCCGG CAGAAGAAGT CACAGTACGC
TTTGAGCAAG GTCATCCGGT GGCGCTGAAC GGTAAAACCT TTAGCGACGA CGTAGAAATG
ATGCTGGAAG CTAACCGCAT CGGCGGTCGT CACGGCCTGG GCATGAGCGA CCAGATTGAA
AACCGTATCA TCGAAGCAAA AAGCCGCGGC ATTTATGAAG CCCCTGGGAT GGCACTGCTG
CACATTGCGT ATGAACGTCT CTTGACCGGT ATTCACAACG AAGACACCAT TGAGCAGTAT
CACGCGCATG GTCGTCAGTT GGGCCGTCTG CTGTACCAGG GACGTTGGTT TGACTCCCAG
GCGCTGATGC TGCGGGACTC TCTGCAACGC TGGGTTGCCA GCCAGATCAC TGGTGAGGTT
ACCCTGGAGC TGCGCCGTGG GAACGATTAT TCAATCCTGA ATACCGTCTC AGAGAACCTG
ACCTACAAGC CAGAGCGTCT GACGATGGAA AAAGGCGACT CGGTGTTCTC GCCAGATGAT
CGTATCGGTC AACTGACCAT GCGCAACCTG GATATCACCG ATACCCGCGA GAAACTTTTC
GGTTATGCCA AAACTGGCCT CCTCTCCTCC TCTGCCGCTT CGGGCGTGCC GCAGGTGGAG
AATCTGGAAA ATAAAGGCCA GTAA
 
Protein sequence
MTTILKHLPV GQRIGIAFSG GLDTSAALLW MRQKGAVPYA YTANLGQPDE EDYDAIPRRA 
MEYGAENARL IDCRKQLVAE GIAAIQCGAF HNTTGGLTYF NTTPLGRAVT GTMLVAAMKE
DGVNIWGDGS TYKGNDIERF YRYGLLTNAE LQIYKPWLDT DFIDELGGRH EMSEFMIACG
FDYKMSVEKA YSTDSNMLGA THEAKDLEYL NSSVKIVNPI MGVKFWDESV KIPAEEVTVR
FEQGHPVALN GKTFSDDVEM MLEANRIGGR HGLGMSDQIE NRIIEAKSRG IYEAPGMALL
HIAYERLLTG IHNEDTIEQY HAHGRQLGRL LYQGRWFDSQ ALMLRDSLQR WVASQITGEV
TLELRRGNDY SILNTVSENL TYKPERLTME KGDSVFSPDD RIGQLTMRNL DITDTREKLF
GYAKTGLLSS SAASGVPQVE NLENKGQ
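
The gene and protein sequences above are printed in blocks of ten characters, so the spaces and line breaks are layout only. The following is a minimal sketch, not the database's own tooling, of how one might strip that layout, compute a GC fraction, and translate the coding sequence. It assumes the amino-acid assignments of translation table 11 match the standard code (the two tables differ in which codons may act as starts, which this sketch ignores); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Compact codon table: codons enumerated with bases in the order T, C, A, G.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, as printed in the record.
first_blocks = "ATGACGACGA TTCTCAAGCA TCTCCCGGTA"
print(translate(first_blocks))  # MTTILKHLPV
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence listed in the record, and `gc_fraction` on the same input can be compared with the reported GC content.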