Gene EcHS_A4194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4194 
SymbolargH 
ID5594502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4186516 
End bp4187889 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content56% 
IMG OID640923296 
Productargininosuccinate lyase 
Protein accessionYP_001460755 
Protein GI157163437 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACTTT GGGGCGGGCG TTTTACCCAG GCAGCAGATC AACGGTTCAA ACAATTCAAC 
GACTCACTGC GCTTTGATTA CCGTCTGGCG GAGCAGGATA TTGTTGGCTC TGTGGCCTGG
TCCAAAGCCC TGGTCACGGT AGGCGTGTTA ACCGCAGAAG AGCAGGCGCA ACTGGAAGAG
GCGCTGAACG TGTTGCTGGA AGATGTTCGC GCCAGGCCAC AACAAATCCT TGAAAGCGAC
GCCGAAGATA TCCATAGCTG GGTGGAAGGC AAACTGATCG ACAAAGTGGG CCAGTTAGGC
AAAAAGCTGC ATACCGGGCG TAGCCGTAAT GATCAGGTAG CGACTGACCT GAAACTGTGG
TGCAAAGATA CCGTTAGCGA GTTACTGACG GCTAACCGGC AGCTGCAATC GGCGCTGGTG
GAAACCGCAC AAAACAATCA GGACGCGGTA ATGCCAGGTT ACACTCACCT GCAACGCGCC
CAGCCGGTGA CGTTCGCGCA CTGGTGCCTG GCCTATGTTG AGATGCTGGC GCGTGATGAA
AGCCGTTTGC AGGATGCGCT TAAGCGTCTG GATGTCAGCC CGCTAGGCTG TGGCGCGCTG
GCGGGAACGG CCTATGAAAT CGACCGTGAA CAGTTAGCAG GCTGGCTGGG CTTTGCTTCG
GCGACCCGTA ACAGTCTCGA CAGCGTTTCT GACCGTGACC ATGTGTTGGA ACTGCTTTCT
GCTGCCGCTA TCGGCATGGT GCATCTGTCG CGTTTTGCTG AAGATCTGAT TTTCTTTAAC
ACCGGCGAAG CGGGGTTTGT GGAGCTTTCT GACCGCGTGA CTTCCGGTTC ATCATTAATG
CCGCAGAAGA AAAACCCGGA TGCGCTGGAG CTGATTCGCG GTAAATGCGG TCGGGTGCAG
GGGGCGTTAA CCGGCATGAT GATGACGCTG AAAGGTTTGC CGCTGGCTTA CAACAAAGAT
ATGCAGGAAG ACAAAGAAGG TCTGTTCGAC GCGCTCGATA CCTGGCTGGA CTGCCTGCAT
ATGGCGGCGC TGGTGCTGGA CGGCATTCAG GTGAAACGTC CACGTTGCCA GGAAGCGGCT
CAGCAGGGTT ACGCCAACGC CACCGAACTG GCGGATTATC TGGTGGCGAA AGGCGTACCG
TTCCGCGAGG CGCACCATAT TGTTGGTGAA GCGGTGGTGG AAGCCATTCG TCAGGGCAAA
CCGCTGGAAG ATCTGCCGCT CAGTGAGTTG CAGAAATTCA GTCAGGTGAT TGACGAAGAT
GTCTATCCGA TTCTGTCGCT GCAATCGTGC CTCGACAAGC GTGCGGCAAA AGGCGGCGTC
TCACCGCAGC AGGTGGCGCA GGCGATTGCT TTTGCGCAGG CTCGGTTGGG GTAA
 
Protein sequence
MALWGGRFTQ AADQRFKQFN DSLRFDYRLA EQDIVGSVAW SKALVTVGVL TAEEQAQLEE 
ALNVLLEDVR ARPQQILESD AEDIHSWVEG KLIDKVGQLG KKLHTGRSRN DQVATDLKLW
CKDTVSELLT ANRQLQSALV ETAQNNQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLARDE
SRLQDALKRL DVSPLGCGAL AGTAYEIDRE QLAGWLGFAS ATRNSLDSVS DRDHVLELLS
AAAIGMVHLS RFAEDLIFFN TGEAGFVELS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ
GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRPRCQEAA
QQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK PLEDLPLSEL QKFSQVIDED
VYPILSLQSC LDKRAAKGGV SPQQVAQAIA FAQARLG