Gene EcE24377A_4499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4499 
SymbolargH 
ID5587646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4490916 
End bp4492289 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content57% 
IMG OID640928113 
Productargininosuccinate lyase 
Protein accessionYP_001465457 
Protein GI157156952 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.512894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACTTT GGGGCGGGCG TTTTACCCAG GCAGCAGATC AACGGTTCAA ACAATTCAAC 
GACTCACTGC GCTTTGATTA CCGTCTGGCG GAGCAGGATA TTGTTGGCTC TGTGGCCTGG
TCCAAAGCCC TGGTCACGGT AGGCGTGTTA ACCGCAGAAG AGCAGGCGCA ACTGGAAGAG
GCGCTGAACG TGCTGCTGGA AGATGTTCGC GCCAGGCCAC AACAAATCCT TGAAAGCGAC
GCCGAAGATA TCCATAGCTG GGTGGAAGGC AAACTGATCG ACAAAGTGGG CCAGTTAGGC
AAAAAGCTGC ATACCGGGCG TAGTCGTAAC GATCAGGTAG CGACTGACCT GAAACTGTGG
TGCAAAGATA CCGTTAGCGA GTTACTGACG GCTAACCGAC AGCTGCAATC GGCGCTGGTG
GAAACCGCAC AAAACAATCA GGACGCGGTC ATGCCAGGTT ACACTCACCT GCAACGCGCC
CAGCCGGTGA CGTTCGCGCA CTGGTGCCTG GCCTATGTTG AGATGCTGGC GCGTGATGAA
AGCCGTTTGC AGGATGCGCT TAAGCGTCTG GATGTCAGCC CATTAGGCTG TGGCGCGCTG
GCGGGAACGG CCTATGAAAT CGACCGTGAA CAGTTAGCAG GCTGGCTGGG CTTTGCTTCG
GCGACCCGTA ACAGTCTGGA CAGCGTTTCT GACCGTGACC ATGTGTTGGA ACTGCTGTCG
GCTGCCGCTA TCGGCATGGT GCATCTGTCG CGTTTTGCTG AAGATCTGAT TTTCTTTAAC
ACCGGCGAAG CGGGGTTTGT GGAGCTTTCT GACCGCGTGA CTTCCGGTTC ATCATTAATG
CCGCAGAAGA AAAACCCGGA TGCGCTGGAG CTGATTCGCG GTAAATGCGG TCGGGTGCAG
GGGGCGTTAA CCGGCATGAT GATGACGCTG AAAGGTTTGC CGCTGGCTTA CAACAAAGAT
ATGCAGGAAG ACAAAGAAGG TCTGTTCGAC GCGCTCGATA CCTGGCTGGA CTGCCTGCAT
ATGGCGGCGC TGGTGCTGGA CGGCATTCAG GTGAAACGTC CACGTTGCCA GGAAGCGGCT
CAGCAGGGTT ACGCCAACGC CACCGAACTG GCGGATTATC TGGTGGCGAA AGGCGTACCG
TTCCGCGAGG CGCACCATAT TGTTGGTGAA GCGGTGGTGG AAGCCATTCG TCAGGGCAAA
CCGCTGGAAG ATCTGCCGCT CGACGAGTTG CAGAAATTCA GCCCGGTGAT TGACGAAGAT
GTCTATCCGA TTCTGTCGCT GCAATCGTGC CTCGACAAGC GTGCGGCAAA AGGCGGCGTC
TCACCGCAGC AGGTGGCGCA GGCAATTGCT TTTGCGCAGG CGCGGTTAGA GTAA
 
Protein sequence
MALWGGRFTQ AADQRFKQFN DSLRFDYRLA EQDIVGSVAW SKALVTVGVL TAEEQAQLEE 
ALNVLLEDVR ARPQQILESD AEDIHSWVEG KLIDKVGQLG KKLHTGRSRN DQVATDLKLW
CKDTVSELLT ANRQLQSALV ETAQNNQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLARDE
SRLQDALKRL DVSPLGCGAL AGTAYEIDRE QLAGWLGFAS ATRNSLDSVS DRDHVLELLS
AAAIGMVHLS RFAEDLIFFN TGEAGFVELS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ
GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRPRCQEAA
QQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK PLEDLPLDEL QKFSPVIDED
VYPILSLQSC LDKRAAKGGV SPQQVAQAIA FAQARLE