Gene EcolC_4056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4056 
Symbol 
ID6065213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4473851 
End bp4475224 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content56% 
IMG OID641603479 
Productargininosuccinate lyase 
Protein accessionYP_001726982 
Protein GI170022028 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.52602 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTTT GGGGCGGGCG TTTTACCCAG GCAGCAGATC AACGGTTCAA ACAATTCAAC 
GACTCACTGC GCTTTGATTA CCGTCTGGCG GAGCAGGATA TTGTTGGCTC TGTGGCCTGG
TCCAAAGCCC TGGTCACGGT AGGCGTGTTA ACCGCAGAAG AGCAGGCGCA ACTGGAAGAG
GCGCTGAACG TGTTGCTGGA AGATGTTCGC GCCAGGCCAC AACAAATCCT TGAAAGCGAC
GCCGAAGATA TCCATAGCTG GGTGGAAGGC AAACTGATCG ACAAAGTGGG CCAGTTAGGC
AAAAAGCTGC ATACCGGGCG TAGCCGTAAT GATCAGGTAG CGACTGACCT GAAACTGTGG
TGCAAAGATA CCGTTAGCGA GTTACTGACG GCTAACCGGC AGCTGCAATC GGCGCTGGTG
GAAACCGCAC AAAACAATCA GGACGCGGTA ATGCCAGGTT ACACTCACCT GCAACGCGCC
CAGCCGGTGA CGTTCGCGCA CTGGTGCCTG GCCTATGTTG AGATGCTGGC GCGTGATGAA
AGCCGTTTGC AGGATGCGCT TAAGCGTCTG GATGTCAGCC CGCTAGGCTG TGGCGCGCTG
GCGGGAACGG CCTATGAAAT CGACCGTGAA CAGTTAGCAG GCTGGCTGGG CTTTGCTTCG
GCGACCCGTA ACAGTCTCGA CAGCGTTTCT GACCGTGACC ATGTGTTGGA ACTGCTTTCT
GCTGCCGCTA TCGGCATGGT GCATCTGTCG CGTTTTGCTG AAGATCTGAT TTTCTTTAAC
ACCGGCGAAG CGGGGTTTGT GGAGCTTTCT GACCGCGTGA CTTCCGGTTC ATCATTAATG
CCGCAGAAGA AAAACCCGGA TGCGCTGGAG CTGATTCGCG GTAAATGCGG TCGGGTGCAG
GGGGCGTTAA CCGGCATGAT GATGACGCTG AAAGGTTTGC CGCTGGCTTA CAACAAAGAT
ATGCAGGAAG ACAAAGAAGG TCTGTTCGAC GCGCTCGATA CCTGGCTGGA CTGCCTGCAT
ATGGCGGCGC TGGTGCTGGA CGGCATTCAG GTGAAACGTC CACGTTGCCA GGAAGCGGCT
CAGCAGGGTT ACGCCAACGC CACCGAACTG GCGGATTATC TGGTGGCGAA AGGCGTACCG
TTCCGCGAGG CGCACCATAT TGTTGGTGAA GCGGTGGTGG AAGCCATTCG TCAGGGCAAA
CCGCTGGAAG ATCTGCCGCT CAGTGAGTTG CAGAAATTCA GTCAGGTGAT TGACGAAGAT
GTCTATCCGA TTCTGTCGCT GCAATCGTGC CTCGACAAGC GTGCGGCAAA AGGCGGCGTC
TCACCGCAGC AGGTGGCGCA GGCGATTGCT TTTGCGCAGG CTCGGTTGGG GTAA
 
Protein sequence
MALWGGRFTQ AADQRFKQFN DSLRFDYRLA EQDIVGSVAW SKALVTVGVL TAEEQAQLEE 
ALNVLLEDVR ARPQQILESD AEDIHSWVEG KLIDKVGQLG KKLHTGRSRN DQVATDLKLW
CKDTVSELLT ANRQLQSALV ETAQNNQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLARDE
SRLQDALKRL DVSPLGCGAL AGTAYEIDRE QLAGWLGFAS ATRNSLDSVS DRDHVLELLS
AAAIGMVHLS RFAEDLIFFN TGEAGFVELS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ
GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRPRCQEAA
QQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK PLEDLPLSEL QKFSQVIDED
VYPILSLQSC LDKRAAKGGV SPQQVAQAIA FAQARLG