Gene ECD_03845 details

Gene Information

Locus tag: ECD_03845
Symbol: argH
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 4065151
End bp: 4066524
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: argininosuccinate lyase
Protein accession: ACT45638
Protein GI: 253979968
COG category:
COG ID:
TIGRFAM ID:
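
The coordinates and lengths in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4065151, 4066524)
print(length, protein_length(length))  # 1374 457
```

Both values match the Gene Length and Protein Length fields of the record.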


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACTTT GGGGCGGGCG TTTTACCCAG GCAGCAGATC AACGGTTCAA ACAATTCAAC 
GACTCACTGC GCTTTGATTA CCGTCTGGCG GAGCAGGATA TTGTTGGCTC TGTGGCCTGG
TCCAAAGCCC TGGTCACGGT AGGCGTGTTA ACCGCAGAAG AGCAGGCGCA ACTGGAAGAG
GCGCTGAACG TGTTGCTGGA AGATGTTCGC GCCAGGCCAC AACAAATCCT TGAAAGCGAC
GCCGAAGATA TCCATAGCTG GGTGGAAGGC AAACTGATCG ACAAAGTGGG CCAGTTAGGC
AAAAAGCTGC ATACCGGGCG TAGCCGTAAT GATCAGGTAG CGACTGACCT GAAACTGTGG
TGCAAAGATA CCGTTAGCGA GTTACTGACG GCTAACCGGC AGCTGCAATC GGCGCTGGTG
GAAACCGCAC AAAACAATCA GGACGCGGTA ATGCCAGGTT ACACTCACCT GCAACGCGCC
CAGCCGGTGA CGTTCGCGCA CTGGTGCCTG GCCTATGTTG AGATGCTGGC GCGTGATGAA
AGCCGTTTGC AGGATGCGCT TAAGCGTCTG GATGTCAGCC CGCTAGGCTG TGGCGCGCTG
GCGGGAACGG CCTATGAAAT CGACCGTGAA CAGTTAGCAG GCTGGCTGGG CTTTGCTTCG
GCGACCCGTA ACAGTCTCGA CAGCGTTTCT GACCGTGACC ATGTGTTGGA ACTGCTTTCT
GCTGCCGCTA TCGGCATGGT GCATCTGTCG CGTTTTGCTG AAGATCTGAT TTTCTTTAAC
ACCGGCGAAG CGGGGTTTGT GGAGCTTTCT GACCGCGTGA CTTCCGGTTC ATCATTAATG
CCGCAGAAGA AAAACCCGGA TGCGCTGGAG CTGATTCGCG GTAAATGCGG CCGGGTGCAG
GGGGCGTTAA CCGGCATGAT GATGACGCTG AAAGGTTTGC CGCTGGCTTA CAACAAAGAT
ATGCAGGAAG ACAAAGAAGG TCTGTTCGAC GCGCTCGATA CCTGGCTGGA CTGCCTGCAT
ATGGCGGCGC TGGTGCTGGA CGGCATTCAG GTGAAACGTC CACGTTGCCA GGAAGCGGCT
CAGCAGGGTT ACGCCAACGC CACCGAACTG GCGGATTATC TGGTGGCGAA AGGCGTACCG
TTCCGCGAGG CGCACCATAT TGTTGGTGAA GCGGTGGTGG AAGCCATTCG TCAGGGCAAA
CCGCTGGAAG ATCTGCCGCT CAGTGAGTTG CAGAAATTCA GTCAGGTGAT TGACGAAGAT
GTCTATCCGA TTCTGTCGCT GCAATCGTGC CTCGACAAGC GTGCGGCAAA AGGCGGCGTC
TCACCGCAGC AGGTGGCGCA GGCGATTGCT TTTGCGCAGG CTCGGTTAGG GTAA
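
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be joined before any computation such as the GC content reported in the record. The following is a rough sketch of that cleanup and a GC calculation. The helper names are hypothetical, and the excerpt is only the first line of the sequence, so its GC fraction is not expected to equal the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a short excerpt.
excerpt = clean_sequence(
    "ATGGCACTTT GGGGCGGGCG TTTTACCCAG GCAGCAGATC AACGGTTCAA ACAATTCAAC"
)
print(len(excerpt), round(gc_fraction(excerpt), 2))
```

Passing the full sequence text to `clean_sequence` and then to `gc_fraction` should give a value comparable to the GC content field, up to rounding.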
 
Protein sequence
MALWGGRFTQ AADQRFKQFN DSLRFDYRLA EQDIVGSVAW SKALVTVGVL TAEEQAQLEE 
ALNVLLEDVR ARPQQILESD AEDIHSWVEG KLIDKVGQLG KKLHTGRSRN DQVATDLKLW
CKDTVSELLT ANRQLQSALV ETAQNNQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLARDE
SRLQDALKRL DVSPLGCGAL AGTAYEIDRE QLAGWLGFAS ATRNSLDSVS DRDHVLELLS
AAAIGMVHLS RFAEDLIFFN TGEAGFVELS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ
GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRPRCQEAA
QQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK PLEDLPLSEL QKFSQVIDED
VYPILSLQSC LDKRAAKGGV SPQQVAQAIA FAQARLG
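
The protein sequence is the translation of the gene sequence under translation table 11. As a hedged illustration, the sketch below builds a codon table from the standard codon assignments, which table 11 shares for elongation codons, and translates the first 30 bases of the gene. It does not handle the alternative start codons that table 11 allows, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, with spaces removed.
print(translate("ATGGCACTTTGGGGCGGGCGTTTTACCCAG"))  # MALWGGRFTQ
```

The output matches the first block of the protein sequence above.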