Gene SbBS512_E4446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4446 
SymbolargH 
ID6271248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4157197 
End bp4158570 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content57% 
IMG OID641728241 
Productargininosuccinate lyase 
Protein accessionYP_001882654 
Protein GI187731864 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACTTT GGGGCGGGCG TTTTACCCAG GCAGCAGATC AACGGTTCAA ACAATTCAAC 
GACTCACTGC GCTTTGATTA CCGTCTGGCG GAGCAGGATA TTGTTGGCTC TGTGGCCTGG
TCCAAAGCCC TAGTAACGGT CGGCGTGTTA ACCGCAGAAG AGCAGGCGCA ACTGGAAGAG
GCGCTGAACG TGTTGCTGGA AGATGTTCGC GCCAGGCCAC AACAAATCCT TGAAAGCGAC
GCCGAAGATA TCCATAGCTG GGTGGAAGGC AAACTGATCG ACAAAGTGGG CCAGTTAGGC
AAAAAGCTGC ATACCGGGCG TAGCCGTAAT GATCAGGTAG CGACTGACCT GAAACTGTGG
TGCAAAGATA CCGTTAGCGA GTTACTGACG GCTAACCGGC AGCTGCAATC GGCGCTGGTG
GAAACCGCAC AAAACAATCA GGACGCGGTA ATGCCAGGTT ACACTCACCT GCAACGCGCC
CAGCCGGTGA CGTTCGCGCA CTGGTGCCTG GCCTATGTTG AGATGCTGGC GCGTGATGAA
AGCCGTTTGC AGGATGCGCT TAAGCGTCTG GATGTCAGCC CATTAGGCTG TGGCGCGCTG
GCGGGAACGG CCTATGAAAT CGACCGTGAA CAGTTAGCAG GCTGGCTGGG CTTTGCTTCG
GCGACCCGTA ACAGTCTGGA CAGCGTTTCT GACCGTGACC ATGTGTTGGA ACTGCTGTCG
GCTGCCGCTA TCGGCATGGT GCATCTGTCG CGTTTTGCTG AAGATCTGAT TTTCTTTAAC
ACCGGCGAAG CGGGGTTTGT GGAGCTTTCT GACCGCGTGA CTTCCGGTTC ATCATTAATG
CCGCAGAAGA AAAACCCGGA TGCGCTGGAG CTGATTCGCG GTAAATGCGG TCGGGTGCAG
GGTGCGTTAA CCGGCATGAT GATGACGCTG AAAGGTTTGC CGCTGGCTTA CAACAAAGAT
ATGCAGGAAG ACAAAGAAGG TCTGTTCGAC GCGCTCGATA CCTGGCTGGA CTGCCTGCAT
ATGGCGGCGC TGGTGCTGGA CGGCATTCAG GTGAAACGTC CACGTTGCCA GGAAGCGGCT
CAGCAGGGTT ACGCCAACGC CACCGAACTG GCGGATTACC TGGTGGCGAA AGGCGTACCT
TTCCGCGAGG CGCACCATAT CGTGGGTGAA GCGGTGGTGG AAGCCATTCG TCAGGGCAAA
CCGCTGGAAG ATCTGCCGCT CGACGAGTTG CAGAAATTCA GCCACGTGAT TGGCGAAGAT
GTCTATCCGA TTCTGTCGCT ACAATCGTGC CTCGACAAGC GTGCGGCAAA AGGCGGTGTC
TCACCGCAGC AGGTGGCCCA AGCGATTGCT TTTGCGCAGG CGCGGTTGGG GTAA
 
Protein sequence
MALWGGRFTQ AADQRFKQFN DSLRFDYRLA EQDIVGSVAW SKALVTVGVL TAEEQAQLEE 
ALNVLLEDVR ARPQQILESD AEDIHSWVEG KLIDKVGQLG KKLHTGRSRN DQVATDLKLW
CKDTVSELLT ANRQLQSALV ETAQNNQDAV MPGYTHLQRA QPVTFAHWCL AYVEMLARDE
SRLQDALKRL DVSPLGCGAL AGTAYEIDRE QLAGWLGFAS ATRNSLDSVS DRDHVLELLS
AAAIGMVHLS RFAEDLIFFN TGEAGFVELS DRVTSGSSLM PQKKNPDALE LIRGKCGRVQ
GALTGMMMTL KGLPLAYNKD MQEDKEGLFD ALDTWLDCLH MAALVLDGIQ VKRPRCQEAA
QQGYANATEL ADYLVAKGVP FREAHHIVGE AVVEAIRQGK PLEDLPLDEL QKFSHVIGED
VYPILSLQSC LDKRAAKGGV SPQQVAQAIA FAQARLG