Gene SeAg_B1952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B1952 
SymbolpurB 
ID6797012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp1893285 
End bp1894655 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content52% 
IMG OID642776178 
Productadenylosuccinate lyase 
Protein accessionYP_002146809 
Protein GI197247372 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0015] Adenylosuccinate lyase 
TIGRFAM ID[TIGR00928] adenylosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTAT CCTCACTGAC CGCCGTTTCC CCTGTCGATG GACGCTACGG CGATAAAGTC 
AGCGCGCTGC GCGGAATTTT TAGCGAATAC GGTTTGCTGA AATTTCGTGT ACAAGTCGAA
GTACGTTGGC TGCAGAAATT AGCCGCGCAC GCAGCGATCA AGGAAGTTCC TGCTTTTGCT
GCCGACGCAA ACGGTTACCT GGATACGCTT GTGGCAAACT TCAATGAAGA AGATGCCGCG
CGCATTAAAA CCATTGAGCG TACGACTAAC CATGATGTGA AGGCAGTTGA GTATTTCCTG
AAAGAAAAAG TCGCCGCGAT CCCGGCGCTA CATGACGTTT CCGAATTTAT CCACTTTGCC
TGCACTTCTG AGGACATTAA CAACCTGTCG CACGCGTTAA TGCTCAAAAC CGCGCGCGAT
GAAGTGATCC TGCCTTACTG GCGTCAGGTG ATTAACGCGG TTAAAGATCT CGCCACGCAG
TATCGCGACA TTCCTCTGCT CTCCCGCACC CACGGCCAGC CGGCAACGCC TTCCACTCTG
GGTAAAGAGA TGGCGAACGT GGCGTATCGT ATGGAGCGTC AGTTCCGCCA GCTCAACCAG
GTGGAGATCC TCGGTAAAAT CAACGGCGCC GTAGGCAACT ATAACGCGCA TATCGCCGCC
TATCCGGAAG TTGACTGGCA TCAGTTCAGC GAAGAGTTCG TCACCTCGCT GGGCATCCAG
TGGAACCCTT ACACCACCCA GATTGAACCG CATGATTATA TTGCGGAACT GTTTGACTGT
ATCGCGCGCT TTAACACCAT CCTGATCGAT TTCGATCGCG ATGTCTGGGG CTATATTGCG
TTGAACCATT TCAAACAGAA AACCATCGCC GGGGAGATCG GTTCTTCTAC CATGCCGCAT
AAAGTTAACC CCATTGACTT TGAAAACTCA GAAGGCAACC TCGGTCTGTC TAATGCAGTG
TTGCACCATC TGGCAAACAA ACTGCCGGTT TCCCGCTGGC AGCGCGATCT GACCGACTCA
ACCGTCCTGC GTAACCTGGG TGTCGGCATC GGCTATGCGC TTATCGCTTA TCAGTCCACC
CTGAAGGGCG TCAGCAAGCT GGAAGTAAAC CGCGATCATC TGCTTGACGA ACTGGATCAC
AACTGGGAAG TATTAGCCGA GCCGATCCAG ACCGTCATGC GCCGCTATGG TATTGAAAAA
CCCTATGAAA AACTGAAAGA ATTGACCCGT GGCAAGCGTG TTGATGCCGA AGGAATGAAA
CAGTTTATTG ATAGTCTGGC CCTGCCGGAA GCAGAAAAAA CGCGCCTTAA AGCCATGACG
CCGGCAAATT ATATCGGTCG CGCTGTGACT CTGGTCGACG AACTTAAATA A
 
Protein sequence
MELSSLTAVS PVDGRYGDKV SALRGIFSEY GLLKFRVQVE VRWLQKLAAH AAIKEVPAFA 
ADANGYLDTL VANFNEEDAA RIKTIERTTN HDVKAVEYFL KEKVAAIPAL HDVSEFIHFA
CTSEDINNLS HALMLKTARD EVILPYWRQV INAVKDLATQ YRDIPLLSRT HGQPATPSTL
GKEMANVAYR MERQFRQLNQ VEILGKINGA VGNYNAHIAA YPEVDWHQFS EEFVTSLGIQ
WNPYTTQIEP HDYIAELFDC IARFNTILID FDRDVWGYIA LNHFKQKTIA GEIGSSTMPH
KVNPIDFENS EGNLGLSNAV LHHLANKLPV SRWQRDLTDS TVLRNLGVGI GYALIAYQST
LKGVSKLEVN RDHLLDELDH NWEVLAEPIQ TVMRRYGIEK PYEKLKELTR GKRVDAEGMK
QFIDSLALPE AEKTRLKAMT PANYIGRAVT LVDELK