Gene SNSL254_A1333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1333 
SymbolpurB 
ID6484063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1321268 
End bp1322638 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content52% 
IMG OID642736731 
Productadenylosuccinate lyase 
Protein accessionYP_002040488 
Protein GI194443864 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0015] Adenylosuccinate lyase 
TIGRFAM ID[TIGR00928] adenylosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.887926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value0.184863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTAT CCTCACTGAC CGCCGTTTCC CCTGTCGATG GACGCTACGG CGATAAAGTC 
AGCGCGCTGC GCGGAATTTT TAGCGAATAC GGTTTGCTGA AATTTCGTGT ACAAGTCGAA
GTACGTTGGC TGCAGAAATT AGCCGCGCAC GCAGCGATCA AGGAAGTTCC TGCTTTTGCT
GCCGACGCAA ACGGTTACCT GGATACGCTT GTGGCAAACT TCAATGAAGA AGATGCCGCG
CGCATTAAAA CCATTGAGCG CACCACGAAT CACGATGTGA AAGCGGTTGA GTATTTTCTG
AAAGAAAAAG TCGCCGCGAT CCCGGCGCTA CATGACGTTT CCGAATTTAT CCACTTTGCC
TGCACTTCTG AGGACATTAA CAACCTGTCG CACGCGTTAA TGCTCAAAAC CGCGCGCGAT
GAAGTGATCC TGCCTTACTG GCGTCAGGTG ATTAACGCGG TTAAAGATCT CGCCACGCAG
TATCGCGACA TTCCTCTGCT CTCCCGCACC CACGGCCAGC CGGCAACGCC TTCCACTCTG
GGTAAAGAGA TGGCGAACGT GGCGTATCGG ATGGAGCGTC AGTTCCGCCA GCTCAACCAG
GTGGAGATCC TCGGTAAAAT TAACGGCGCC GTAGGCAACT ATAACGCGCA TATCGCCGCC
TATCCGGAAG TTGACTGGCA TCAGTTCAGC GAAGAGTTCG TCACCTCGCT GGGCATCCAG
TGGAACCCTT ACACCACCCA GATTGAACCA CATGATTATA TTGCGGAACT GTTTGACTGT
ATCGCGCGCT TTAACACCAT CCTGATCGAT TTCGATCGCG ATGTCTGGGG CTATATTGCG
TTGAACCATT TCAAACAGAA AACCATCGCC GGGGAGATCG GTTCTTCTAC TATGCCGCAT
AAAGTTAACC CCATTGACTT TGAAAACTCA GAAGGCAACC TCGGTCTGTC TAATGCAGTG
TTGCACCATC TGGCAAACAA ACTGCCGGTT TCCCGCTGGC AGCGCGATCT GACCGACTCA
ACCGTCCTGC GTAACCTGGG TGTCGGCATC GGCTATGCGC TTATCGCTTA TCAGTCCACC
CTGAAGGGCG TCAGCAAGCT GGAAGTAAAC CGCGATCATC TGCTTGACGA ACTGGATCAC
AACTGGGAAG TATTAGCCGA ACCGATCCAG ACCGTCATGC GCCGCTATGG TATTGAAAAA
CCATATGAAA AACTGAAAGA GTTGACCCGT GGCAAGCGTG TCGATGCCGA AGGAATGAAA
CAGTTTATTG ATAGTCTGGC CCTGCCGGAA GCAGAAAAAA CGCGCCTTAA AGCCATGACG
CCGGCAAATT ATATCGGTCG CGCTGTGACT CTGGTCGACG AACTTAAATA A
 
Protein sequence
MELSSLTAVS PVDGRYGDKV SALRGIFSEY GLLKFRVQVE VRWLQKLAAH AAIKEVPAFA 
ADANGYLDTL VANFNEEDAA RIKTIERTTN HDVKAVEYFL KEKVAAIPAL HDVSEFIHFA
CTSEDINNLS HALMLKTARD EVILPYWRQV INAVKDLATQ YRDIPLLSRT HGQPATPSTL
GKEMANVAYR MERQFRQLNQ VEILGKINGA VGNYNAHIAA YPEVDWHQFS EEFVTSLGIQ
WNPYTTQIEP HDYIAELFDC IARFNTILID FDRDVWGYIA LNHFKQKTIA GEIGSSTMPH
KVNPIDFENS EGNLGLSNAV LHHLANKLPV SRWQRDLTDS TVLRNLGVGI GYALIAYQST
LKGVSKLEVN RDHLLDELDH NWEVLAEPIQ TVMRRYGIEK PYEKLKELTR GKRVDAEGMK
QFIDSLALPE AEKTRLKAMT PANYIGRAVT LVDELK