Gene SeD_A2134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2134 
SymbolpurB 
ID6872059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2046144 
End bp2047514 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content52% 
IMG OID642785240 
Productadenylosuccinate lyase 
Protein accessionYP_002215903 
Protein GI198243498 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0015] Adenylosuccinate lyase 
TIGRFAM ID[TIGR00928] adenylosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.0323604 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTAT CCTCACTGAC CGCCGTTTCC CCTGTCGATG GACGCTACGG CGATAAAGTC 
AGCGCGCTGC GCGGAATTTT TAGCGAATAC GGTTTGCTGA AATTTCGTGT ACAAGTCGAA
GTACGTTGGC TGCAGAAATT AGCCGCGCAC GCAGCGATCA AGGAAGTTCC TGCTTTTGCT
GCCGACGCAA ACGGTTACCT GGATACGCTT GTTGCAAACT TCAATGAAGA AGATGCCGCG
CGCATTAAAA CCATTGAGCG CACCACGAAT CACGATGTGA AAGCGGTTGA GTATTTTCTG
AAAGAAAAAG TCGCCGCGAT CCCGGCGCTA CATGACGTTT CCGAATTTAT CCACTTTGCC
TGCACTTCTG AGGACATTAA CAACCTGTCG CACGCGTTAA TGCTCAAAAC CGCGCGCGAT
GAAGTGATCC TGCCTTACTG GCGTCAGGTG ATTAACGCGG TTAAAGATCT CGCCACGCAG
TATCGCGACA TTCCTCTGCT CTCCCGCACC CACGGCCAGC CGGCAACGCC TTCCACTCTG
GGTAAAGAGA TGGCGAACGT GGCGTATCGG ATGGAGCGTC AGTTCCGCCA GCTCAACCAG
GTGGAGATCC TCGGTAAAAT TAACGGCGCC GTAGGCAACT ATAACGCGCA TATCGCCGCC
TATCCGGAAG TTGACTGGCA TCAGTTCAGC GAAGAGTTCG TCACCTCGCT GGGCATCCAG
TGGAATCCTT ACACCACCCA GATTGAACCG CATGATTATA TTGCGGAACT GTTTGACTGT
ATCGCGCGCT TTAACACCAT CCTGATCGAT TTCGATCGCG ATGTCTGGGG CTATATTGCG
TTGAACCATT TCAAACAGAA AACCATCGCC GGGGAGATCG GTTCTTCTAC CATGCCGCAT
AAAGTTAACC CCATTGACTT TGAAAACTCA GAAGGCAACC TCGGTCTGTC TAACGCAGTG
TTGCACCATC TGGCAAACAA ACTGCCGGTT TCCCGCTGGC AGCGCGATCT GACCGACTCA
ACCGTCCTGC GTAACCTGGG TGTCGGCATC GGCTATGCGC TTATCGCTTA TCAGTCCACC
CTGAAGGGCG TCAGCAAGCT GGAAGTAAAC CGCGATCATC TGCTTGACGA ACTGGATCAC
AACTGGGAAG TATTAGCCGA ACCGATCCAG ACCGTCATGC GCCGCTATGG TATTGAAAAA
CCATATGAAA AACTGAAAGA GTTGACCCGT GGCAAGCGTG TTGATGCCGA AGGAATGAAA
CAGTTTATTG ATAGTCTGGC CCTGCCGGAA GCAGAAAAAA CGCGCCTTAA AGCCATGACG
CCGGCAAATT ATATCGGTCG CGCTGTGACT CTGGTCGACG AACTTAAATA A
 
Protein sequence
MELSSLTAVS PVDGRYGDKV SALRGIFSEY GLLKFRVQVE VRWLQKLAAH AAIKEVPAFA 
ADANGYLDTL VANFNEEDAA RIKTIERTTN HDVKAVEYFL KEKVAAIPAL HDVSEFIHFA
CTSEDINNLS HALMLKTARD EVILPYWRQV INAVKDLATQ YRDIPLLSRT HGQPATPSTL
GKEMANVAYR MERQFRQLNQ VEILGKINGA VGNYNAHIAA YPEVDWHQFS EEFVTSLGIQ
WNPYTTQIEP HDYIAELFDC IARFNTILID FDRDVWGYIA LNHFKQKTIA GEIGSSTMPH
KVNPIDFENS EGNLGLSNAV LHHLANKLPV SRWQRDLTDS TVLRNLGVGI GYALIAYQST
LKGVSKLEVN RDHLLDELDH NWEVLAEPIQ TVMRRYGIEK PYEKLKELTR GKRVDAEGMK
QFIDSLALPE AEKTRLKAMT PANYIGRAVT LVDELK