Gene SeHA_C1349 details


Gene Information

Locus tag: SeHA_C1349
Symbol: purB
ID: 6491676
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 1323827
End bp: 1325197
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 51%
IMG OID: 642741584
Product: adenylosuccinate lyase
Protein accession: YP_002045234
Protein GI: 194448620
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0015] Adenylosuccinate lyase
TIGRFAM ID: [TIGR00928] adenylosuccinate lyase
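
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 1323827
end_bp = 1325197

# Inclusive coordinates span (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1371 456
```

Both results agree with the Gene Length (1371 bp) and Protein Length (456 aa) fields listed in the record.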


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTAT CCTCACTGAC CGCCGTTTCC CCTGTCGATG GACGCTACGG CGATAAAGTC 
AGCGCGCTGC GCGGAATTTT TAGCGAATAC GGTTTGCTGA AATTTCGTGT ACAAGTCGAA
GTACGTTGGC TGCAGAAATT AGCCGCGCAC GCAGCGATCA AGGAAGTTCC TGCTTTTGCT
GCCGACGCAA ACGGTTACCT GGATACGCTT GTGGCAAACT TCAATGAAGA AGATGCCGCG
CGCATTAAAA CCATTGAGCG CACCACGAAT CACGATGTGA AAGCGGTTGA GTATTTTCTT
AAAGAAAAAG TCGCCGCGAT CCCGGCGCTA CATGACGTTT CCGAATTTAT CCACTTTGCC
TGCACTTCTG AGGACATTAA CAACCTGTCG CACGCGTTAA TGCTCAAAAC CGCGCGCGAT
GAAGTGATCC TGCCTTACTG GCGTCAGGTG ATTAACGCGG TTAAAGATCT CGCCACGCAG
TATCGCGACA TTCCTCTGCT CTCCCGCACC CATGGCCAGC CGGCAACGCC TTCCACTCTG
GGTAAAGAGA TGGCGAACGT GGCGTATCGT ATGGAGCGTC AGTTCCGCCA GCTCAACCAG
GTGGAGATCC TCGGTAAAAT CAACGGTGCC GTAGGCAACT ATAACGCGCA TATCGCCGCC
TATCCGGAAG TTGACTGGCA TCAGTTCAGC GAAGAGTTCG TCACCTCGCT GGGCATCCAG
TGGAACCCTT ACACCACCCA GATTGAACCA CATGATTATA TTGCGGAACT GTTTGACTGT
ATCGCGCGCT TTAACACCAT CCTGATCGAT TTCGATCGCG ATGTCTGGGG CTATATTGCG
TTGAACCATT TCAAACAGAA AACCATCGCC GGGGAGATCG GTTCTTCTAC CATGCCGCAT
AAAGTTAACC CCATTGACTT TGAAAACTCA GAAGGCAACC TCGGTCTGTC TAATGCAGTA
TTGCACCATC TGGCAAACAA ACTGCCGGTT TCCCGCTGGC AGCGCGATCT GACCGACTCA
ACCGTTCTGC GTAACCTGGG TGTCGGCATC GGCTATGCGC TTATCGCTTA TCAGTCCACC
CTGAAGGGCG TCAGCAAGCT GGAAGTAAAC CGCGATCATC TGCTTGACGA ACTGGATCAC
AACTGGGAAG TATTAGCCGA ACCGATCCAG ACCGTCATGC GCCGCTATGG TATTGAAAAA
CCATATGAAA AACTGAAAGA GTTGACCCGT GGCAAGCGTG TCGATGCCGA AGGAATGAAA
CAGTTTATTG ATAGCCTGGC CCTGCCGGAA GCAGAAAAAA CGCGCCTTAA AGCCATGACG
CCGGCAAATT ATATCGGTCG CGCTGTGACT CTGGTCGACG AACTTAAATA A
 
Protein sequence
MELSSLTAVS PVDGRYGDKV SALRGIFSEY GLLKFRVQVE VRWLQKLAAH AAIKEVPAFA 
ADANGYLDTL VANFNEEDAA RIKTIERTTN HDVKAVEYFL KEKVAAIPAL HDVSEFIHFA
CTSEDINNLS HALMLKTARD EVILPYWRQV INAVKDLATQ YRDIPLLSRT HGQPATPSTL
GKEMANVAYR MERQFRQLNQ VEILGKINGA VGNYNAHIAA YPEVDWHQFS EEFVTSLGIQ
WNPYTTQIEP HDYIAELFDC IARFNTILID FDRDVWGYIA LNHFKQKTIA GEIGSSTMPH
KVNPIDFENS EGNLGLSNAV LHHLANKLPV SRWQRDLTDS TVLRNLGVGI GYALIAYQST
LKGVSKLEVN RDHLLDELDH NWEVLAEPIQ TVMRRYGIEK PYEKLKELTR GKRVDAEGMK
QFIDSLALPE AEKTRLKAMT PANYIGRAVT LVDELK
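
The gene and protein sequences above can be related by translating the nucleotide sequence codon by codon. The sketch below is one possible way to do this in plain Python, not the method used to produce this record. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons are permitted, and this gene begins with ATG). The `translate` helper and the `CODON_TABLE` name are hypothetical, introduced only for illustration.

```python
from itertools import product

# Codons enumerated in T, C, A, G order at each position, paired with the
# amino acids of the standard genetic code ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listed above (60 bases, 20 codons).
first_row = "ATGGAATTAT CCTCACTGAC CGCCGTTTCC CCTGTCGATG GACGCTACGG CGATAAAGTC"
print(translate(first_row))  # prints: MELSSLTAVSPVDGRYGDKV
```

The output matches the first 20 residues of the protein sequence shown above. Passing the full gene sequence to the same function would translate the remaining codons in the same way, stopping at the terminal TAA.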