Gene EcSMS35_1994 details


Gene Information

Locus tag: EcSMS35_1994
Symbol: purB
ID: 6146250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2014524
End bp: 2015894
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 53%
IMG OID: 641616870
Product: adenylosuccinate lyase
Protein accession: YP_001744046
Protein GI: 170682479
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0015] Adenylosuccinate lyase
TIGRFAM ID: [TIGR00928] adenylosuccinate lyase
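
The coordinate and length fields above are mutually consistent, which a short Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive (an assumption, not stated in the record) and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 2014524
end_bp = 2015894

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# One codon is three bases; the last codon is a stop and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1371 0 456
```

Under these assumptions the result matches the reported gene length of 1371 bp and protein length of 456 aa.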


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.922086
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTAT CCTCACTGAC CGCCGTTTCC CCTGTCGATG GACGCTACGG CGATAAAGTC 
AGCGCGCTGC GCGGGATTTT CAGCGAATAT GGTTTGCTGA AATTCCGTGT ACAAGTTGAA
GTACGTTGGC TGCAAAAACT GGCCGCGCAC GCAGCGATCA AGGAAGTTCC TGCTTTTGCT
GCCGACGCAA TCGGTTACCT TGATGCAATT GTCGCCAGTT TCAGCGAAGA AGATGCCGCA
CGCATCAAAA CCATCGAGCG TACCACTAAC CACGACGTTA AAGCGGTTGA GTATTTCCTG
AAAGAAAAAG TGGCGGAGAT CCCGGAACTG CACGCGGTTT CTGAATTCAT CCACTTTGCC
TGTACTTCGG AAGATATCAA TAACCTCTCC CACGCATTAA TGCTGAAAAC CGCGCGTGAT
GAAGTGATCC TGCCGTACTG GCGTCAACTG ATTGATGGCA TTAAAGATCT CGCCGCTCAG
TACCGCGATA TCCCGCTGCT GTCCCGTACC CACGGTCAGC CAGCCACGCC GTCAACCATC
GGTAAAGAGA TGGCTAACGT CGCCTACCGT ATGGAGCGCC AGTACCGCCA GCTTAACCAG
GTGGAGATCC TCGGCAAAAT CAACGGTGCG GTCGGTAACT ATAACGCCCA CATCGCCGCT
TACCCGGAAG TTGACTGGCA TCAGTTCAGC GAAGAGTTCG TCACCTCGCT GGGTATTCAG
TGGAACCCGT ACACCACCCA GATCGAACCG CACGACTACA TTGCCGAACT GTTTGATTGC
GTTGCGCGCT TCAACACCAT TCTGATCGAC TTTGACCGTG ACGTCTGGGG TTATATCGCC
CTTAACCACT TCAAACAGAA AACCATTGCT GGTGAGATTG GTTCTTCCAC CATGCCGCAT
AAAGTTAACC CGATCGACTT CGAAAACTCC GAAGGAAACC TGGGCCTTTC CAACGCGGTA
TTGCAGCACC TGGCAAGCAA ACTGCCAGTT TCCCGCTGGC AGCGTGACCT GACCGACTCC
ACCGTGCTGC GTAACCTCGG CGTGGGTATC GGTTATGCGC TGATTGCGTA TCAATCCACC
CTGAAAGGCG TGAGCAAACT GGAAGTGAAC CGTGGCCATC TGCTGGATGA ACTGGATCAC
AACTGGGAAG TGCTGGCTGA GCCAATCCAG ACAGTTATGC GTCGCTATGG CATCGAAAAA
CCGTACGAGA AGCTGAAAGA GCTGACTCGC GGTAAGCGCG TTGACGCCGA AGGCATGAAG
CAGTTTATCG ACGGTCTGGC GCTGCCGGAA GAAGAGAAAG CCCGCCTTAA AGCGATGACG
CCGGCAAACT ACATTGGTCG CGCCATCACC ATGGTTGATG AGCTGAAATA A
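
A GC content figure such as the one in the record could be computed along the following lines. This is a minimal sketch, not the database's own method; the function name `gc_percent` is hypothetical, and it assumes whitespace between the ten-base groups should be ignored.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# First ten bases of the gene sequence above: two G, no C.
print(gc_percent("ATGGAATTAT"))  # prints: 20.0
```

Passing the full gene sequence to the same function would give the gene-wide value, which the record rounds to a whole percent.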
 
Protein sequence
MELSSLTAVS PVDGRYGDKV SALRGIFSEY GLLKFRVQVE VRWLQKLAAH AAIKEVPAFA 
ADAIGYLDAI VASFSEEDAA RIKTIERTTN HDVKAVEYFL KEKVAEIPEL HAVSEFIHFA
CTSEDINNLS HALMLKTARD EVILPYWRQL IDGIKDLAAQ YRDIPLLSRT HGQPATPSTI
GKEMANVAYR MERQYRQLNQ VEILGKINGA VGNYNAHIAA YPEVDWHQFS EEFVTSLGIQ
WNPYTTQIEP HDYIAELFDC VARFNTILID FDRDVWGYIA LNHFKQKTIA GEIGSSTMPH
KVNPIDFENS EGNLGLSNAV LQHLASKLPV SRWQRDLTDS TVLRNLGVGI GYALIAYQST
LKGVSKLEVN RGHLLDELDH NWEVLAEPIQ TVMRRYGIEK PYEKLKELTR GKRVDAEGMK
QFIDGLALPE EEKARLKAMT PANYIGRAIT MVDELK
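
The relationship between the gene sequence and the protein sequence can be sketched in Python. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which codons may act as start codons), so the sketch builds the standard codon table. The names `CODON_TABLE` and `translate` are hypothetical, and translation is assumed to begin at the first base and stop at the first stop codon.

```python
from itertools import product

# Standard codon order: first base varies slowest, over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 bases of the gene sequence above.
print(translate("ATGGAATTAT CCTCACTGAC CGCCGTTTCC"))  # prints: MELSSLTAVS
```

The output matches the first ten residues of the protein sequence; the final bases `ATGGTTGATG AGCTGAAATA A` likewise translate to `MVDELK` followed by the stop codon.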