Gene EcSMS35_1116 details

Gene Information

Locus tag: EcSMS35_1116
Symbol:
ID: 6143285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1131997
End bp: 1133397
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 48%
IMG OID: 641615996
Product: putative aspartate ammonia-lyase
Protein accession: YP_001743188
Protein GI: 170683617
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1027] Aspartate ammonia-lyase
TIGRFAM ID: [TIGR00839] aspartate ammonia-lyase
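
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes that the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon, neither of which the record states explicitly.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 1131997
end_bp = 1133397
gene_length_bp = 1401
protein_length_aa = 466

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("codons minus stop matches protein length:",
      expected_protein_aa == protein_length_aa)
```

Under those assumptions both checks agree with the listed values: 1133397 - 1131997 + 1 = 1401 bp, and 1401 / 3 = 467 codons, one of which is the stop codon.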


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCGTA TTGAAAGCGA TTTCCTTGGT GAGCGTGAAG TTCCGGATGA TTGCTATTAT 
GGTGTTCAGA CACTTCGTGG TGAGGATAAT TTCCATATTA CTGAAATGCC AATGAGTCAG
GAGCCCTTCT TCATCATTGC CTTCGGTTAT GTGAAAAAAG CAGCTGCGAT GGCTAATAAG
GAGTTAGGTA CAATTCCTGC TGATGTTGCG GATGCTTTAA TCTGGGCTTG CGACCAATTA
ATTGATGGTA AGTACCGGGA ACAGTTTGTG ACCGACTGGC TTCAGGGGGG AGCCGGAACG
TCAACCAATA TGAACTGCAA TGAAGTGATT TGTAATCTTG CTGCAGAAAA ACTTGGTAGC
GTTAAAGGGG ATTACAAACG CGTTTCGCCA AACGACCATG CTAATTTTGG TCAGTCCACC
AATGATACAT ATCCGACAGC ATTACATCTG GCATTGCTGC TGCGAAGCAA TGTATTGCTT
GAAGCTGTTG AACACCTCGT AGGAGCATTC TACAAAAAGG CTGACGAGTT CAGCACAGTG
CTCAAGATGG GACGTACTCA CCTGCAGGAT GCAGTGCCAA TGACTCTCGG CCAGGAGTTC
CATGGCTGGG GTTTTACAAT TAATGATGAA ATTCGGGTCA TCCGTAATGC ACAGGAACAC
CTGCGGGTTG TAAACCTTGG TGCTACTGCG ATTGGTACCT GTGTTACCGC TCACCCGGAT
TATCCCGCTT TAGCCGTGAA ATATCTGGCC CAGATCACAG GCATTAATTT CAGGAACAGT
GAAGATCTCA TCGCTGCCAC AAGCGACTGT GGTGCATATG TTGCACTCAG TTCGGCCATG
AAGAGCCTCT CTGTGAAGCT TACCAAGGTT TGTAATGACA TTCGACTGCT TGCTTCAGGC
CCCCGTTGCG GCCTGGCTGA AATTAACCTG CCTCAATTGC AACCGGGTTC TTCCATTATG
CCTGGTAAGG TTAACCCCGT TATCCCGGAA GTAACAAACC AGTCTTGCTT CCTGGTTCAG
GGGCTGGACA CCACAGTGAT GCTGGCGGCA TCTGCTGGTC AGCTTGAGCT TAACGTTATG
GAGCCGGTCA TCACCTTTGC GCTGTTCACC TCACTCAAGG TGATGACGAA TGCCTGTAAC
ACACTCCGAA CTAAATGCAT TGATGGCATT ACAGCTAATT CCGATCGAAC TGCAGAGATG
GTAATGCATT CCTGCGGTAT TGTGACTCTC CTGAAGCCAC ATCTGGGATA TAAGGTGTGT
TCTGAAATGG CGCACGAAGC ATACCATACA GGCAAATCCC TCCATCAGAT AGTGGTTGTC
GAACGTAAGC TACTCACACA GGAAGAATGG GAGAAGACAT TCAATCTGGA TAATCTGATT
GCTCCGAAGT TCGAACAATA A
 
Protein sequence
MSRIESDFLG EREVPDDCYY GVQTLRGEDN FHITEMPMSQ EPFFIIAFGY VKKAAAMANK 
ELGTIPADVA DALIWACDQL IDGKYREQFV TDWLQGGAGT STNMNCNEVI CNLAAEKLGS
VKGDYKRVSP NDHANFGQST NDTYPTALHL ALLLRSNVLL EAVEHLVGAF YKKADEFSTV
LKMGRTHLQD AVPMTLGQEF HGWGFTINDE IRVIRNAQEH LRVVNLGATA IGTCVTAHPD
YPALAVKYLA QITGINFRNS EDLIAATSDC GAYVALSSAM KSLSVKLTKV CNDIRLLASG
PRCGLAEINL PQLQPGSSIM PGKVNPVIPE VTNQSCFLVQ GLDTTVMLAA SAGQLELNVM
EPVITFALFT SLKVMTNACN TLRTKCIDGI TANSDRTAEM VMHSCGIVTL LKPHLGYKVC
SEMAHEAYHT GKSLHQIVVV ERKLLTQEEW EKTFNLDNLI APKFEQ
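
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the database's own pipeline: the function names `clean`, `translate` and `gc_percent` are hypothetical, and it relies on the understanding that translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons, which are not handled here).

```python
# Codon table built from the NCBI base order T, C, A, G.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and the final row of the gene sequence listed above.
first_bases = "ATGTCTCGTA TTGAAAGCGA TTTCCTTGGT"
last_row = "GCTCCGAAGT TCGAACAATA A"

print(translate(first_bases))  # MSRIESDFLG
print(translate(last_row))     # APKFEQ
```

The two printed fragments match the start and end of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the whole protein and the listed GC content to be compared against the record.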