Gene EcSMS35_1087 details


Gene Information

Locus tag: EcSMS35_1087
Symbol:
ID: 6145966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1103143
End bp: 1104309
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 48%
IMG OID: 641615973
Product: lyase
Protein accession: YP_001743165
Protein GI: 170682542
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1027] Aspartate ammonia-lyase
TIGRFAM ID:
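
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1103143, 1104309)
print(length)                     # 1167
print(protein_length_aa(length))  # 388
```

Both results agree with the Gene Length and Protein Length fields of this record.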


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCGTA TTGAAAGCGA TTTCCTTGGT GAGCGTGAAA TTCCGGATGA TTGCTATTAT 
GGTGTTCAGA CACTTCGTGG AAAAGATAAT TTCCATATCA CTGAAATGCC AATGAGTCAG
GAGCCTTTCT TCATCATTGC CTTCGGTTAT GTGAAAAAAG CAGCTGCGAT GGCTAATAAG
GAGTTAGGTA CAATTCCTGC TGATGTTGCG GATGCTTTAA TCTGGGCTTG CGACCAATTA
ATTGATGGTA AGTACCGGGA ACAGTTTGTG ACCGACTGGC TTCAGGGGGG AGCCGGAACG
TCAACCAATA TGAACTGCAA TGAAGTGATT TGTAATCTTG CTGCAGAAAA ACTTGGTAGC
GTTAAAGGGG ATTACAAACG CGTTTCGCCA AACGACCATG CTAATTTTGG TCAGTCCACC
AATGATACAT ATCCGACAGC ATTACATCTG GCATTGCTGC TGCGAAGCAA TGTATTGCTT
GAAGCTGTTG AACACCTCGT AGGAGCATTC TACAAAAAGG CTGACGAGTT CAGCACAGTG
CTCAAGATGG GACGTACTCA CCTGCAGGAT GCAGTGCCAA TGACTCTCGG CCAGGAGTTC
CATGGCTGGG GTTTTACAAT TAATGATGAA ATTCGGGTCA TCCGTAATGC ACAGGAACAC
CTGCGGGTTG TAAACCTTGG TGCTACTGCG ATTGGTACCT GTGTTACCGC TCACCCGGAT
TATCCCGCTT TAGCCGTGAA ATATCTGGCC CAGATCACAG GCATTAATTT CAGGAACAGT
GAAGATCTCA TCGCTGCCAC AAGCGACTGT GGTGCATATG TTGCACTCAG TTCGGCCATG
AAGAGCCTCT CTGTGAAGCT TACCAAGGTT TGTAATGACA TTCGACTGCT TGCTTCAGGC
CCCCGTTGCG GCCTGGCTGA AATTAACCTG CCTCAATTGC AACCGGGTTC TTCCATTATG
CCTGGTAAGG TTAACCCCGT TATCCCGGAA GTAACAAACC AGTCTTGCTT CCTGGTTCAG
GGGCTGGACA CCACAGTGAT GCTGGCGGCA TCTGCTGGTC AGCTTGAGCT TAACGTTATG
TGGATTTGCC CCTATATTTC CAGACACCTG TTATCACTTA ACCCATTACT GGCCTGCTGC
CGCAGATATT CCCGTGGCGA GCGATAA
 
Protein sequence
MSRIESDFLG EREIPDDCYY GVQTLRGKDN FHITEMPMSQ EPFFIIAFGY VKKAAAMANK 
ELGTIPADVA DALIWACDQL IDGKYREQFV TDWLQGGAGT STNMNCNEVI CNLAAEKLGS
VKGDYKRVSP NDHANFGQST NDTYPTALHL ALLLRSNVLL EAVEHLVGAF YKKADEFSTV
LKMGRTHLQD AVPMTLGQEF HGWGFTINDE IRVIRNAQEH LRVVNLGATA IGTCVTAHPD
YPALAVKYLA QITGINFRNS EDLIAATSDC GAYVALSSAM KSLSVKLTKV CNDIRLLASG
PRCGLAEINL PQLQPGSSIM PGKVNPVIPE VTNQSCFLVQ GLDTTVMLAA SAGQLELNVM
WICPYISRHL LSLNPLLACC RRYSRGER
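
The gene and protein sequences above can be cross-checked by translating the DNA with translation table 11 and by computing the GC fraction. The sketch below is a minimal, dependency-free illustration. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which codons may serve as starts), and it assumes the input contains only A, C, G and T. The names `CODON_TABLE`, `clean`, `translate` and `gc_fraction` are hypothetical helpers introduced here.

```python
# Codon-to-amino-acid assignments, written in the compact NCBI ordering
# (first, second and third base each cycle through T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon.

    Assumption: the sequence starts with ATG, as this gene does, so no
    special handling of alternative start codons is attempted.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTCTCGTA TTGAAAGCGA TTTCCTTGGT"))  # MSRIESDFLG
```

Applied to the full gene sequence, `translate` should reproduce the listed protein sequence, and `gc_fraction` can be compared with the GC content field of the record.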