Gene EcSMS35_4468 details


Gene Information

Locus tag: EcSMS35_4468
Symbol: aceB
ID: 6145921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4560847
End bp: 4562448
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 52%
IMG OID: 641619284
Product: malate synthase
Protein accession: YP_001746396
Protein GI: 170683352
COG category: [C] Energy production and conversion
COG ID: [COG2225] Malate synthase
TIGRFAM ID: [TIGR01344] malate synthase A
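
The coordinate and length fields in this record agree with each other, and a short check can make the arithmetic explicit. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4560847
end_bp = 4562448

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1602 bp
print(protein_length, "aa")  # 533 aa
```

Both results match the Gene Length and Protein Length fields listed in the record.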


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.554471
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.0868855
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAAC AGGCAACAAC AACCGATGAA CTGGCTTTCA TAAGGCCGTA TGGCGAGCAG 
GAGAAGCAAA TTCTTACTGC CGAAGCGGTA GAATTTCTGA CTGAGCTGGT GACGCATTTT
ACGCCACAAC GCAATAAACT TCTGGCAGCG CGCATTCAGC AGCAGCAGGA TATCGATAAC
GGAACGTTGC CTGATTTTAT TTCGGAAACA GCTTCCATTC GTGATGCTGA CTGGAAAATT
CGTGGTATTC CCGCGGACTT ACAAGATCGT CGAGTCGAGA TAACTGGCCC GGTTGAGCGC
AAGATGGTGA TCAACGCGCT CAACGCCAAT GTGAAAGTCT TTATGGCCGA TTTCGAAGAT
TCACTGGCCC CGGACTGGAA CAAAGTGATC GACGGGCAAA TTAACCTGCG CGATGCGGTT
AACGGCACCA TCAGCTATAC CAATGAAGCA GGCAAAATTT ATCAGCTCAA GCCCAATCCA
GCGGTGTTGA TTTGTCGGGT TCGCGGTCTG CACTTGCCGG AAAAACATGT CACCTGGCGC
GGCGAAGCAA TTCCTGGCAG CCTGTTTGAT TTTGCGCTCT ATTTCTTCCA CAACTACCAG
GCTCTGTTAG CAAAAGGCAG CGGTCCCTAT TTCTATCTGC CGAAAACCCA GTCCTGGCAG
GAAGCGGCCT GGTGGAGCGA AGTCTTCAGC TATGCAGAAG ATCGCTTTAA TCTGCCGCGC
GGCACCATCA AGGCGACGTT GCTGATTGAA ACGCTGCCCG CCGTGTTCCA GATGGACGAA
ATCCTTCACG CGCTGCGTGA CCATATTGTT GGTCTGAACT GCGGTCGTTG GGATTACATC
TTCAGTTATA TCAAAACGTT GAAAAACTAT CCCGATCGCG TCCTGCCAGA TAGACAGGCA
GTGACGATGG ATAAACCCTT CCTGAATGCT TACTCACGCC TGTTGATTAA AACCTGCCAT
AAACGCGGCG CATTTGCGAT GGGCGGCATG GCGGCGTTTA TTCCGAGCAA AGATGAAGAG
CGCAATAACC AGGTGCTCAA CAAAGTAAAA GCGGATAAAT CGCTGGAAGC CAATAACGGT
CACGATGGCA CATGGATCGC TCACCCAGGT CTTGCGGATA CGGCAATGGC GGTATTCAAC
GACATTCTCG GCTCCCGTAA AAATCAGCTT GAAGTGATGC GCGAACAAGA CGCGCCGATT
ACTGCCGATC AGCTGCTGGC ACCTTGTGAT GGTGAACGCA CCGAAGAAGG TATGCGCGCC
AACATTCGCG TGGCTGTGCA GTACATCGAA GCATGGATCT CCGGCAACGG CTGCGTGCCG
ATTTATGGCC TGATGGAAGA TGCGGCGACG GCTGAAATTT CCCGTACCTC AATCTGGCAG
TGGATCCATC ATCAAAAAAC GTTGAGCAAT GGCAAACCGG TGACCAAAGC CTTGTTCCGC
CAGATGCTGG GCGAAGAGAT GAAAGTCATT GCCAGCGAAC TGGGCGAAGA ACGTTTCTCC
CAGGGGCGTT TTGACGATGC CGCACGTTTG ATGGAACAGA TCACCACTTC CGATGAGTTA
ATTGATTTCC TGACCCTGCC AGGCTACCGC CTGTTAGCGT AA
 
Protein sequence
MTEQATTTDE LAFIRPYGEQ EKQILTAEAV EFLTELVTHF TPQRNKLLAA RIQQQQDIDN 
GTLPDFISET ASIRDADWKI RGIPADLQDR RVEITGPVER KMVINALNAN VKVFMADFED
SLAPDWNKVI DGQINLRDAV NGTISYTNEA GKIYQLKPNP AVLICRVRGL HLPEKHVTWR
GEAIPGSLFD FALYFFHNYQ ALLAKGSGPY FYLPKTQSWQ EAAWWSEVFS YAEDRFNLPR
GTIKATLLIE TLPAVFQMDE ILHALRDHIV GLNCGRWDYI FSYIKTLKNY PDRVLPDRQA
VTMDKPFLNA YSRLLIKTCH KRGAFAMGGM AAFIPSKDEE RNNQVLNKVK ADKSLEANNG
HDGTWIAHPG LADTAMAVFN DILGSRKNQL EVMREQDAPI TADQLLAPCD GERTEEGMRA
NIRVAVQYIE AWISGNGCVP IYGLMEDAAT AEISRTSIWQ WIHHQKTLSN GKPVTKALFR
QMLGEEMKVI ASELGEERFS QGRFDDAARL MEQITTSDEL IDFLTLPGYR LLA
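
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that mapping, not the database's own pipeline. The names `CODON_TABLE` and `translate` are hypothetical. It assumes an unambiguous A/C/G/T sequence starting in frame, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons).

```python
from itertools import product

# NCBI codon ordering: bases in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
# Amino acid assignments shared by the standard code and table 11 ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in this record
    can be pasted in directly. Alternative start codons are not treated
    specially; that is adequate here because the gene begins with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACTGAAC AGGCAACAAC AACCGATGAA"))  # MTEQATTTDE
```

Applied to the last four codons of the gene, `translate("CTGTTAGCGTAA")` returns `LLA`, matching the end of the protein sequence, with the final TAA read as the stop codon.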