Gene EcE24377A_4556 details

Gene Information

Locus tag: EcE24377A_4556
Symbol: aceB
ID: 5590326
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4552077
End bp: 4553678
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 51%
IMG OID: 640928174
Product: malate synthase
Protein accession: YP_001465506
Protein GI: 157155521
COG category: [C] Energy production and conversion
COG ID: [COG2225] Malate synthase
TIGRFAM ID: [TIGR01344] malate synthase A
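
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes a single terminal stop codon (the record does not state either convention explicitly). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 4552077
end_bp = 4553678
gene_length_bp = 1602
protein_length_aa = 533


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Both checks should agree with the values listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions, 4553678 - 4552077 + 1 = 1602 bp, and 1602 / 3 - 1 = 533 aa, which matches the listed gene and protein lengths.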


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGAAC AGGCAACAAC AACCGATGAA CTGGCTTTCA TAAGGCCGTA TGGCGAGCAG 
GAGAAGCAAA TTCTTACTGC CGAAGCGGTA GAATTTCTGA CTGAGCTGGT GACGCATTTT
ACGCCACAAC GCAATAAACT TCTGGCAGCG CGCATTCAGC AGCAGCAAGA TATTGATAAC
GGAACTTTGC CTGATTTTAT TTCGGAAACA GCTTCCATTC GCGATACGGA CTGGAAAATT
CGTGGTATTC CCGCGGACTT ACAAGATCGT CGAGTCGAGA TAACTGGCCC GGTTGAGCGC
AAGATGGTGA TCAACGCACT GAACGCCAAT GTGAAAGTCT TTATGGCCGA TTTCGAAGAT
TCACTGGCAC CAGACTGGAA CAAAGTGATC GACGGGCAAA TTAACCTGCG CGATGCGGTT
AACGGCACCA TCAGCTATAC CAATGAAGCA GGCAAAATTT ATCAGCTCAA GCCCAATCCA
GCGGTTTTGA TTTGTCGGGT ACGCGGTCTG CACTTGCCGG AAAAACATGT CACCTGGCGT
GGTGAGGCAA TCCCCGGTAG CCTGTTTGAT TTTGCGCTCT ATTTCTTCCA CAACTATCAG
GCACTGTTGG CAAAGGGCAG CGGTCCCTAT TTCTATCTGC CGAAAACCCA GTCCTGGCAG
GAAGCGGCCT GGTGGAGCGA AGTCTTCAGC TATGCAGAAG ATCGCTTTAA TCTGCCGCGC
GGCACCATCA AGGCGACGTT GCTGATTGAA ACGCTGCCCG CCGTGTTCCA GATGGATGAA
ATCCTTCACG CGCTGCGTGA CCATATTGTT GGTCTGAACT GCGGTCGTTG GGATTACATC
TTCAGCTATA TCAAAACGTT GAAAAACTAT CCCGATCGCG TCCTGCCAGA CAGACAGGCA
GTGACGATGG ATAAACCATT CCTGAATGCT TACTCACGCC TGTTGATTAA AACCTGCCAT
AAACGCGGTG CTTTTGCGAT GGGCGGCATG GCAGCGTTTA TTCCGAGCAA AGATGAAGAG
CGCAATAACC AGGTGCTCAA CAAAGTAAAA GCGGATAAAG CGTTGGAAGC CAATAACGGT
CACGATGGCA CATGGATTGC TCACCCAGGC CTTGCGGATA CGGCAATGGC GGTATTCAAC
GACATTCTCG GCTCCCGTAA AAATCAGCTT GAAGTGATGC GCGAACAAGA CGCGCCGATT
ACTGCCGATC AGCTGCTGGC ACCTTGTGAT GGTGAACGCA CCGAAGAAGG TATGCGCGCC
AACATTCGCG TGGCAGTGCA GTACATCGAA GCATGGATCT CCGGCAACGG CTGCGTGCCG
ATTTATGGCC TGATGGAAGA TGCGGCGACG GCTGAAATTT CCCGTACCTC AATCTGGCAG
TGGATCCATC ATCAAAAAAC GTTGAGCAAT GGCAAACCGG TGACCAAAGC CTTGTTCCGC
CAGATGCTGG GTGAAGAGAT GAAAGTCATT GCCAGCGAAC TGGGCGAAGA ACGTTTCTCC
CAGGGGCGTT TTGACGATGC CGCACGCTTG ATGGAACAGA TCACCACTTC CGATGAGTTA
ATTGATTTCC TGACCCTGCC AGGCTACCGC CTGTTAGCGT AA
 
Protein sequence
MTEQATTTDE LAFIRPYGEQ EKQILTAEAV EFLTELVTHF TPQRNKLLAA RIQQQQDIDN 
GTLPDFISET ASIRDTDWKI RGIPADLQDR RVEITGPVER KMVINALNAN VKVFMADFED
SLAPDWNKVI DGQINLRDAV NGTISYTNEA GKIYQLKPNP AVLICRVRGL HLPEKHVTWR
GEAIPGSLFD FALYFFHNYQ ALLAKGSGPY FYLPKTQSWQ EAAWWSEVFS YAEDRFNLPR
GTIKATLLIE TLPAVFQMDE ILHALRDHIV GLNCGRWDYI FSYIKTLKNY PDRVLPDRQA
VTMDKPFLNA YSRLLIKTCH KRGAFAMGGM AAFIPSKDEE RNNQVLNKVK ADKALEANNG
HDGTWIAHPG LADTAMAVFN DILGSRKNQL EVMREQDAPI TADQLLAPCD GERTEEGMRA
NIRVAVQYIE AWISGNGCVP IYGLMEDAAT AEISRTSIWQ WIHHQKTLSN GKPVTKALFR
QMLGEEMKVI ASELGEERFS QGRFDDAARL MEQITTSDEL IDFLTLPGYR LLA
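
The sequences above are displayed in space-separated blocks of ten, so the spacing has to be removed before they can be measured or compared with the listed lengths and GC content. The following is a minimal sketch of that clean-up; `strip_grouping` and `gc_fraction` are hypothetical helper names, and the example uses only the first displayed row of the gene sequence, so its GC fraction is not expected to equal the 51% reported for the full gene.

```python
def strip_grouping(text):
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed row of the gene sequence, copied from the record (an excerpt only).
first_row = "ATGACTGAAC AGGCAACAAC AACCGATGAA CTGGCTTTCA TAAGGCCGTA TGGCGAGCAG"
bases = strip_grouping(first_row)
print(len(bases), round(gc_fraction(bases), 2))
```

Running the same two functions over the complete gene sequence would give the total length and GC fraction to compare against the Gene Length and GC content fields.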