Gene SbBS512_E4508 details

Gene Information

Locus tag: SbBS512_E4508
Symbol: aceB
ID: 6271027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 4215912
End bp: 4217513
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 51%
IMG OID: 641728297
Product: malate synthase
Protein accession: YP_001882695
Protein GI: 187731485
COG category: [C] Energy production and conversion
COG ID: [COG2225] Malate synthase
TIGRFAM ID: [TIGR01344] malate synthase A
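
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end coordinates are 1-based and inclusive and that the protein length excludes the stop codon. The variable names are illustrative and do not come from the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4215912
end_bp = 4217513
gene_length_bp = 1602
protein_length_aa = 533

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is the stop codon.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span_bp)        # 1602
print(codons)         # 534
print(coding_codons)  # 533
```

Under these assumptions the listed gene length matches the coordinate span, and 534 codons minus one stop codon gives the listed 533 aa.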


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGAAC AGGCAACAAC AACCGATGAA CTGGCTTTCA CAAGGCCGTA TGGCGAGCAG 
GAGAAGCAAA TTCTTACTGC CGAAGCGGTA GAATTTCTGA CTGAGCTGGT GACGCATTTT
ACGCCACAAC GAAATAAACT TCTGGCAGCG CGCATTCAGC AGCAGCAAGA TATTGATAAC
GGAACGTTGC CTGATTTTAT TTCGGAAACA GCTTCCATTT GCGATGCTGA TTGGAAAATT
CGCGGGATTC CTGCGGACTT AGAAGACCGC CGCGTAGAGA TAACTGGCCC GGTAGAGCGC
AAGATGGTGA TCAACGCGCT CAACGCCAAT GTGAAAGTCT TTATGGCCGA TTTCGAAGAT
TCACTGGCCC CGGACTGGAA CAAAGTGATC GACGGGCAAA TTAACCTGCG TGATGCAGTT
AACGGTACCA TCAGCTATAC CAATGAAGCA GGCAAAATTT ACCAGCTCAA GCCCAATCCA
GCGGTTTTGA TTTGTCGGGT ACGCGGTCTG CACTTGCCGG AAAAACATGT CACCTGGCGT
GGTGAGGCAA TCCCCGGCAG CCTGTTTGAT TTTGCGCTCT ATTTCTTCCA CAACTATCAG
GCTCTGTTGG CAAAGGGCAG TGGTCCCTAT TTCTATCTGC CGAAAACCCA GTCCTGGCAG
GAAGCGGCCT GGTGGAGCGA AGTCTTCAGC TATGCAGAAG ATCGCTTTAA TCTGCCGCGC
GGCACCATCA AGGCGACGTT GCTGATTGAA ACGCTGCCCG CCGTGTTCCA GATGGATGAA
ATCCTTCACG CGCTGCGTGA CCATATTGTT GGTCTGAACT GCGGTCGTTG GGATTACATC
TTCAGCTATA TCAAAACGTT GAAAAACTAT CCCGATCGCG TCCTGCCAGA CAGACAGGCA
GTGACGATGG ATAAACCATT CCTGAATGCT TACTCACGCC TGTTGATTAA AACCTGCCAT
AAACGCGGTG CTTTTGCGAT GGGCGGCATG GCGGCGTTTA TTCCGAGCAA AGATGAAGAG
CACAATAACC AGGTGCTCAA CAAAGTAAAA GCGGATAAAT CGCTGGAAGC CAATAACGGT
CACGATGGCA CATGGATCGC TCACCCAGGC CTTGCGGATA CGGCAATGGC GGTATTCAAC
GACATTCTCG GCTCCCGTAA AAATCAGCTT GAAGTGATGC GCGAACAAGA CGCGCCGATT
ACTGCCGATC AGCTACTGGC ACCTTGTGAC GGTGAACGCA CCGAAGAAGG TATGCGCGCA
AATATTCGCG TAGCCGTGCA GTACATCGAA GCGTGGATCT CTGGCAACGG CTGTGTGCCG
ATTTATGGCC TGATGGAAGA TGCGGCGACG GCTGAAATTT CCCGTACCTC AATCTGGCAG
TGGATCCATC ATCAAAAAAT GTTGAGCAAT GGCAAACCGG TAACTAAAGC CTTGTTCCGC
CAGATGCTGG GCGAAGAGAT GAAAGTCATT GCCAGCGAAC TGGGCGAAGA ACGTTTCTCC
CAGGGGCGTT TTGACGATGC CGCACGCTTG ATGGAACAGA TCACCACTTC CGATGAGTTA
ATTGATTTCC TGACCCTGCC AGGCTACTGC CTGTTAGCGT AA
 
Protein sequence
MTEQATTTDE LAFTRPYGEQ EKQILTAEAV EFLTELVTHF TPQRNKLLAA RIQQQQDIDN 
GTLPDFISET ASICDADWKI RGIPADLEDR RVEITGPVER KMVINALNAN VKVFMADFED
SLAPDWNKVI DGQINLRDAV NGTISYTNEA GKIYQLKPNP AVLICRVRGL HLPEKHVTWR
GEAIPGSLFD FALYFFHNYQ ALLAKGSGPY FYLPKTQSWQ EAAWWSEVFS YAEDRFNLPR
GTIKATLLIE TLPAVFQMDE ILHALRDHIV GLNCGRWDYI FSYIKTLKNY PDRVLPDRQA
VTMDKPFLNA YSRLLIKTCH KRGAFAMGGM AAFIPSKDEE HNNQVLNKVK ADKSLEANNG
HDGTWIAHPG LADTAMAVFN DILGSRKNQL EVMREQDAPI TADQLLAPCD GERTEEGMRA
NIRVAVQYIE AWISGNGCVP IYGLMEDAAT AEISRTSIWQ WIHHQKMLSN GKPVTKALFR
QMLGEEMKVI ASELGEERFS QGRFDDAARL MEQITTSDEL IDFLTLPGYC LLA
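
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is read on the forward strand in frame 1, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical and are not part of the source database. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
# Codon table in NCBI order: each base position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence shown in this record.
first_line = "ATGACTGAAC AGGCAACAAC AACCGATGAA CTGGCTTTCA CAAGGCCGTA TGGCGAGCAG"
last_line = "ATTGATTTCC TGACCCTGCC AGGCTACTGC CTGTTAGCGT AA"

print(translate(first_line))  # MTEQATTTDELAFTRPYGEQ
print(translate(last_line))   # IDFLTLPGYCLLA
```

The two outputs match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_content` would give the fraction to compare against the GC content field.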