Gene Spro_4503 details


Gene Information

Locus tag: Spro_4503
Symbol:
ID: 5606216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 4990506
End bp: 4992104
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 55%
IMG OID: 640940065
Product: malate synthase
Protein accession: YP_001480725
Protein GI: 157372736
COG category: [C] Energy production and conversion
COG ID: [COG2225] Malate synthase
TIGRFAM ID: [TIGR01344] malate synthase A
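
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4990506
end_bp = 4992104
gene_length_bp = 1599
protein_length_aa = 532

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be the stop
# codon, which does not contribute an amino acid.
codons, remainder = divmod(computed_gene_length, 3)
computed_protein_length = codons - 1

print(computed_gene_length, remainder, computed_protein_length)  # 1599 0 532
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.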


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00577319
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGCAAC AGATAGTAGG CACGGAATTA ACGTTTACGC AGGGTTTTAG CGCTGCTGAA 
CGACAGGTGT TGACGGATGA CGCGGTCGAA TTCCTGGCGG AATTGGTGAG TAAATTTACT
CCACAGCGTA ACAAACTGTT GGCTGCGCGT GCCTGCTGGC AGCAGAAGAT CGATCAAGGT
GAACGTCCAG ACTTCATTTC GGAAACTAAT TCCATTCGCA ATGAAAAGTG GTCGATCCGT
GGCATACCAG AGGATCTTCG CGACCGCCGG GTGGAAATCA CCGGTCCGGT TGAACGCAAG
ATGGTCATCA ACGCCCTGAA CGCCAATGTG AAGGTGTTTA TGGCGGACTT CGAAGACTCA
CTGGCACCGA GCTGGGACAA AGTCATCGAC GGCCAAATCA ACCTGCATGA CGCGGTGAAC
GGCACCATCT CTTACACCAA TGAAGCCGGC AAGATTTATC AGTTAAAGCC GAACCCGGCG
GTATTGATTG CTCGCGTACG CGGCCTGCAT TTGCCGGAAA AACACGTGCA ATGGCAGGGG
GAAGCGATCC CCGGTGGCCT GTTCGATTTT GCGCTGTATT TCTTCCATAA CTATCGTCAA
CTGCTGGCTA AAGGCAGTGG CCCTTATTTC TACCTGCCAA AAACCCAGTC CTGGCAGGAA
GCGGCCTGGT GGAGCGAAGT CTTCAGCTTT GCCGAGGATC GTTTCTCCCT GCCACGCGGC
ACGATCAAAG CCACGGTGCT GATCGAAACG CTGCCGGCAG TATTCCAGAT GGACGAGATC
CTCTACCACC TGCGCGATCA TATCGTCGGC TTGAACTGCG GCCGTTGGGA TTACATCTTC
AGCTACATCA AGACGCTGAA AAACCATGCT GACCGGGTAT TGCCGGATCG TCAGTCGGTC
ACCATGGACA AGTCATTCCT TAGCGCCTAT TCCCGATTGC TGATCAAGAC CTGCCACAAG
CGCGGTGCCT TTGCCATGGG CGGCATGGCG GCGTTTATCC CGAGCAAAGA CGCCGAGAAA
AATGCCTGGG TGCTGAACAA GGTGCGGGCG GATAAAGAGC TGGAGGCCAA TAACGGCCAC
GACGGTACCT GGGTGGCCCA TCCAGGGCTG GCGGATACCG TAATGGAAGT CTTCAGCCGG
GTGCTCGGTG AGCGCCGTAA CCAACTCGAA GTGCTGCGTG AAAACGACGC GCTAATCAGT
GCTGCGCAGT TGCTTGAACC TTGTGACGGG GAGCGTACCG AAGCCGGCAT GCGCGCCAAT
ATCCGCGTGG CGGTGCAGTA CATCGAAGCC TGGATCTCCG GCAATGGCTG CGTCCCGATT
TATGGCCTGA TGGAAGACGC GGCGACGGCG GAAATTTCCC GTACCTCTAT CTGGCAGTGG
ATCCACCATG AAAAGAGCCT GAGTGATGGC CAACTGGTCA CCAAGGCGCT GTTCCGTCAG
ATGCTGAAAG AAGAAATGCT GGTAGTACGT GAAGAGTTGG GTGAGGCACG CTTTAACGCT
GGCCGCTTCG ACGAAGCGGC ACGCCTGATG GAGCGTATCA CTACGCAAGA CGAATTAATC
GATTTCCTGA CTTTACCTGG CTATGAGCTA CTGGCCTGA
 
Protein sequence
MTQQIVGTEL TFTQGFSAAE RQVLTDDAVE FLAELVSKFT PQRNKLLAAR ACWQQKIDQG 
ERPDFISETN SIRNEKWSIR GIPEDLRDRR VEITGPVERK MVINALNANV KVFMADFEDS
LAPSWDKVID GQINLHDAVN GTISYTNEAG KIYQLKPNPA VLIARVRGLH LPEKHVQWQG
EAIPGGLFDF ALYFFHNYRQ LLAKGSGPYF YLPKTQSWQE AAWWSEVFSF AEDRFSLPRG
TIKATVLIET LPAVFQMDEI LYHLRDHIVG LNCGRWDYIF SYIKTLKNHA DRVLPDRQSV
TMDKSFLSAY SRLLIKTCHK RGAFAMGGMA AFIPSKDAEK NAWVLNKVRA DKELEANNGH
DGTWVAHPGL ADTVMEVFSR VLGERRNQLE VLRENDALIS AAQLLEPCDG ERTEAGMRAN
IRVAVQYIEA WISGNGCVPI YGLMEDAATA EISRTSIWQW IHHEKSLSDG QLVTKALFRQ
MLKEEMLVVR EELGEARFNA GRFDEAARLM ERITTQDELI DFLTLPGYEL LA
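
The gene and protein sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that and to translate the cleaned DNA in frame. It assumes that translation table 11 uses the same amino-acid assignments as the standard code (the two tables differ in their permitted start codons). The names `clean`, `translate`, and `CODON_TABLE` are illustrative helpers, not part of any database tooling.

```python
# Amino-acid assignments for the standard code (assumed here to also apply to
# translation table 11), listed for codons in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks that split a sequence into blocks."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from this record.
# Every full line holds 60 bases (20 codons), so the last line stays in frame.
first_line = clean("ATGACGCAAC AGATAGTAGG CACGGAATTA ACGTTTACGC AGGGTTTTAG CGCTGCTGAA")
last_line = clean("GATTTCCTGA CTTTACCTGG CTATGAGCTA CTGGCCTGA")

print(translate(first_line))  # MTQQIVGTELTFTQGFSAAE
print(translate(last_line))   # DFLTLPGYELLA*
```

The two translated fragments match the start and end of the protein sequence listed in the record, with the trailing `*` corresponding to the TGA stop codon.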