Gene Spro_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_2010 
Symbol 
ID5603213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2202261 
End bp2203349 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content58% 
IMG OID640937548 
Producthypothetical protein 
Protein accessionYP_001478241 
Protein GI157370252 
COG category[C] Energy production and conversion 
COG ID[COG2055] Malate/L-lactate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.694664 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0216387 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTG AACACCGTAT CGATCGCCAA TGTCTTGAGC AGTTTGTGCA GGCGATTTGG 
CGCCATGCCG GCAGTACCGA TCGAGAGGCC GGGTTAGTGG CCGAACATCT GGTTCAGGCC
AATCTGGCCG GACACGACTC CCACGGCGTC GGCATGATCC CGAGCTATAT GGCGTCGCTG
GCGCAGGGGC ATTTACAGCT TAACGTTCAT GCCCAGGTGG TACGCGACGC CGGTGCGGTC
CTGACGATTG ACGGTGGTCA GGGTTTCGGC CAGGTGGTCG CCAGTGAGGC GATGGATAAA
GGCATCGAAC GAGCCAGGCA ACTGGGGCTG GCGGCGGTGG CATTAAACAA TTCGCATCAC
ATCGGCCGTA TTGGCCACTG GGCTGAACAG TGTGCACGCG CCGGTTTTAT CTCTATTCAC
TTCGTCAATG TCGTGGGCGA CCCTATGGTG GCACCCTTTG GCGGCAGCGA TCGTCGCTTT
GGTACCAACC CCTTCTGCGT TATTTTCCCC CGTCCCGGTA AAAAGCCGCT GCTGCTGGAT
TTCGCCACCA GCGGCATCGC TTTTGGTAAA ACCCGCGTGG CCTACAACAA AGGTCTGACC
GTCGCGCCGG GCTATCTGAT TGACCAGCAT GGACAACCCA CAGACGAGCC CAAGGTGATG
CACGAGCAGC CGTTTGGTTC GCTGCTGCCC TTTGGTGCGC ATAAAGGTTA TGCACTGGCC
GCCCTGTGCG AGATCCTCGG CGGTGCACTG TCGGGCGGGA GAACTACCCA TAGTGCTACG
CTGAAATCCA ACAGCGACGC CATTTTCAAC TGTATGACCA CCATTATCCT CAACCCAGAG
GCCTTTGCGG CACCTGAGAT GCAAAGTGAG GCTGAGGCGT TTATTGACTG GGTGAAGGCC
TCACCGCCAA GTGACGGTCG GCCGATTGAG GTGCCGGGGG AGTGGGAAGA GGCTAATCGC
GAACAACGTT TGCAACAGGG GATCCCGATA GATGCCAATA CCTGGCAGCA GATTTGTGCG
GCAGCCAGAC AGGCGGGCAT GCCTGACGAG GAGCTTGATG CTTACCTGAC ACAGGCGCTG
CGAGCATAA
 
Protein sequence
MSTEHRIDRQ CLEQFVQAIW RHAGSTDREA GLVAEHLVQA NLAGHDSHGV GMIPSYMASL 
AQGHLQLNVH AQVVRDAGAV LTIDGGQGFG QVVASEAMDK GIERARQLGL AAVALNNSHH
IGRIGHWAEQ CARAGFISIH FVNVVGDPMV APFGGSDRRF GTNPFCVIFP RPGKKPLLLD
FATSGIAFGK TRVAYNKGLT VAPGYLIDQH GQPTDEPKVM HEQPFGSLLP FGAHKGYALA
ALCEILGGAL SGGRTTHSAT LKSNSDAIFN CMTTIILNPE AFAAPEMQSE AEAFIDWVKA
SPPSDGRPIE VPGEWEEANR EQRLQQGIPI DANTWQQICA AARQAGMPDE ELDAYLTQAL
RA