Gene Spro_4226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4226 
Symbol 
ID5602790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4684931 
End bp4685989 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content59% 
IMG OID640939786 
Productalcohol dehydrogenase 
Protein accessionYP_001480448 
Protein GI157372459 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAAAA TGTTGGCGGC TTATTTACCC GGAAACGCCA CGGCAGAACT GCGCGAGGTG 
GATATTCCGC AACCGGGCAT TGGCCAGGTA TTAATTAAAA TGAAATCGTC CGGTATTTGC
GGCAGCGATA TTCATTATAT TTATCATCAG CATCGTGGTA CGGCGGCCGC ACCGGATCAA
CCCTTGTACC GGGGGTTTAT TAACGGTCAC GAGCCTTGTG GCCAGATTGT GGCGCTGGGG
GCCGGCTGCC GCCACTTCCG CGAGAGCGAT CGCGTGCTGG TGTACCATAT TTCCGGCTGC
GGCTTTTGCA GCAACTGCCG GCGAGGCTAT CCGATTTCCT GCACCGGCGT TGGCAAGGCC
GCCTATGGCT GGCAGCGGGA TGGCGGCCAT GCCGACTACC TGTTGGCGGA GGAAAAGGAT
TTGATTCATC TGCCGGATTC GCTCAGCTAT GAAGACGGCG CTTTTATCTC CTGTGGGGTC
GGCACGGCTT ATGAAGGTAT CGTGCGTGGC GAGGTCTCCG GCAGCGACCA CGTACTGGTC
GTGGGGCTGG GCCCGGTCGG TATGATGGCG ATGATGCTGG CGAAGGGACG CGGGGCAAAA
ACGGTGATTG GCGTTGATGT TATCCCGGAG CGTCTGGCGA CCGCGAAACG CCTGGGGCTG
ATGGATCACG GCTTCCTGAG CGGTGACGAC GTGACAGAAC GCATTCGCCA ATTGACCGCT
GGCGGGGCCA ACGTCACGCT CGACTGTTCC GGCAACGCCA AAGGGCGCCT GCTGGCGCTG
CAGGCCTCTT CGGACTGGGG AAGAGTGATC TACATTGGCG AAACCGGCAA GGTGGAATTC
GAGGTCAGCG CAGACCTGAT GCATCACCAG CGGCGGATCA TCGGCTCTTG GGTCACCAGC
CTGCACCACA TGGAAAAATG CTGCACCGAC CTGCACGACT GGAAAATGCA CCCGCATCAG
GCGATCACCC ACCGTTTTAA ACTCGGGCAG GCTGCCGAGG CCTATGCTCT GATGGCTTCT
GGCCAGTGCG GCAAAGTGGT GATCAATTTC GCCGATTAA
 
Protein sequence
MGKMLAAYLP GNATAELREV DIPQPGIGQV LIKMKSSGIC GSDIHYIYHQ HRGTAAAPDQ 
PLYRGFINGH EPCGQIVALG AGCRHFRESD RVLVYHISGC GFCSNCRRGY PISCTGVGKA
AYGWQRDGGH ADYLLAEEKD LIHLPDSLSY EDGAFISCGV GTAYEGIVRG EVSGSDHVLV
VGLGPVGMMA MMLAKGRGAK TVIGVDVIPE RLATAKRLGL MDHGFLSGDD VTERIRQLTA
GGANVTLDCS GNAKGRLLAL QASSDWGRVI YIGETGKVEF EVSADLMHHQ RRIIGSWVTS
LHHMEKCCTD LHDWKMHPHQ AITHRFKLGQ AAEAYALMAS GQCGKVVINF AD