Gene Spro_1956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_1956 
Symbol 
ID5607423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2137153 
End bp2138250 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content59% 
IMG OID640937494 
Productalcohol dehydrogenase 
Protein accessionYP_001478187 
Protein GI157370198 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0230313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATAA CAGCCGCAGT CAGTGAAAAA GCGACGGAGG GATTCTCCCT CAAACAGTTA 
CAGCTTGGAG AACCCCGCGC CGATGAGGTG TTGGCAAAAC TGGTCGCGAC CGGCCTGTGC
CATACCGATA TCGCCGCGCA CAAAGGCGTT ATATCGATGC CTGCGCCGGT GGTGCTCGGC
CATGAAGGCG CCGGGGTCGT AGTGCGGGTC GGGGCCGGGG TCAGTAAGGT GGCGCCCGGC
GATCATGTGG TGCTGTCGCT GGCCTCATGC GGCGTCTGCG ACAAGTGTAG CATCGGCATG
CCGACCTATT GCCGTCAACA TGTGCCATTG AACTGGCTGG CGCAGCGCAC CGACGGTTCG
GTCAGCCTGC ATGATGAAAA TGGCGATGTG CACAGCCATT TCTTCGGTCA GTCCTCTTTT
GCGCAGTATG CCGTGGTCAA TGTCAGCAGC ATTGTCCCTG TCGATAAGGC GATCCCGTTG
GAATACCTTG GGCCATTGGC CTGCGGACTG ATGACCGGCG CGGGCGCAGT GATGAACACT
CTGCGGCCGC ATGCGGGTTC TACGCTGGTG GTCTTTGGTC TTGGCGCGGT GGGCCTGGCG
GCGGTGATGG CAGCCCGGGT GGTGGGTTGC GGCCACATCG TCGCGGTGGA TATCAAAGAG
AACCGTCTGG CGTTAGCCAA AGAGTTGGGC GCTACAGAGG TGATCAACCC GAAAACGGCG
AATGTGGATG AAGTGCTTAA TCAACTGACC GAGGGACGCG GTGCGGACTA CAGCGTTGAA
GCCGCCGGGA ACGCGGGCGT CATGGCCGAT GCGGTGCGGG TGTTGGCGGA AAATGGCAAA
TGCGTACTGA CCGGCGTGGT ACCGGAGGGC GAATCTTTGC CGCTCGACAT TATGCACTTT
ATCCGCGGCC GCACGGTGCA GGGTTCGATC ATGGGCGATG CGGCACCGGC GATGTTTATC
CCGATGCTGG CGCAGCTATT CCAGCAAGGG CGGTTCCCGA TCGATCGCCT TATCCGTTTT
TATGCCATGA ATGAGATCAA CCAGGCGATG GCGGACTCAC AATCCGGTGA AACCATTAAA
GCCGTTATTC GTATGTAA
 
Protein sequence
MEITAAVSEK ATEGFSLKQL QLGEPRADEV LAKLVATGLC HTDIAAHKGV ISMPAPVVLG 
HEGAGVVVRV GAGVSKVAPG DHVVLSLASC GVCDKCSIGM PTYCRQHVPL NWLAQRTDGS
VSLHDENGDV HSHFFGQSSF AQYAVVNVSS IVPVDKAIPL EYLGPLACGL MTGAGAVMNT
LRPHAGSTLV VFGLGAVGLA AVMAARVVGC GHIVAVDIKE NRLALAKELG ATEVINPKTA
NVDEVLNQLT EGRGADYSVE AAGNAGVMAD AVRVLAENGK CVLTGVVPEG ESLPLDIMHF
IRGRTVQGSI MGDAAPAMFI PMLAQLFQQG RFPIDRLIRF YAMNEINQAM ADSQSGETIK
AVIRM