Gene Spro_4016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4016 
Symbol 
ID5605011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4457006 
End bp4458241 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content61% 
IMG OID640939576 
Productgluconate 2-dehydrogenase (acceptor) 
Protein accessionYP_001480239 
Protein GI157372250 
COG category[C] Energy production and conversion 
COG ID[COG2010] Cytochrome c, mono- and diheme variants 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0979655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC GTCTCGCACT GCTGATCCTG CTGGTGGTGA TTATCGTCAT CGCCGTGCTG 
TGGTGGCGTG AAAATCGCCA TTACGACGGC CCGGTACAGC AGGTAACCGC CAGTACCGAA
CAGGTTGCCC GTGGCCGCTA TCTGGCTCAG GCCGCCGACT GCGCTGCCTG TCATACCGCC
AGCGGTGGCG CGCCAATGGC AGGGGGGTAT CCGCTGGATA CGCCGTTCGG CACTATCTAT
GGCAGTAACC TGACTCCTTC GGTCGATCAG GGGATCGGCC GCTGGACCAA GGACGACTTC
TTCCTGGCGG TGACCCAGGG CGTGGCACCG GGCGGTCGTC ATCTGTATCC GGCCATGCCT
TATACCTCGT ATAAAGGGAT GTCGCGTCAG GATGCCGATG ATATCTATGC CTATCTGATG
ACCCGTCCGG CGGTGGATGT TGCCATTCCG GAAAACGCCA TGCCGTTCCC GTTTAACCAG
CGCATGGCGC TGATTGGCTG GAATCTGTTG TTCCGTAGTC AGGATCCGCT GCCGGTCAGT
TCGCAGGGTG ATTCGGCGCA GTGGCAACGC GGCCGTTATC TGGCGGATAC GCTGGGCCAC
TGTGGGGAAT GCCACACGCC GCGCGGTATG CTGGGTCAAA TGAATCTGGC CAAACCTATG
CAGGGCGGTG ATCTTGGACG CTTTATGGCA CCGGACATCA CCCCGCACGG GTTGGCGCAG
CGCGGCTGGA CGCCGGACGA TCTGAACCGC TTCCTCTCGA CCGGCCTGGC ACCGCAGGGT
TCCGCCTTTA GTGAGATGCA TATGGTGGTG AGTCTCAGCA CCCGTCATTT GACGCCGGAA
GATGGTCAGG CGCTGGCGAC CTATCTGATG GGTGAACAGC CGCCGGCGGC TGTGCCGGTT
AAAATCGGTC AGGGCAGCGA TGCCGGGCGC ATCAGCTATC AGGATCAGTG CTCCGGTTGC
CATGCGCGTG AAGGCCAGGG TAAGCCTCAT GTGGCGGTCG CGATGCGTGA TAACGCTACG
CTGCGCCAAC CGGACGCTAA AAACCTGATT GTCTCGATAC TGGACGGTTT ACCGGCCCAG
CAATTCCCTA ACGGCGAAAG CATGCAGAGC ATGCCGGCCT TTGGCGAACG TCTCGATGAT
GCACAGGTGG CGGAACTGGT GAACTACCTG CGTGTGACCT GGGGCGGGTT GCCGGCGGAT
GTGACGGCCG AACAGGTGAA AGCGCTACGT AAGTAA
 
Protein sequence
MKKRLALLIL LVVIIVIAVL WWRENRHYDG PVQQVTASTE QVARGRYLAQ AADCAACHTA 
SGGAPMAGGY PLDTPFGTIY GSNLTPSVDQ GIGRWTKDDF FLAVTQGVAP GGRHLYPAMP
YTSYKGMSRQ DADDIYAYLM TRPAVDVAIP ENAMPFPFNQ RMALIGWNLL FRSQDPLPVS
SQGDSAQWQR GRYLADTLGH CGECHTPRGM LGQMNLAKPM QGGDLGRFMA PDITPHGLAQ
RGWTPDDLNR FLSTGLAPQG SAFSEMHMVV SLSTRHLTPE DGQALATYLM GEQPPAAVPV
KIGQGSDAGR ISYQDQCSGC HAREGQGKPH VAVAMRDNAT LRQPDAKNLI VSILDGLPAQ
QFPNGESMQS MPAFGERLDD AQVAELVNYL RVTWGGLPAD VTAEQVKALR K