Gene Spro_1515 details

Gene Information

Locus tag: Spro_1515
Symbol:
ID: 5603625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 1651113
End bp: 1652780
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 59%
IMG OID: 640937047
Product: choline dehydrogenase
Protein accession: YP_001477747
Protein GI: 157369758
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID: [TIGR01810] choline dehydrogenase
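
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1651113
END_BP = 1652780
GENE_LENGTH_BP = 1668
PROTEIN_LENGTH_AA = 555


def span_length(start: int, end: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for this record: 1668 bp and 555 aa.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The second check works because 1668 bp is 556 codons, of which the last is the stop codon.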


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.307618
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATACG ATTACATCAT TATCGGGGCC GGCTCGGCCG GTAACGTATT AGCCACCCGC 
TTAACCGAAG ACGCTGACGT CAGCGTGCTG CTGCTGGAAG CCGGCGGCCC GGACTATCGG
ATGGATTTTC GCACTCAGAT GCCGGCGGCG TTGGCTTTCC CGCTGCAGGG GCGCCGTTAC
AACTGGGCCT ATGAAACCGA CCCTGAGCCG CACATGAACA ATCGCCGCAT GGAATGTGGC
CGTGGTAAGG GACTGGGCGG TTCATCGCTG ATAAACGGCA TGTGCTACAT CCGCGGCAAC
GCGATGGATT TCGATAACTG GGCGAAGGCA CCGGGTCTGG AAGACTGGAG CTACCTGGAT
TGTCTGCCGT ATTTCCGCAA GGCGGAAACC CGCGATATCG GGCCGAACGA TTTTCACGGC
GGTGATGGCC CGGTCAGCGT TACCACGCCG AAAGCCGGTA ACAACGAATT ATTCCACGCG
ATGGTGGAGG CTGGTGTTCA GGCAGGCTAC CCGCGTACCG AGGATCTGAA CGGCTACCAG
CAGGAAGGCT TCGGCCCGAT GGATCGCACC GTAACGCCGA AAGGTCGTCG TGCCAGCACC
GCTCGCGGTT ATCTGGATCA GGCGCGTTCT CGCCCTAATT TGAAGATTGT GACCCACGCG
CTGACCGATC GCATCCGCTT CGACGGCAAG CGGGCGGTGG GCGTTGACTA CTTGCAGGGT
GAAGCAAAGG ATGTCACCAG TGCCCGTGCG CGTCGGGAAG TGCTGCTGTG CGCCGGGGCG
ATCGCCTCGC CACAGATCCT GCAGCGCTCC GGCGTGGGAC CGGCCGCCTT GCTGAACCGT
CTGGATATTG ATCTGGTACA CGAACTGCCG GGTGTGGGTG AAAACCTGCA AGACCATCTG
GAAATGTACC TGCAGTACGC CTGTAAAAAA CCGGTGTCGC TGTACCCGGC GTTGCAATGG
TTTAACCAGC CTAAAATCGG TGCGGAATGG CTATTCAATG GCAGCGGTAT TGGTGCCAGT
AACCAATTTG AAGCCGGTGG TTTTATCCGC AGCCGTGAAG AGTTTGCCTG GCCGAACATT
CAGTATCATT TCCTGCCGGT AGCAATTAAC TACAACGGCA GTAATGCGGT GAAAGAGCAC
GGTTTCCAGG CACACGTCGG TTCGATGCGC TCCCCGAGCC GTGGCCGGGT GCAGGTCAAA
TCCAAAGATC CGCGCCAGCA CCCGAGCATC TTGTTCAACT ATATGGCGAC CGAGCAGGAC
TGGCAGGAGT TCCGCGATGC CATCCGCATC ACGCGTGAAA TCATGGCGCA ACCGGCGTTG
GATGAATACC GTGGTCGTGA GATAAGCCCA GGGCCGGAAG TGCAGACCGA CGAGCAGTTG
GACGCTTTTG TTCGTGAACA TGCCGAAACC GCCTTCCATC CTTCCTGCTC ATGCAAAATG
GGTGAAGACG AAATGGCGGT GGTGGATGGT CAGGGCCGGG TGCACGGCCT GGAAGGGCTG
CGGGTGGTGG ATGCGTCAAT TATGCCGTTG ATCATTACCG GTAATCTGAA TGCCACCACT
ATCATGATCG CCGAGAAAAT CGCCGACCGC ATCCGCCAAC GCACGCCGCT GCCGCGCAGC
ACCGCAGAAT ACTACGTGGC CGGTAACGCC CCGGCGCGTA AACAGTAA
 
Protein sequence
MEYDYIIIGA GSAGNVLATR LTEDADVSVL LLEAGGPDYR MDFRTQMPAA LAFPLQGRRY 
NWAYETDPEP HMNNRRMECG RGKGLGGSSL INGMCYIRGN AMDFDNWAKA PGLEDWSYLD
CLPYFRKAET RDIGPNDFHG GDGPVSVTTP KAGNNELFHA MVEAGVQAGY PRTEDLNGYQ
QEGFGPMDRT VTPKGRRAST ARGYLDQARS RPNLKIVTHA LTDRIRFDGK RAVGVDYLQG
EAKDVTSARA RREVLLCAGA IASPQILQRS GVGPAALLNR LDIDLVHELP GVGENLQDHL
EMYLQYACKK PVSLYPALQW FNQPKIGAEW LFNGSGIGAS NQFEAGGFIR SREEFAWPNI
QYHFLPVAIN YNGSNAVKEH GFQAHVGSMR SPSRGRVQVK SKDPRQHPSI LFNYMATEQD
WQEFRDAIRI TREIMAQPAL DEYRGREISP GPEVQTDEQL DAFVREHAET AFHPSCSCKM
GEDEMAVVDG QGRVHGLEGL RVVDASIMPL IITGNLNATT IMIAEKIADR IRQRTPLPRS
TAEYYVAGNA PARKQ
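
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a sketch under stated assumptions, not the pipeline used to generate this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which start codons it permits). The sequences on this page are printed in blocks of 10 separated by spaces, so whitespace is stripped first.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 18 bases of the gene sequence shown above.
    first_block = "ATGGAATACG ATTACATCAT TATCGGGGCC"
    last_block = "CCGGCGCGTA AACAGTAA"
    print(translate(first_block))  # MEYDYIIIGA
    print(translate(last_block))   # PARKQ
```

The two printed fragments match the start (`MEYDYIIIGA`) and end (`PARKQ`) of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.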