Gene Spro_1808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_1808 
Symbol 
ID5605243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp1992358 
End bp1994013 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content58% 
IMG OID640937340 
Productputative dehydrogenase subunit 
Protein accessionYP_001478039 
Protein GI157370050 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000111783 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000501686 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAC CTGTATTTAC CGCCGATGGC AACGTCTCGG CCGATATTGT GATCGTAGGC 
TCCGGCATCG TCGGCGGCAT GATGGCCGAT CAATTGGTCA GCCAGGGTTA TTCTGTGCTG
GTGCTGGAAG CGGGCTTGCG CATCGAACGC GGCCAGGCGG TAGAGAACTG GCGCAATATG
CCTTTTGACA ACCGCGCCGG CTCAGATTAC CAGGGGCTGT ATCCACAATC TGAATTCGCC
ACCGCGCCGC TCTACTTCCC GGAAAACAAC TATGTTGCGC TGAGCGGCCC GAGCGCCGGC
AGTTTCAAGC AGGGTTATCT GCGCACCGTC GGCGGCACCA CCTGGCACTG GGCCGCCTCC
TGCTGGCGCC ACCTGCCCAG TGATTTCCAA ATGAAAACGC TGTACGGCGT TGGCCGCGAC
TGGCCGATTT CCTACGACGA GCTGGAACCC TACTATTGCC GGGCCGAAGA AGAAATTGGC
GTCGCCGGCC CCAACGATCC GCAACAGCAG TCCCCGGTTG AGCGCAGCAA ACCTTACCCG
ATGGATATGG TGCCCTGGGC TCACGGCGAC ATCCGCTTTG CCGAGGTGGT AAACCCGCAT
GGTTACCGCT CCGTTCCCAT CCCACAGGGG CGCAGTATCC ATCCGTGGAA AGGCCGGCCG
ACCTGCTGCG GTAACAATAA CTGCCAACCC ATTTGCCCGA TAGGCGCCAT GTACAACGGC
ATTCATCATA TTGAACGTGC TGAAATGAAA GGTGCGGTGG TGCTGGCCGA AGCGGTGGTC
TACAAGATCG ACACCGATGA GCAAAATCAG GTGACGGCAG TCCATTGGCT GGACAACAAA
AAACAGTCCC ACCGGGCCAC GGCCAAAGCT TTTGCGTTAG CCTGTAACGG CATAGAAACC
CCGCGCCTGC TGCTGATGGC GGCTAATGAG CGCAATCCCA ACGGTATCGC CAACGCTTCC
GATCAGGTGG GCCGCAATAT GATGGACCAT TCGGGCTTTC ACTGTACCTT CCTGGCGAAA
GAACCGCTGT GGCTGGGGCG TGGCCCGGCA CAAAGCAGTT GTCTGGTTGG CCCACGTGAC
GGTGAGTTTC GCAAAGACTA CTCGGCCAAC AAAATGATCC TCAACAATAT CAACCGGGTG
GTACCGGCTA CCCAGCAAGC GTTGGAAAAA GGTCTGGTCG GCAAAGAGTT GGACGCCGAA
ATCCGCCGAC GCGCCGCCTA TGGCGTCGAT TTATCCATCA GCCTGGAACC GTTACCAGAC
CCCAACAACC GCCTGACCCT GAGTAAAACC CGGAAAGATG CTCATGGCCT GCCTTGCCCG
GACATCCACT ACGACGTCGG CGACTATGTG CGTAAGGGCG CAGAGGCCGC GCATAAACAG
TTGGAGCACA TCGGCCAACT GTTTGATGCC GATGAATTCA ACATCACCAC CAGCCTGAAC
GCCAATAACC ATATTATGGG TGGCACCATC ATGGGCCACA GCCCCGAAGA CTCGGTGGTA
GACGGCAATT GCCGTACTCA TGACCATGCC AACCTTTGGT TGCCGGGCGG CGGTGCCATT
CCCTCCGCCA GCGTGGTGAA CAGCACCCTG ACCATGGCCG CATTGGGCAT CAAGGCCGCC
GATGATATTG CGCGCCAGCT GGCGGTGAAA TCATGA
 
Protein sequence
MKKPVFTADG NVSADIVIVG SGIVGGMMAD QLVSQGYSVL VLEAGLRIER GQAVENWRNM 
PFDNRAGSDY QGLYPQSEFA TAPLYFPENN YVALSGPSAG SFKQGYLRTV GGTTWHWAAS
CWRHLPSDFQ MKTLYGVGRD WPISYDELEP YYCRAEEEIG VAGPNDPQQQ SPVERSKPYP
MDMVPWAHGD IRFAEVVNPH GYRSVPIPQG RSIHPWKGRP TCCGNNNCQP ICPIGAMYNG
IHHIERAEMK GAVVLAEAVV YKIDTDEQNQ VTAVHWLDNK KQSHRATAKA FALACNGIET
PRLLLMAANE RNPNGIANAS DQVGRNMMDH SGFHCTFLAK EPLWLGRGPA QSSCLVGPRD
GEFRKDYSAN KMILNNINRV VPATQQALEK GLVGKELDAE IRRRAAYGVD LSISLEPLPD
PNNRLTLSKT RKDAHGLPCP DIHYDVGDYV RKGAEAAHKQ LEHIGQLFDA DEFNITTSLN
ANNHIMGGTI MGHSPEDSVV DGNCRTHDHA NLWLPGGGAI PSASVVNSTL TMAALGIKAA
DDIARQLAVK S