Gene Sala_2331 details


Gene Information

Locus tag: Sala_2331
Symbol:
ID: 4080578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2459232
End bp: 2460818
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 67%
IMG OID: 638010711
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_617373
Protein GI: 103487812
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
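
The listed coordinates and lengths can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields above.
START_BP, END_BP = 2459232, 2460818

length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "1587 528"
```

Under these assumptions the result agrees with the Gene Length (1587 bp) and Protein Length (528 aa) fields.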


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.745478
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCAGT TCGACATCAT CGTCATCGGC GGCGGCAGCG CGGGGAGCGC GGCGGCCGGG 
CGGCTCGCCG AGGACGGGGC GCGCACCGTC TGTTTGGTCG AAGCGGGCGG GACGAACGAC
ATCGTGCGGG TGAAGACACC GGGTTTCATG CCCTTCATCC CCAAATCGTC GAACTGGCGA
TATGACACCG TGCCGCAACA GGGACTGAAC GGCCGCATCG GATATCAGCC GCGCGGGCGC
GGGCTGGGCG GGTCGAGCGC GATCAACGCG ATGGTCTATA TCCGGGGGCA CGCCTTCGAT
TACGACCAGT GGGCGGCGCT GGGCGCGACC GGCTGGAGCT ATGCCGACGT GCTGCCTTAT
TTCAAGCGCA GCGAGGGCAA TGAGCGCGGC GGTGACGAGT TTCACGGCGG GGACGGGCCG
CTGAATGTGA TGGACCAGCG CTGGCCCAAT GTGACGAGTC GACGCTTCGT CGAGAGCGCG
ACGGCGCTGC AATTGCCGCG CACTGCTGAT TTCAACGGCC CTGACAATGA AGGCTTCGGC
CTCTATCAGG TGACGCAGAA AGGCGGCGAG CGCTGGTCGG CGGCGCGCGC CTATGTCGAG
CCGCTGCGCG GGCGATCGAA CTTCGACATC CGCACCGGCG CGCTGGTCGA GAAGATTTTG
ATCGAGGAGG GGCGCGCGGT CGGTGTCACG ATCCGCTGCG GGCGCCGCCG CGAGACGCTG
CGCGCACGGG GTGGGGTCGT GTTGTCGGCG GGGGCGTTCG GCAGTCCGCA GATATTGATG
CTGTCGGGGA TCGGGCCCGG CGCGCATTTG CAGGAGATGG GGATTGCCGT CGCGCGCGAC
CATGCCGGGG TCGGCGACAA TCTGCAGGAC CATATCGATT ATGTGTCGAG CTGGGAAACG
CGCTCGACCG ATCCCTTCGG CGACAGTTTC GGTGGCACCT GGCGGATGGT GAAGGCGATC
GTCGAGCATC GCCGCCGCCG GACGGGGATC ATGACGACCT GTTTCGCCGA AGCGGGGGGA
TTCTGGAAAT CGCGCCCCGA CCTGCCTGCG CCCGACGTGC AGTATCATTT CGTGCCCGCG
ATGCTCGAGG ATCATGGCCG CACCAAGGTC AAGGGGCACG GCTTTTCGTG CCACGCCTGC
GTGCTGCGGC CTGAAAGCAG AGGCACGGTG CGGCTGGCGT CCTCCGATGC CGCGGCGGCA
CCGACGATCG ACCCCGGTTT TTTGACCGAC GAGCGCGACA TGGCGACGCT TCGCGCCGGG
GTGCGGATGA TGCACCGCAT CGTCGCGGCG CCGCCGCTCG CCGATTATGC GGGGGTCGAC
CGCCATCCGG TGAACCTCGA TGACGATGCC GCGCTCGACG CGCTGATCCG CAGCCGCGCC
GACACCGTCT ATCATCCCGT CGGCACGTGC CGGATGGGCA GCGATGCCGA TGCGGTGGTC
GATCCGACAC TGAAGCTCAA CGGCATCGAC GGGCTGTGGG TTGCCGATGC GAGCATCATG
CCACGACTGG TCAGCGGCAA CACCAACGCG CCGAGCATCA TGATCGGCGA AAGGGCAGCG
GATTTCGTGA AGGCGGCTTT GAGTTAA
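
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-content calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first displayed line, so its result is not expected to match the whole-gene GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base block display."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence above (60 bases).
first_line = "ATGGATCAGT TCGACATCAT CGTCATCGGC GGCGGCAGCG CGGGGAGCGC GGCGGCCGGG"

seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full sequence into `clean_sequence` in place of `first_line` gives the value to compare with the GC content field.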
 
Protein sequence
MDQFDIIVIG GGSAGSAAAG RLAEDGARTV CLVEAGGTND IVRVKTPGFM PFIPKSSNWR 
YDTVPQQGLN GRIGYQPRGR GLGGSSAINA MVYIRGHAFD YDQWAALGAT GWSYADVLPY
FKRSEGNERG GDEFHGGDGP LNVMDQRWPN VTSRRFVESA TALQLPRTAD FNGPDNEGFG
LYQVTQKGGE RWSAARAYVE PLRGRSNFDI RTGALVEKIL IEEGRAVGVT IRCGRRRETL
RARGGVVLSA GAFGSPQILM LSGIGPGAHL QEMGIAVARD HAGVGDNLQD HIDYVSSWET
RSTDPFGDSF GGTWRMVKAI VEHRRRRTGI MTTCFAEAGG FWKSRPDLPA PDVQYHFVPA
MLEDHGRTKV KGHGFSCHAC VLRPESRGTV RLASSDAAAA PTIDPGFLTD ERDMATLRAG
VRMMHRIVAA PPLADYAGVD RHPVNLDDDA ALDALIRSRA DTVYHPVGTC RMGSDADAVV
DPTLKLNGID GLWVADASIM PRLVSGNTNA PSIMIGERAA DFVKAALS
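
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below uses only the Python standard library and builds the codon table by hand. Translation table 11 shares the amino-acid assignments of the standard genetic code and differs in which codons may act as start codons, which is not relevant here because this gene begins with ATG. The `translate` helper is illustrative, not a library function; it stops at the first stop codon and writes unrecognized codons as X.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks and the last line of the gene sequence above.
print(translate("ATGGATCAGT TCGACATCAT CGTCATCGGC"))  # prints "MDQFDIIVIG"
print(translate("GATTTCGTGA AGGCGGCTTT GAGTTAA"))      # prints "DFVKAALS"
```

Both fragments match the corresponding start and end of the protein sequence listed above.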