Gene Sala_0294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0294 
Symbol 
ID4082654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp292085 
End bp294085 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content66% 
IMG OID638008652 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_615350 
Protein GI103485789 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.254174 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCAA CATTCAGTCC GGGGCAGCTC AAGTTCCTGA AGGCGCTGAC CGAGGCCTTG 
TTCGACGGCG CGGCGATGGC GATCACGCCC GATCAGGTCG TCGCCAATAT CGCCGAGCTG
TTCGGCAAGG TCGGCGGCAC CAAATTGGAC GAGATGCGCG TCTCGCTCAT CGCCACCGAA
ATCGCGCTCG GACCGCTGTT CGCCGAGGTC GATGTCGCGG CGCGCGTCGA GCGGATCGCC
GACCGGCTGC GCGACAGCCG CATCGACCTG TTCCAGGACA TGGGGCGGCT GCGCGGCATC
GTCTATGCCT GCTATTACGG CCACTGGCTG CCGGGCGATC AGGACGCCAA CGTCGCCAAC
CCGGTCCACC GCCAGATCGG TTTCGCGCTC CCCCGGTTCC GCGCGCGCGG GCCGGGCGAC
GTCCCCATCA CGCCCGTGCA GGGCCGCGAG ATCGACCCCG CGCATATATT GACCGCCGAT
AGTCTCGACG ATGAATATGA CGTGATTGTC GTCGGTTCGG GCGCGGGCGG CGCGGTCGCG
GGCTATAATA TCGCGGCGCA GGGCTATCGG GTGCTGATCG TCGAGGCGGG GCCCTTCTAC
CCCAGCCACG CGATCACCCA CCACGAACTC GACATGATCG CGAATCTTTA CAAGCATGGC
GCGGTGCAGA CGACGACCAA CCGCGATTTC GTCGTTTTCC AGGGGCGGTG CGTCGGCGGA
TCGTCGACGA TCAACAACGG CATCTGCCTG CGCGTCAACG AGCCCGGCCG CACCCACCCC
GACGCAGAGG ATGTGCTCGC CAAATGGGCG ACCATCGGCG CGCCGATCGA CCCCGCGGCC
TTTCACGCCA GCTATGACGC GGTGCAGGCG ATGCTCGGCA TCGCGCGCAT CGAATCCCGC
AGCGGACGGC ACAACGGCCC GCACCTCATC AATGGCTGGC GCGCCTATGC CAACGCCTCG
TCCGATCCCA AAGACAAGCG CGCGATCGCC GACTGGTTCG ACAAGAATTT CGGCCCGCCG
AACACCCCGA ATGCCTGCGC CTATTGCGGC TATTGCAATT CGGGCTGCGC CTATGGCCGC
CGCATGGGCG TCGCGCAGAC CTATCTGCCC CAGGCGTGCC GCGATCATGG CGCGCGCATC
CTGCCGCGCA CCAAGGTCCA GCAGATCCTC TGGCAGACCG CGATCGACGG GCGACGCGAG
GCCGAGGCGG TCAGGCTCGT CCTGCCCGAC GGAGCGAACC GCCTCGTCCG CGCGCGCGTC
GGCGTCGTTG TCGCCGCGGG CACGATCGCC TCGTCGAAAC TGCTGGCACG CAGCGACATT
GACGGCACGG GTTATCAGGT GTCGCTGAAC GTCGCCTCGC CCGTCGTCGC GCTGATGCCG
CCGGGCGTCG GCGGCGATGC GTGGGACGAG GACCAGATGT CGAGCTATGT CGATTGCGGC
GACTTTCTGC TCGAAAGCCA TTTCCAGCCG CCGATGTCGA TGGCCTCGCT GATGCCCGGC
TGGTTCGCCG ATCACGCCGA CCGCATGAAG AATTACGGTC GCGTCCATTC GGCGGGCATT
CTTTTTCCCG CCGACCGGCG TGGGCAGATC GTCGACGGCA AGCTCCGGTT CCGGCTCGAT
TCAACCGACG ACCTGCCGCT GCTCCGCCGC GCGATGGCGA CGCTGACCAA GGTGCATTTC
GCCGCCGGGG CGATCGAATG CTATCCCGCG CTGGCGAAAG GACAGACGGT GACGCCGGAT
ATGGACATCG ACGCCTTTTT CGAGGCGGCG ATTCGCGAAC AGGACGATGT AACTCTGTCG
AGCAGCCACC CGCACGGCGG CAATGCGATG AACGAGGATT CGCAGCACGG CGTCGTCGAC
CTGGATTGCC GCGTCCACGG CACCACAAAT GTGCTCGTCA CCGACGCCAG CGTCTTTCCC
AGCTGCATCC GCGTCAACGC CCAATGGACC ACGATGGCAA TGGCGCATTA TGCGACGGCG
CGCGGCGATC CCTTCCGGTG A
 
Protein sequence
MTATFSPGQL KFLKALTEAL FDGAAMAITP DQVVANIAEL FGKVGGTKLD EMRVSLIATE 
IALGPLFAEV DVAARVERIA DRLRDSRIDL FQDMGRLRGI VYACYYGHWL PGDQDANVAN
PVHRQIGFAL PRFRARGPGD VPITPVQGRE IDPAHILTAD SLDDEYDVIV VGSGAGGAVA
GYNIAAQGYR VLIVEAGPFY PSHAITHHEL DMIANLYKHG AVQTTTNRDF VVFQGRCVGG
SSTINNGICL RVNEPGRTHP DAEDVLAKWA TIGAPIDPAA FHASYDAVQA MLGIARIESR
SGRHNGPHLI NGWRAYANAS SDPKDKRAIA DWFDKNFGPP NTPNACAYCG YCNSGCAYGR
RMGVAQTYLP QACRDHGARI LPRTKVQQIL WQTAIDGRRE AEAVRLVLPD GANRLVRARV
GVVVAAGTIA SSKLLARSDI DGTGYQVSLN VASPVVALMP PGVGGDAWDE DQMSSYVDCG
DFLLESHFQP PMSMASLMPG WFADHADRMK NYGRVHSAGI LFPADRRGQI VDGKLRFRLD
STDDLPLLRR AMATLTKVHF AAGAIECYPA LAKGQTVTPD MDIDAFFEAA IREQDDVTLS
SSHPHGGNAM NEDSQHGVVD LDCRVHGTTN VLVTDASVFP SCIRVNAQWT TMAMAHYATA
RGDPFR