Gene Sare_4942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4942 
Symbol 
ID5706492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5611099 
End bp5612661 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content67% 
IMG OID641274337 
Productcholine dehydrogenase 
Protein accessionYP_001539679 
Protein GI159040426 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.953244 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGACT TCGTAGTGGT CGGCGGCGGT ACAGCGGGTT GCGTCCTGGC AAGCCGGCTA 
AGCGAGGACC CCTCCGTCAC GGTGTGCCTG GTCGAAGCCG GGCCAGCCGA CAATCACGAT
AACTTTCGTA TCCCGGTAGC TGGCGGGAAG TTCTTCAAAA CACGGTTCGA CTGGGACTAC
GACAGTCATC CTGAACAGTT CTGTGATGGC CGCCGTGTTT ACCTTCCGCA AGCGCGAGTG
CTCGGTGGCG GAAGCTCGGT TAATGGCATG GTCTACATTC GCGGGAATCG CGCCGACTAC
GACGAATGGC AGCAGCCGGG ATGGAGCTAC GACGAGTTAC TGCCGTTTTT CAAACGGTCC
GAGGACAACG AGCGGGGCGC TGATGAGTTC CACGGGGCCG GTGGACCGAT GCGGGTCAGT
GACGGACGCG CGCACAGCCC GAGCGCCATG GCCTTCACCC AGGCGGCACT CGACGCCGGC
TACCCGGCCA ACCCCGACTT CAACGGCGCG GTCCAGGAGG GCTTCGGGGA GTACCAGGTG
ACCCAGCGGG ACGGCCGTCG GGCCAGCGCG GTCACCGAGT TCCTGCATCC GGCGAGGCAC
CGTCCGAACC TCGTCGTCGA AACTAATCTG CAGGTACAGC GGATCATGAT CGAGAACGGG
CGGGCGGCCG GTGTGGTCGG CAACCGGTTC GACGACCTGG TCGAACTTCG GGCCGAGCGG
GAGGTCATTG TCTCCGCAGG CACGTACAAC TCACCACACC TGCTCATGCT CTCCGGGATC
GGGCCCGCCG ATCTACTGCG CGCCTTCGAG CTGCCGGTCT TTGTCGACCA GCCCCAGGTC
GGGCAGAACC TCCAGGACCA CCCGCACATC TGGCTCAGCT ACCGCCACGA TCTGCCGGTG
AGCCTACTGG CAGCGGCCGA GTCCGAGCGC GTCCACCAGT ACGAACGCGA TCGCACCGGC
ATGCTCGCCT CGAACGGTCC GGAGAGCGGC GGCTTCGTCC GGACCAGTGC GGCGCTGGCC
GGCCCCGACC TCCAGTTCAT CTGCCTGCCG ATGATGGTCG CGGACACCTT CCTCTCGCCA
CCGACCGGGC ACGGAGTCTC CTTCGGTGCC TCGGTGATGA GGCCGGTGAG CAGCGGCCAC
GTGACGCTGT TCAGCGGCGA GCCGACCGCC AAGCCCAAGA TCGTGCAGAA CTACCTCGCC
GATCCCGCCG ACCTGCAGAC GGCGGTCAGC GGCCTGCGGA TCAGCCTGGA GCTGTCCCGC
CAGGCCGCGC TGAAGCCCTA CGCCGTCGAG CCGTCCGCGG CGCCGAGTTC CGACACGGAA
ACCGACCTGC GGGCGTATGC GCGCAGCCAC GTCCAGACCG GGCTGCATCC GGTCGGTACC
TGCGCGATGG GCCGGGTCGT TGACGCGGAA CTGCGGGTGT TCGGAGTCGA CGGGCTGAGG
GTCGTGGACG CCTCCGTCAT TCCCTTGATC ATCCGGGGTA ACACGAACGC GCCGGTGATG
GCCGTGGCCG AGAGGGCGGC AGATCTCGTC CGCGGCGCAC AATCCCTGCC CGGCGCGAGG
TAG
 
Protein sequence
MYDFVVVGGG TAGCVLASRL SEDPSVTVCL VEAGPADNHD NFRIPVAGGK FFKTRFDWDY 
DSHPEQFCDG RRVYLPQARV LGGGSSVNGM VYIRGNRADY DEWQQPGWSY DELLPFFKRS
EDNERGADEF HGAGGPMRVS DGRAHSPSAM AFTQAALDAG YPANPDFNGA VQEGFGEYQV
TQRDGRRASA VTEFLHPARH RPNLVVETNL QVQRIMIENG RAAGVVGNRF DDLVELRAER
EVIVSAGTYN SPHLLMLSGI GPADLLRAFE LPVFVDQPQV GQNLQDHPHI WLSYRHDLPV
SLLAAAESER VHQYERDRTG MLASNGPESG GFVRTSAALA GPDLQFICLP MMVADTFLSP
PTGHGVSFGA SVMRPVSSGH VTLFSGEPTA KPKIVQNYLA DPADLQTAVS GLRISLELSR
QAALKPYAVE PSAAPSSDTE TDLRAYARSH VQTGLHPVGT CAMGRVVDAE LRVFGVDGLR
VVDASVIPLI IRGNTNAPVM AVAERAADLV RGAQSLPGAR