Gene Sare_1624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1624 
Symbol 
ID5703405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1859268 
End bp1860944 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content68% 
IMG OID641271132 
Productcholine dehydrogenase 
Protein accessionYP_001536507 
Protein GI159037254 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.529123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAACCC AGCGTTACGA TTACGTCATC GTTGGTGGCG GTTCAGCCGG CAGCGCGCTC 
GCCGACCGAA TGTCCGCCGA CCCGGGCAAC CGGGTTCTCG TCCTGGAAGC GGGCCGTCCC
GACTACCCAT GGGACGTCTT CATCCACATG CCGGCGGCCC TGCCGTTCCC GATCGGGAGC
CGGTTCTACG ACTGGCAGTA CTCGTCGGAG CCCGAGCCGC ACATGCACGG GCGACGCATC
TACCACGCCC GGGGCAAGGT ACTGGGCGGG TCGAGCAGCA TCAACGGGAT GATCTTCCAG
CGGGGTAACC CGTTGGACTA CGAGCGGTGG GCGGCCGATC CGGGCATGGA GACCTGGGAC
TTCGCCCACT GTCTGCCGTA CTTCAACCGG ATGGAGTCCG CCCTGGGAGC GGATCCGGAC
GACCCGTACC GTGGCCACGA CGGCCCGCTG GTGTTGGAGC GTGGCTCCGC GGAGAACCCG
GTGATCCAGG CCATGCTCGA GGCGGCGGAG CAGGCCGGCT ACCCGCGCAC CACCGACGTC
AACGGTGCGC AGCAGGAGGG TTTCGCCCCG TTCGACCGGA ACGTTCGTCG TGGTCGGCGG
TTCTCCGCGG CCCAGGCCTA CCTGCGCCCG GCGATGAAGC GACCCAACCT CGAGGTACGT
ACCCGCGCGT TCGTCACCCG CGTCATCTTC CAGGGCACCC GCGCCGTGGG TGTCGAATAC
ACCCGAGGCC GGGGCACAAC ACCCCACCGG GTGTACGCCA ACGAGGTGAT CCTCAGCGGT
GGTGCGATCA ACACGCCACA GTTGCTGCAA CTGTCCGGCG TGGGTAACGC CGACGAACTG
CGCGGCCTGG GCATCGACAC CGTCGCGAAC CTGCCCGGGG TGGGCGAGAA CCTGCAGGAC
CACCTGGAGG TCTACATCCA GCACGCCTGC ACCCAGCCCG TGACGATCCA GCCCTACCTG
AACTGGCGCT ACGCGCCGTG GATCGGCGCG AAGTGGCTGT TCGGACGCAC CGGTCTGGGC
GCCACCAACC ACTTCGAGGC GGGTGGCTTC GTCCGGAGCA ACGACACCGT CGACTACCCG
AACCTCATGT TCCACTTCCT GCCGATCGCG GTCCGGTACG ACGGCACGTC CCCCGCCGGT
GACCACGGTT ACCAGGTGCA CATCGGGCCC ATGTACTCCG ACGCCCGCGG CTCGGTCAAA
ATCGTCAACC GTGACCCTCG GGTGCACCCC GCCCTGCGCT TCAACTACCT CTCCACCGAG
CAGGACCGGC GGGAGTGGGT GGAGGCGGTC CGGGTGTCCC GCGACATTCT CGGCCAGCCG
GCGCTGGCCC CGTTCAGCGG CGGTGAGATC TCCCCCGGGC CCGCAGTACA GACCGACGAG
GAGATCCTCG ACTGGGTGGC CCGAGAGGGC GAGACCGCGC TGCACCCGTC CTGCACGGCC
AAGATGGGCA CCGACGACAT GTCCGTGGTG AATCCCACCG ACATGCGGGT GCACGGGGTC
ACCGGGTTGC GGGTGGTGGA CGCGTCGGTG ATGCCCTACG TCACCAACGG CAACATCTAC
GCTCCCGTGA TGATGGTCGC CGAGAAAGCA GCGGACCTCA TCCTGGGCAA CACGCCGTTG
CCGCCGTCGA CGCTGGACTT CCACCGGCAC GAGCACGGCC CCTCGGCGGT GAAGTGA
 
Protein sequence
MTTQRYDYVI VGGGSAGSAL ADRMSADPGN RVLVLEAGRP DYPWDVFIHM PAALPFPIGS 
RFYDWQYSSE PEPHMHGRRI YHARGKVLGG SSSINGMIFQ RGNPLDYERW AADPGMETWD
FAHCLPYFNR MESALGADPD DPYRGHDGPL VLERGSAENP VIQAMLEAAE QAGYPRTTDV
NGAQQEGFAP FDRNVRRGRR FSAAQAYLRP AMKRPNLEVR TRAFVTRVIF QGTRAVGVEY
TRGRGTTPHR VYANEVILSG GAINTPQLLQ LSGVGNADEL RGLGIDTVAN LPGVGENLQD
HLEVYIQHAC TQPVTIQPYL NWRYAPWIGA KWLFGRTGLG ATNHFEAGGF VRSNDTVDYP
NLMFHFLPIA VRYDGTSPAG DHGYQVHIGP MYSDARGSVK IVNRDPRVHP ALRFNYLSTE
QDRREWVEAV RVSRDILGQP ALAPFSGGEI SPGPAVQTDE EILDWVAREG ETALHPSCTA
KMGTDDMSVV NPTDMRVHGV TGLRVVDASV MPYVTNGNIY APVMMVAEKA ADLILGNTPL
PPSTLDFHRH EHGPSAVK