Gene Csal_1514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1514 
Symbol 
ID4029210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1721738 
End bp1723420 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content65% 
IMG OID637966697 
Productcholine dehydrogenase 
Protein accessionYP_573566 
Protein GI92113638 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0264792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACAGG CTCGTGAATA CGACTACATC ATCATCGGGG CCGGTTCCGC CGGCAACGTA 
CTCGCCACTC GCCTGACCGA GGATCCGGAC GTCCAGGTGC TGCTGCTCGA GGCCGGCGGT
CCCGACTACC GCTTCGACTT CCGCACGCAG ATGCCGGCGG CGCTGGCCTA CCCCCTGCAG
GGCAAGCGCT ACAACTGGGC GTTCGAGACC GACCCCGAAC CCTACATGAA CAATCGCCGC
ATGGAGTGCG GACGCGGCAA GGGCCTGGGC GGGTCGTCGT TGATCAACGG CATGTGCTAC
TTGCGCGGCA ACGCGCTGGA TTACGACAAC TGGGCCAAGA TACCGGGCCT GGAGGACTGG
AACTACCTGC AGTGCCTGCC CTACTTCAAG CGCGCCGAGA CCCGCGACAT CGGCCCCAAC
GATTATCATG GCGGTGACGG CCCGGTGTCG GTGGCCACAC CCAAGGAAGG CAACAACGAG
CTCTACGGCG CCTTCATCCG CGCAGGCATC GAGGCCGGCT ATCCGGCCAC CGAGGACGTC
AACGGCTATC AGCAGGAAGG CTTCGGCCCC ATGGACCGCA CGACGACGCC CAACGGACGT
CGTGCCTCCA CGGCGCGCGG CTACCTGGAT ATCGCCAAGC AACGCCCCAA CCTGACCATC
GAGACGCACG CCACGACCGA TGTCATCGAA TTCGAGGGCA AGCGCGCCGT CGGCGTGAGC
TACGAGCGCA AGGGACAGGC CCAGCGTGTT CGCGCACGCC GCGAAGTGCT GCTGTGCGCG
GGCGCCATCG CCTCGCCGCA GATCCTGCAG CGTTCCGGCG TGGGCAATCC CGAGCATCTC
GAGGAATTCG ACATTCCCGT GGTGCACGAG CTGCCGGGCG TCGGCGAAAA CCTCCAGGAT
CACCTGGAAA TGTACATTCA GTACGAGTGC AAGAAGCCCA TTTCGCTGTA CCCGGCGCTC
AAGTGGTACA ACCAGCCCAA GATCGGTGCC GAGTGGCTGT TTTTCGGCAA GGGCATCGGC
GCCAGCAACC AGTTCGAGGC GGCCGGCTTC ATTCGTACCA ACGACCAGGA AGAGTGGCCC
AATCTGCAGT ACCACTTCTT GCCGATCGCC ATCAGCTACA ACGGCAAGAG CGCGGTGCAG
GCCCACGGCT TCCAGGCCCA CGTCGGCTCC ATGCGCTCCA TGAGCCGCGG TCGCATTCGC
CTGACATCGC GCGACCCCAA GGCCGCGCCG AGCATCCTGT TCAACTACAT GTCCCACGAC
AAGGATTGGC AGGAATTCCG CGACGCCATT CGCATCACGC GCGAGATCAT CGAGCAGCCG
ACGATGGACG AGTACCGCGG CCGCGAAATC TCGCCGGGGC CGAATGTGCA AAGCGACGCC
GAGCTCGACG AGTTCGTGCG CCAGCACGCC GAGACCGCCT ATCACCCCGC CGGCTCCTGC
AAGATGGGCA GTGCCGATGA CGCGATGGCG GTGGTCGATG GTGCGGGACG CGTGCATGGC
CTCGAAGGGC TGCGTGTCAT CGATGCCTCG ATCATGCCCG TGATCGCCAC CGGCAACCTC
AATGCGCCGA CGATCATGAT CGCCGAAAAG ATGGCCGACA AGGTTCGCGG TCGCGATCCG
CTGCCGCCGG CCAAGGTCGA CTACTACGTG GCCAACGGTG CGCCGGCCCG CCGTCGGGCG
TGA
 
Protein sequence
MTQAREYDYI IIGAGSAGNV LATRLTEDPD VQVLLLEAGG PDYRFDFRTQ MPAALAYPLQ 
GKRYNWAFET DPEPYMNNRR MECGRGKGLG GSSLINGMCY LRGNALDYDN WAKIPGLEDW
NYLQCLPYFK RAETRDIGPN DYHGGDGPVS VATPKEGNNE LYGAFIRAGI EAGYPATEDV
NGYQQEGFGP MDRTTTPNGR RASTARGYLD IAKQRPNLTI ETHATTDVIE FEGKRAVGVS
YERKGQAQRV RARREVLLCA GAIASPQILQ RSGVGNPEHL EEFDIPVVHE LPGVGENLQD
HLEMYIQYEC KKPISLYPAL KWYNQPKIGA EWLFFGKGIG ASNQFEAAGF IRTNDQEEWP
NLQYHFLPIA ISYNGKSAVQ AHGFQAHVGS MRSMSRGRIR LTSRDPKAAP SILFNYMSHD
KDWQEFRDAI RITREIIEQP TMDEYRGREI SPGPNVQSDA ELDEFVRQHA ETAYHPAGSC
KMGSADDAMA VVDGAGRVHG LEGLRVIDAS IMPVIATGNL NAPTIMIAEK MADKVRGRDP
LPPAKVDYYV ANGAPARRRA