Gene Sbal223_3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3035 
Symbol 
ID7088944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp3594543 
End bp3596228 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content49% 
IMG OID643461919 
Productcholine dehydrogenase 
Protein accessionYP_002358943 
Protein GI217974192 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.82163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCCATT CACTTCAGCA AAAATATTAT AACTACATCA TAGTCGGCGC CGGTTCTGCA 
GGGTGTGTGC TCGCTAATCG TCTGTCTAAG GATCCTAAAA ACGAAGTACT GTTATTAGAA
ACCGGTGGCA GCGATAAGAG CATCTTTATT CAGATGCCGA CGGCGTTATC GATTCCGATG
AACAGCCCTA AATATGCTTG GCAGTTTGAG ACGCAGCCTG AGCCGCATTT AGATAATCGC
AAGATGCATT GTCCTCGGGG CAAAGTGCTT GGCGGTTCAT CTTCGATCAA CGGCATGGTG
TATGTCCGTG GTCATGCCAG GGACTTTGAT GAGTGGCAGG CGCAAGGGGC GACCAACTGG
GATTATAGCC ATTGCTTGCC GTATTTTAAA AAGGCAGAAA CCTGGGCCTT TGGTGGCGAT
GAGTATCGCG GCGAGTCTGG GCCGTTAGGC GTGAATAACG GCAACCAGAT GCAAAATCCC
TTATACCAAG CCTTTGTGGA TGCGGGCATC GCGGCGGGGT ATTTATCGAC CGCCGATTAT
AATGCGGCGC AGCAAGAAGG TTTTGGTCCC ATGCACATGA CGGTCAAAAA AGGCGTGCGT
TGGTCTACCG CTAATGCTTA TTTACGCCCG GCGATGGCCA GACGTAATCT CACTGTGTTG
ACCCACACCT TAGTGCATAA AGTGTTATTG GAAGGGCAAG CGGCCGTTGG CGTTCGCATT
GAGCACAAAG GTAAAGTTGA AGATATTCGC TGCAATAAAG AAGTGATTTT AGCTGCTGGC
TCAATAGGTT CGCCGCATTT GTTGCAATTA TCAGGTATTG GCGATCCCCA AGTATTGGCC
GATGCAGGGG TCGAGTTACA ACATGAGTTG CCCGGTGTCG GTCAGAATTT ACAGGATCAT
TTAGAGTTTT ATTTCCAATT CAAATGCCTT AAACCGATTT CACTCAATGG CAAGTTAGAT
CCTTTCAGCA AGTTCTTAAT CGGTGCTCGG TGGATCTTAG ATAAGTCTGG CCTAGGGGCC
ACGAATCACT TTGAATCCTG TGGTTTTATT CGCTCAAAAG CGGGTCTCGA GTGGCCAGAT
TTACAGTACC ACTTTTTACC TGCGGCTATG CGTTACGATG GTAAAGAAGC CTTTGCGGGG
CATGGTTTTC AGGTACATGT AGGCCATAAC AAGCCTAAAA GCCGTGGTGC CGTGACCTTA
GTGTCGGCAG ATCCTAAAGC GCCACCTAAG ATCCAGTTCA ATTATTTACA GCATCCTGAT
GATATTGAAG GCTTTAGAGC CTGCGTGCGT TTAACCCGTG AGATCATCAA TCAAGCGCCA
TTCGATGAGT ATCGCGGCGA GGAAATTCAA CCCGGCACAC AAGTGCAAAC CGATGAGCAA
ATCGATGCCT TTGTGCGTCA ATCGGTTGAG AGTGCTTATC ATCCTTCTTG CTCGTGCAAG
ATGGGCACAG ATGCGATGGC GGTGGTGGAT CCTGAAACTC GCGTGCACGG CTTGCAGAAT
TTACGGGTGG TGGATTCTTC TATTTTCCCG ACTATCCCGA ACGGTAATTT GAACGCGCCG
ACCATCATGT TGGCTGAGCG CGCCGCGGAT ATGATCTTAG AGGCTGAGAT GTTAGCGCCG
GCGGATTCCG CCGTTGTGGT TGCCGACGCT TGGCAAACCA CACAAAGAAG CACTGCGACA
AGTTAA
 
Protein sequence
MTHSLQQKYY NYIIVGAGSA GCVLANRLSK DPKNEVLLLE TGGSDKSIFI QMPTALSIPM 
NSPKYAWQFE TQPEPHLDNR KMHCPRGKVL GGSSSINGMV YVRGHARDFD EWQAQGATNW
DYSHCLPYFK KAETWAFGGD EYRGESGPLG VNNGNQMQNP LYQAFVDAGI AAGYLSTADY
NAAQQEGFGP MHMTVKKGVR WSTANAYLRP AMARRNLTVL THTLVHKVLL EGQAAVGVRI
EHKGKVEDIR CNKEVILAAG SIGSPHLLQL SGIGDPQVLA DAGVELQHEL PGVGQNLQDH
LEFYFQFKCL KPISLNGKLD PFSKFLIGAR WILDKSGLGA TNHFESCGFI RSKAGLEWPD
LQYHFLPAAM RYDGKEAFAG HGFQVHVGHN KPKSRGAVTL VSADPKAPPK IQFNYLQHPD
DIEGFRACVR LTREIINQAP FDEYRGEEIQ PGTQVQTDEQ IDAFVRQSVE SAYHPSCSCK
MGTDAMAVVD PETRVHGLQN LRVVDSSIFP TIPNGNLNAP TIMLAERAAD MILEAEMLAP
ADSAVVVADA WQTTQRSTAT S