Gene Ssed_1119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_1119 
Symbol 
ID5611607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp1332689 
End bp1334383 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content52% 
IMG OID640931968 
Productcholine dehydrogenase 
Protein accessionYP_001472858 
Protein GI157374258 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACAAT CAACAACAGA AAAGTACGAC TATATTATCG TTGGTGCAGG TAGTGCAGGC 
TGCGTGTTAG CCAACCGTTT ATCGGCTGAC GCAAATAACA GCGTATTACT GATTGAAACC
GGCGGCAGCG ACAGGAGCAT CTTTATTCAG ATGCCAACGG CCCTGTCTAT TCCCATGAAT
ACCCCAAAAT ATGCGTGGCA GTTTGAGACC GAAGCCGAGC CTCACTTGGA CAATCGCCGC
ATGCACTGCC CTCGTGGCAA GGTGCTGGGT GGCTCCTCAT CGATCAATGG CATGGTCTAT
GTCCGCGGGC ATGCCCGAGA TTTTGACGAG TGGCAACAAG AAGGTGCTAA AGATTGGGAT
TATGCCCATT GCCTACCCTA CTTCAAAAAA GCGGAAAGCT GGGCGTTTGG TGAAGATGAT
TATCGCGGTG TTGATGGCCC ACTAGCCGTC AACAACGGCA ACGAAATGAA AAACCCGTTA
TACCAAGCCT TTGTCGATGC GGGTGTCGAT GCCGGGTATA TGGCGACCAG TGATTATAAC
GGCGCGCAGC AAGAGGGCTT TGGCCCTATG CACATGACCA TTAAAAATGG CGTGCGTTGG
TCAACATCTA ACGCCTATCT AAGACCGGCC ATGAAGCGTG AAAACCTCAC CGTCATCACC
CATGCCCAGG TACATAAGGT GTTGTTTGAA GGCAAACAAA CCGTAGGTGT TCGTTTCGAA
CGCAAGGGCA AGATGACAGA TGTGCATTGC AGCAAGGAAG TGGTGTTATC GGCAGGCTCT
ATCGGCTCGC CGCATATACT GCAACTTTCG GGTATTGGCG CCGCAGAGAC ACTTGCCAAG
GCTGGCATCG AACAGGTACA TGAGTTGCCG GGTGTGGGTG AGAACCTGCA GGACCATCTC
GAGTTCTACT TTCAATTTAA GTGCCTTAAG CCCATTTCGC TCAATGGCAA GATCGACCCG
TTAAACAAAC TCTTTATCGG CACTCGCTGG ATCTTGAACA GAACAGGCCT GGGGGCGACA
AACCATTTCG AATCTTGTGG GTTTATTCGC TCTAAAGCGG GACTGGAGTG GCCGGATCTG
CAATATCATT TCTTGCCTGC CGCCATGCGC TACGACGGAA AAGAGGCCTT CGCGGGTCAC
GGCTTCCAGG TCCATATAGG ACACAACAAA CCAAAGAGTC GCGGCGCGGT GAAAGTGGTA
TCGAGCGATG CCCGTGTTGC CCCGAGTATT CAGTTTAACT ATCTATCCCA TAAGGATGAT
ATAGAAGGCT TTCGCGCCTG CGTTCGGTTA ACCCGGGAGA TCATCAATCA GCCGGCATTG
GATGAATACC GAGGCGAGGA GATACAGCCC GGGACTTCGG TTCAAACCGA TGAAGAGATC
GATACCTTCG TCAGAAGCTC GGTCGAGAGC GCCTATCACC CCTCTTGTTC GTGCAAGATG
GGAGAAGATG CGATGGCGGT GGTGGACTCA GAAACTAAGG TGCATGGTAT TCAAGGACTT
AGAGTGGTTG ATTCCTCTAT TTTCCCCACC ATACCTAACG GCAACCTTAA CTCGCCGACC
ATAATGGTGG CGGAGCGCGC AGCCGATATC ATCTTGGGCA TGACGCCATT GCCGGCCAGC
AGTGCGACAG TGACCTTTGC TCAGCAATGG CAGCAAAAAC AGCGGTTACG CGCACCAAAA
AGACAACTCG CCTGA
 
Protein sequence
MTQSTTEKYD YIIVGAGSAG CVLANRLSAD ANNSVLLIET GGSDRSIFIQ MPTALSIPMN 
TPKYAWQFET EAEPHLDNRR MHCPRGKVLG GSSSINGMVY VRGHARDFDE WQQEGAKDWD
YAHCLPYFKK AESWAFGEDD YRGVDGPLAV NNGNEMKNPL YQAFVDAGVD AGYMATSDYN
GAQQEGFGPM HMTIKNGVRW STSNAYLRPA MKRENLTVIT HAQVHKVLFE GKQTVGVRFE
RKGKMTDVHC SKEVVLSAGS IGSPHILQLS GIGAAETLAK AGIEQVHELP GVGENLQDHL
EFYFQFKCLK PISLNGKIDP LNKLFIGTRW ILNRTGLGAT NHFESCGFIR SKAGLEWPDL
QYHFLPAAMR YDGKEAFAGH GFQVHIGHNK PKSRGAVKVV SSDARVAPSI QFNYLSHKDD
IEGFRACVRL TREIINQPAL DEYRGEEIQP GTSVQTDEEI DTFVRSSVES AYHPSCSCKM
GEDAMAVVDS ETKVHGIQGL RVVDSSIFPT IPNGNLNSPT IMVAERAADI ILGMTPLPAS
SATVTFAQQW QQKQRLRAPK RQLA