Gene Dshi_1428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1428 
Symbol 
ID5712605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1483002 
End bp1484636 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content64% 
IMG OID641267341 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001532771 
Protein GI159043977 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGAC GCGATTTCCT GCGCCGTGCA ATGGCGCTGG GGGCGACCGC CGGCATGGCC 
GGTGCGCTCG CGACAGCCTG GTCCGGCACG GCAATCGCGC AGTCTTCGAC CACCGCGCCG
CAGCCCGACG GGGAATACGA CTTCATCGTG ATCGGCACGG GCTCCGCCGG GGCCGCTTGC
GTGTATCAAC TGGCTCAGAC AGGCGCGCGG ATCCTCGTGC TCGAAGCCGG GCGCAACGAC
GACCTCGAAG AGGTCCATGA CAGCCGCCTG TGGGCTGCGT CCCTTGGCAC CGACGCCACG
AAATGGTTCG AAACCCTGCC CTCCAGCCAT ACGGATGGAC GCAATCACAT GTGGCCGCGC
GGCAATGTGT TGGGCGGGAC ATCTGCCTTG AACGCGATGG TCTATGCGCG CGGGCACAGG
ACCGATTTCG ACGTCTGGGA GACGATGGGT GCCACCGGTT GGAGCTATGA AGACGTACTA
CCGCATTTCA TGGCGATGGA AAGCTATGAG CCCGGGGGCG AGAACCGCGG CACCAGCGGC
CCGATCTTTG TCAGCCAACC CCAGGACCCA CACCGCCACG AAGGGGCCGT CGCGTTCATG
GATGCCGCGG CGGGGCTGGG ATACAAAGAA ACGCCGTCCT TCAACTCCGA TCGGATGTCC
GGTCAGGCCT GGATCGATTT CAACATCAAG GACCAGCGGC GTCAGTCGTC TGCAGTCGCA
TTCCTGCGCC CGGCGATCGA GAACGGCAAC ATCACGCTGC TGACCGATGC CCCGGTCCAG
AAGCTGACCC TGGAGGGCAC GAAATGCACC GGGGTCACCT ACCTGCACAA CGGCGCGCCC
GTCAGCGTCC GGGCGGCGAA CGAGGTGATC CTCTCGGCCG GGGCCATCGA CAGCCCCAGG
CTGCTGATGC TGTCGGGGAT CGGCATCGCG TCCGACCTCA GGCAGGTCGG GATCGACGCC
GTCGTCGACT TGCCGGTTGG TGTCGGGCTC CAGGACCACA TTCTCGGCGC AGGTGTGAAC
TACGAAGCCA AGGGCCCCGT GCCGGTCAGC CATTACAACC ACTCCGAAGT CTACATGTGG
GAACGATCGG ATCCGGGCCT GCGGTCACCC GACATGATCG CGCTCTATGT TTCGGTGCCC
TTCGCCTCTA CCGGTCACAA GCTGGATTAC GAGCACGGCT ACTGCATTCT CTCGGGCGTC
GCGACGCCGC AATCGCGCGG CTACGTCAAG CTGGCGTCTG ACGACATCGC GGATGCCCCG
ATCATCGAGA CCAATTACCT GGCCGAGGAA CAGGATTGGA AGTCCTACCG TGCCGCGACC
GAGCTGTGCC GCGAGTTGGG CGCCTCGGAC GCTTATGCCG AGTTCCGCAA GCGCGAGAGC
CTGCCGCAGA AGGACGGGGA GCTGACGGAT GCCGAATGGC GCGACTTCCT CTCCGCGTCG
GTCAACACCT ATTTCCACCC CACATCCACA TGCCAGATCG GCAAGGTGGT GGAGCCGGAT
CTGCGCGTGA AAGGCATTGA GGGCCTGCGA GTTGCGGATG CGTCCGTCAT GCCGCAGATC
ACCACCTCCA ACACCAACGC GCCGACCATG ATGATCGGTT GGCGCGCGGG TGACATGATC
TCCAAAGCAA CCTAG
 
Protein sequence
MSRRDFLRRA MALGATAGMA GALATAWSGT AIAQSSTTAP QPDGEYDFIV IGTGSAGAAC 
VYQLAQTGAR ILVLEAGRND DLEEVHDSRL WAASLGTDAT KWFETLPSSH TDGRNHMWPR
GNVLGGTSAL NAMVYARGHR TDFDVWETMG ATGWSYEDVL PHFMAMESYE PGGENRGTSG
PIFVSQPQDP HRHEGAVAFM DAAAGLGYKE TPSFNSDRMS GQAWIDFNIK DQRRQSSAVA
FLRPAIENGN ITLLTDAPVQ KLTLEGTKCT GVTYLHNGAP VSVRAANEVI LSAGAIDSPR
LLMLSGIGIA SDLRQVGIDA VVDLPVGVGL QDHILGAGVN YEAKGPVPVS HYNHSEVYMW
ERSDPGLRSP DMIALYVSVP FASTGHKLDY EHGYCILSGV ATPQSRGYVK LASDDIADAP
IIETNYLAEE QDWKSYRAAT ELCRELGASD AYAEFRKRES LPQKDGELTD AEWRDFLSAS
VNTYFHPTST CQIGKVVEPD LRVKGIEGLR VADASVMPQI TTSNTNAPTM MIGWRAGDMI
SKAT