Gene Dshi_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2072 
Symbol 
ID5713067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2191605 
End bp2193176 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content65% 
IMG OID641267994 
Productputative Gluconate 2-dehydrogenase flavoprotein 
Protein accessionYP_001533410 
Protein GI159044616 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGA CGTTCGATCT GAACGACGAC TCTGTCGTCG TGGTTATCGG CACCGGCGCG 
GGGGGCGGAA CGCTCGCCAA TGAGCTGGCG CAAAAAGGTG TGAAAGTGGT CTCGCTGGAG
GCGGGCGCGT TCCACGAACC GCAGGACTTC CTGCAGGATG AGTGGGCGAG CTTTGGCCAG
TTGGCCTGGC TGGATGCGCG CACGACCTCC GGCGACTGGC GGGTCAGCCG GGATTTCTCC
GGGTTGCCCG CATGGATCGT GAAATCCGTC GGCGGCTCTG CGGTGCACTG GGCGGGCGCG
AGCCTGCGGT TCCAGGAGCA TGAATGGAAG GCGCGCACGA CCTATGGCAA CGTGCCCGGC
GCGTCCCTGC TGGACTGGCC GATCGATGCC TCGGAGATGG ATCCGTGGTA CACCGCGGCC
GAAGACAAGA TGCACGTCAC CCGCACCGGG GATCGGCCCG GCCTGCCCGG CAACAACAAC
TACAAGGTGT TCGAAGCGGG CGCCAAGGCG CTCGGCTACA AGGACGTGCA CACCGGCCGG
ATGGCCATCA AGAGTTCAAC CAACGCGGAC GGCACCCCGT GCCAGCAGAT GGGCTTCTGT
TTCCAGGGCT GCAAGGTCGC GGCCAAGTGG TCGCCCAGCT ATGACGAAAT CCCGCGCGCA
CTGGATACCG GCAATTACGA ACTGCGCACC CAGGCCCATG TTCTGAAGAT CGAGCATGAC
GACACGGGCA AGGTCACCGG CGTGCTCTAC GCGGATGCGG ACGGCAACCA GCATTTGCAA
AAGGCGCGCG TGGTGGCCGT GGCGGGCAAC TCGATCGAAA GCCCACGGCT CCTGCTGAAC
TCCGCCTCGT CCATGTTCCC CGATGGGTTG GCGAACTCGT CTGGCCAGGT CGGGCGCAAC
TATATGCGCC ACATGACGGC CTCGGTCTAT GCGACCTTCG ACAAACCGGT GAAGATGTGG
CGCGGCACGA CCATGGCCGG GATCATCACC GACGAGGCCC GGCACGACCC GTCCCGCGGC
TTTGTCGGCG GCTACGAGAT GGAGACGCTC AGCCTCGGCC TGCCCTTCAT GGCCGCCTTC
CTCGACCCCG GCGCCTGGGG GCGGGAATTC ACCTCCGCGC TCGATGCCTA TGAGAACATG
GCGGGCATGT GGCTCGTGGG CGAGGACATG CCCCAGGAGA CCAACCGGGT CACGCTCAAT
ACCGATGTTC TGGACCAGTA CGGGCTGCCG GCACCGAACG TGCATTTCGA CGACCACCCC
AACGATATCG CCATGCGCAA CCACGCCTGG CAACAGGGGC AGGCGATCTA CGAGGCGGTG
GGGGCGACCC GCACCTTCCC GACGCCGCCC TATCCCAGCA CGCACAACCT CGGCACCAAC
CGGATGTCCG AGAATGCGCG CGATGGGGTG GTGAACAAAT GGGGGCAAAC CCACGACATC
CCCAACCTGT TCATCTCGGA CGGCTCGCAG TTCACGACCG GCGCGGCAGA GAACCCGACC
CTGACCATCG TGGCGCTGGC CATGCGGCAG GCAGACCACA TTGCCCGGGA GATGACGGCC
CAAAACCTCT GA
 
Protein sequence
MATTFDLNDD SVVVVIGTGA GGGTLANELA QKGVKVVSLE AGAFHEPQDF LQDEWASFGQ 
LAWLDARTTS GDWRVSRDFS GLPAWIVKSV GGSAVHWAGA SLRFQEHEWK ARTTYGNVPG
ASLLDWPIDA SEMDPWYTAA EDKMHVTRTG DRPGLPGNNN YKVFEAGAKA LGYKDVHTGR
MAIKSSTNAD GTPCQQMGFC FQGCKVAAKW SPSYDEIPRA LDTGNYELRT QAHVLKIEHD
DTGKVTGVLY ADADGNQHLQ KARVVAVAGN SIESPRLLLN SASSMFPDGL ANSSGQVGRN
YMRHMTASVY ATFDKPVKMW RGTTMAGIIT DEARHDPSRG FVGGYEMETL SLGLPFMAAF
LDPGAWGREF TSALDAYENM AGMWLVGEDM PQETNRVTLN TDVLDQYGLP APNVHFDDHP
NDIAMRNHAW QQGQAIYEAV GATRTFPTPP YPSTHNLGTN RMSENARDGV VNKWGQTHDI
PNLFISDGSQ FTTGAAENPT LTIVALAMRQ ADHIAREMTA QNL