Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0439 |
Symbol | |
ID | 4285052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 516174 |
End bp | 517301 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638139902 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_755670 |
Protein GI | 114568990 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.369001 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGTTG TAGCCAGTCT TGTTGCCGCC ATCGGCCTGA CGTCGAGTGG ATTGGCCCAG TCCGACATTC ATGAAACCGA GCAGGCCGAC TTTCGCATGG AAGCGGTGGC TGAGGGGCTT GAGTTCCCGT GGTCGATGGC TTTCCTGCCG GATGGTCGGC TTCTCGTCTC CGAACGCGCA GGCCGGCTGC GCCTGATCGA GGGCGACACG CTGCGCCGTG CGCCAGTCGC CGGCCTCCCC GACATTATGG TCGAGGGACA GGGCGGCCTG CTGGGCCTGG CGGTCCACCC CGATTTTGCC GACAACGGAT TGATCTACTT CGCTTACGCT GAGGGATCGG CCAATGCCAA TCACACCGCG CTGGCCCGCG GGCGCCTGAA TGATGACGCC ACCGCGCTGA CAGAGGTCGA GACCCTGTTT CAGGTCAATT TTGACAAGGA ACGCGGTTTC CATTTCGGCG GTCGGCTGCA ATTCCTGCCA GACAGCACGC TTCTTCTGAC CCTTGGCGAT GGTGGATTGC ATCGCAATGA AGCGCAGGAC CTGACCAATC ATCTGGGCAC GATAATCCGC CTCAATGATG ACGGCACCGT GCCGTTCGAC AATCCGTTCG TGAGCGCTCG CGGGGCCCGA CCGGAAATAT ACACCTACGG TCATCGCAAT GTGCAGGGCA TTGCCATCAA TCCGCAGACC GGGTCGGTCT GGGCCCATGA GCACGGTGCC CGCGGTGGCG ACGAAATCAA CCTTGTCGAG CCTGGCAGGA ATTATGGCTG GCCACTCGTC ACCTACGGGG TGAATTACAA CGGAACGCCG ATATCCGATG CCACCCACGG CGACGGACTG GAACAACCGA TCTGGTTCTG GGATCCATCA ATCGCTCCCT CCGGCCTGGC CTTTTATGAC GGTGACGGGT TCGAAAACTG GCAGGGCGAC GCATTCGTCG GCGCCCTTGT CGGCTCCAGG CTCGTCCGCT TTGAGGTGGA AGGTGATCGC ATCATATCCC GCGAGGAATT GCTGGTCGGG CGGGGCGAAC GCATCCGCGA TGTTACAACC GGAGCCGACG GATCACTCTA CATCCTGACC GACGAGCGCG CCGGTTCGGT GCTGCGCCTG GTGCCGGTCA TTCCCTGA
|
Protein sequence | MRVVASLVAA IGLTSSGLAQ SDIHETEQAD FRMEAVAEGL EFPWSMAFLP DGRLLVSERA GRLRLIEGDT LRRAPVAGLP DIMVEGQGGL LGLAVHPDFA DNGLIYFAYA EGSANANHTA LARGRLNDDA TALTEVETLF QVNFDKERGF HFGGRLQFLP DSTLLLTLGD GGLHRNEAQD LTNHLGTIIR LNDDGTVPFD NPFVSARGAR PEIYTYGHRN VQGIAINPQT GSVWAHEHGA RGGDEINLVE PGRNYGWPLV TYGVNYNGTP ISDATHGDGL EQPIWFWDPS IAPSGLAFYD GDGFENWQGD AFVGALVGSR LVRFEVEGDR IISREELLVG RGERIRDVTT GADGSLYILT DERAGSVLRL VPVIP
|
| |