Gene Information |
Locus tag | Mmar10_2927 |
Symbol | |
ID | 4285003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 3207868 |
End bp | 3209769 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638142422 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_758146 |
Protein GI | 114571466 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
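The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive (which is consistent with the values listed) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(3207868, 3209769)
print(length_bp)                  # 1902
print(protein_length(length_bp))  # 633
```

Both results agree with the Gene Length and Protein Length fields.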
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.914767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGG CGCCATCCAA GACACGCGGC GTCACGTCAA TCCGCAGCCA GATTTCGATG GTCGCGGTGG CCTTGTTCGA TCTTGCGGCC GGCGCGATCT CAATGTGGGT TGCGGTATTG CTGCGCTATC GGTTCGAACC GATAGCGCCG CCGGACGACA TCGTGCTGCA ATCCGTCGCT GTTTTCGCTC TGGCTTGCGC CATCATTTTC CCGATTGAAG GGCTCCATCG CGGCCTGTGG CGCTATACAG CGCTGAACGA TGCAGCGCGA ATTGTGCGGG CCATCGTTCT GGCCAACCTT GTGTTTCTGC CGATCCTTTT CCTGATCAAC CGGCTCGATG GATTTCCGCG CACCTCGATC CTGTTGGAAA TACCGATCCT GTTGTTCATC CTGCTCGCTG CCCGCCTGTT TGTGGCGGCC TTGAGGACCC AGGGTCTGCG GGGCGCGCTG CAGATCGAGG ATCGTACCAA GCCGACGGCG ATCCTGGTGG GCACCGACCT GGAGCTGGAT GACGCCTTGC GCGACCTTGG CCGGCGCAAC GGATCCGCCC CCTTCCGGAC AAAGGGCTTG ATCGAACCTG GCGCGACCCT CACGGGACAT GCCATTCGTG GCATTCCGGT CCTGGGCGGC CTGGAGGCCT TGCCCGCAGC CCTCAAACGC CTGTCTGAAG TCGAGAAGGA AGCTCCGCGA CTTGTCCTGG CCGGTGCCCA TACGGATGCC GACATCGTCA ATCAACTCAT CCGGATTGCC TCCCGGACGG GTGCAAAACT GTCCCGGGCA CGACCCTCGG ACGGCCGCGA AGCCTTTTCC CCGGTCGAAG CCGCGGATTT GCTCAACCGG CCGCCCCGCG CACCCAACCT GTCACCCGCT CGGCCGCTGA TTGAGGGGCG CCGGGTCCTG ATCACTGGTG CCGGCGGGAC CATCGGGTCT CAATTGTCAC GGCTCGCGGC GACGCTTGAG CCGGAAAAAC TGATCCTGTT CGACGCCTCG GAGGCCAATC TCTACGAAAT CGACCTGGAG TTTGCCAAAC GTTTCTCCGG TGTTGCCTGG CGCGCCGTGC TCGGCGATGT GCGCGACCGT GACCGGCTTG ACGAGATTTT CCGCGAGGAA CGGCCCGATG TCGTCCTGCA TGCGGCCGCC CTCAAACATG TCCCGATGAG CGAGCGAAAC CCGGGCGAGG CAGCACGCAC CAATATTATC GGCACCGTCA ACACGATCGA GATGTCGCAA CAATATGGCG CGCGTGTTGT CGCGCTGATC TCGACCGACA AGGCCGTCAA TCCCGGCAAT GTCATGGGGG CAACGAAACG CGCCGCCGAA CTTTACGCCC GTTCTGCCGC CCCGCGCTCG GACACGACCC GTGTCTGCGT CGTCCGCTTT GGCAATGTGC TCGGCTCAAC CGGCTCCGTC GTGCCCCTGT TCGAACGCCA GATCGAAGCG GGTGACCCGG TGACCGTAAC CGATCCGGAA GCGACACGTT ATTTCATGAC CGTCGAAGAG GCATCCGGGC TCGTGCTGGA TGCCGCGGCC CAGACCGCCG CCGATCCAGC CCTGAACGGG GCCCTGTATG TGCTGGACAT GGGGCGCCCC GTATCAATCA TGCGTCTGGC CCGGCAGCTG CTGCGCTTGC GCGGTCGGGA CCCCGACGCA CCGGGTGCGA TCCGTGTTGT CGGGCTTCGA CCGGGCGAAA AGCGTCATGA AAGCCTCGTT TATGACTTCG AGAGCACGAT CGAGACTTCA ATCGACGGCG TCTGGGCGGT TACCGGGCCG GCCCTTGACC CGGTCGCTGT CAGCGCCGGG 
CTGGATGCCA TTGTCAAAGC AGCCGGCAGC AATAACAGCG CTGGAGCATC CGCGGCGCTG GAGGCCCTTT GCAAATTGCG CCCCTCGCCC CTGGATGATT GA |
Protein sequence | MSKAPSKTRG VTSIRSQISM VAVALFDLAA GAISMWVAVL LRYRFEPIAP PDDIVLQSVA VFALACAIIF PIEGLHRGLW RYTALNDAAR IVRAIVLANL VFLPILFLIN RLDGFPRTSI LLEIPILLFI LLAARLFVAA LRTQGLRGAL QIEDRTKPTA ILVGTDLELD DALRDLGRRN GSAPFRTKGL IEPGATLTGH AIRGIPVLGG LEALPAALKR LSEVEKEAPR LVLAGAHTDA DIVNQLIRIA SRTGAKLSRA RPSDGREAFS PVEAADLLNR PPRAPNLSPA RPLIEGRRVL ITGAGGTIGS QLSRLAATLE PEKLILFDAS EANLYEIDLE FAKRFSGVAW RAVLGDVRDR DRLDEIFREE RPDVVLHAAA LKHVPMSERN PGEAARTNII GTVNTIEMSQ QYGARVVALI STDKAVNPGN VMGATKRAAE LYARSAAPRS DTTRVCVVRF GNVLGSTGSV VPLFERQIEA GDPVTVTDPE ATRYFMTVEE ASGLVLDAAA QTAADPALNG ALYVLDMGRP VSIMRLARQL LRLRGRDPDA PGAIRVVGLR PGEKRHESLV YDFESTIETS IDGVWAVTGP ALDPVAVSAG LDAIVKAAGS NNSAGASAAL EALCKLRPSP LDD
|
| |
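The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way such a string might be cleaned, checked for GC content, and translated. It assumes an unambiguous A/C/G/T sequence read in frame from the first base; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so the sketch does not handle alternative start codons (this gene begins with ATG, so that does not matter here).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGAGCAAGG CGCCATCCAA GACACGCGGC"))  # MSKAPSKTRG
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.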