Gene Information |
Locus tag | Oant_0697 |
Symbol | |
ID | 5380145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ochrobactrum anthropi ATCC 49188 |
Kingdom | Bacteria |
Replicon accession | NC_009667 |
Strand | + |
Start bp | 723617 |
End bp | 725485 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640833343 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001369250 |
Protein GI | 153008035 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
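The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is a minimal illustration, not part of the record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for this example.

```python
# Values copied from the Gene Information table above.
start_bp = 723617
end_bp = 725485
gene_length_bp = 1869
protein_length_aa = 622


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Report whether the recomputed values agree with the listed fields.
print("gene length matches:", computed_length == gene_length_bp)
print("protein length matches:", computed_protein == protein_length_aa)
```

Under these assumptions both checks agree with the listed values: the span covers 1869 bp, which is 623 codons, or 622 residues plus the stop codon.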
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTGA AATATCCATG GCAGAGACTG GCGCTTTTGC CCCGCGCCAG CAAACAGATC ATTCTGATGC TGAGCGACTG TGTTCTGCTG TTGCTGAGTG CTTATCTGGC GTTTGTCATC CGTCTTGGCT TCGTTTTTGT ACCGAACGAA GGGCAGACGT TCCTGATTTT CATCGCGCCC GTTCTGGCAA TTCCGGTATT CATCCGGTTT GGACTTTACC GCGCCATCAT CCGTTATCTG GCTGAACGCG CGATCTGGCC GATTTTCCAG GCTACCGCCA TCGCCGCGCT TTTCTGGGTT GCGCTCGTTT TCCTGATGGA ACTTTACGGT AGTACCGGTC TGCCGCGTTC GGTACCGTTA CTCTACTGGC TTTTGAGCAC AGTGTTCATA TCGGCCAGCC GTTTCGGTGC GAAGTGGCTT TTGCGTAGCT CCGAGACGGA CAAACGATAT ACGAGTTCTG CCTTGATTAT CGGCGTGGGA GAACCCGCCC GCCAGCTCGC GACGGCGCTC CGCAGCCATA GCGACACGTT GGTTGTCGGC TTCGTCGATC CTGATGGCCA GCTTAATGGC ATGGATATAA TCGGGCTGCG GGTCTATGGC GTGGAAGACA TTCCCGCTTT GATCGAAAAC TACGGTATCA AGCAGGTGGT GGTATCGGAA CCGGCGCTGG AACAAAAACA GCGCCAGGAT TTTGCGCGGC TTCTAGGGCG GCTTCCTGTC AGTACACGTA TTCTTCCGCC AATCGCCGAT CTGACGGCCG GGAAATACCT GGTCAGCTCG TTACGCAATG TGGAGATTGA CGATCTGCTC GGACGTTCGC CTGTGCCGCC GGATGCTGCT TTGTTGCGAG AGGTGGTCGA GGGTCGACGC ATCATGGTGA CGGGGGCTGG CGGCTCTATC GGCAGCCAGC TCTGCCGTAC CATCGCCCAG TGGAACCCTG CCTGCATCGT TTTGTTCGAG TCGAGTGAAT TTGCACTTTA CAATATTGAA AGGCAGCTCA GACAGCTAGC AAGTTGCGAG ATCGTGCCGA TCCTTGGGAC CGTCAGCAAC CGGGACTGCG TCGAGAATGC CATTCGCGAC AATAATGTCG ATACGGTCTA TCATTGTGCG GCCTATAAGC ATGTGCCCCT TGTCGAGAAA AACCCGCTGG TTGGTATATT CAACAATGTG TTCGGCACGC TGGAAGTCGC ACAGGCCGCT CTCAACACTG ATGTTGAGCG AATGGTCCTC ATCTCGTCCG ATAAGGCAGT GCGACCGACG AATGTGATGG GTGCAACCAA GCGTTGGGCC GAGTTGGTCG TCTATTATTG CGGTTTGCTG GCAGAGCAAT CTGGAAAGAA GAAAGCTTTC TATTCCGTCC GCTTTGGCAA TGTGCTGGGG TCGAACGGTT CGGTCGTCCC GCTCTTCCGC GAACAGATCG CCAATGGAGG TCCGATTACG CTGACGCATG AGGACATGAC CCGCTACTTC ATGTCGATCA AGGAAGCCGC CGAGCTTATT GTACAGTCTG GCGGCATTGC CGCGTCCGGC GACACGGTTC TTCTGGAGAT GGGCGAGCCG GTGAAAATCC GCGACCTGGC AGAGAACATG ATCCTGCTGG CAGGGCTGAC CATTCGAAAC GACGAAAACC CGCACGGCGA TATTACCATT GAGACGACGG GGATTCGCGA AGGCGAGAAG ATGTATGAAG AGCTCTTCTA CGATCCTGCG CAGGCGAAAC CGACGCGCCA TCCCAAGATA ATGCGCGCTC CTCGGGGTAA GAAGGCTGAG GTGGATGTTC CGGCCGGCCT CACTGCACTT 
CGTATCGCGA TGGAAAGCGG CGATCTCGAT GCGGTCCGCA AGGTTCTGTT CGGCGTTATA ACTTCCTGA
|
Protein sequence | MSLKYPWQRL ALLPRASKQI ILMLSDCVLL LLSAYLAFVI RLGFVFVPNE GQTFLIFIAP VLAIPVFIRF GLYRAIIRYL AERAIWPIFQ ATAIAALFWV ALVFLMELYG STGLPRSVPL LYWLLSTVFI SASRFGAKWL LRSSETDKRY TSSALIIGVG EPARQLATAL RSHSDTLVVG FVDPDGQLNG MDIIGLRVYG VEDIPALIEN YGIKQVVVSE PALEQKQRQD FARLLGRLPV STRILPPIAD LTAGKYLVSS LRNVEIDDLL GRSPVPPDAA LLREVVEGRR IMVTGAGGSI GSQLCRTIAQ WNPACIVLFE SSEFALYNIE RQLRQLASCE IVPILGTVSN RDCVENAIRD NNVDTVYHCA AYKHVPLVEK NPLVGIFNNV FGTLEVAQAA LNTDVERMVL ISSDKAVRPT NVMGATKRWA ELVVYYCGLL AEQSGKKKAF YSVRFGNVLG SNGSVVPLFR EQIANGGPIT LTHEDMTRYF MSIKEAAELI VQSGGIAASG DTVLLEMGEP VKIRDLAENM ILLAGLTIRN DENPHGDITI ETTGIREGEK MYEELFYDPA QAKPTRHPKI MRAPRGKKAE VDVPAGLTAL RIAMESGDLD AVRKVLFGVI TS
|
| |