Gene Information |
Locus tag | Apre_1272 |
Symbol | |
ID | 8398061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | - |
Start bp | 1364464 |
End bp | 1366434 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644995616 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_003153016 |
Protein GI | 257066760 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
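The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative and not part of the record; it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for the example.

```python
# Values copied from the Gene Information fields above.
start_bp = 1364464
end_bp = 1366434
gene_length_bp = 1971
protein_length_aa = 656

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final
# codon is a stop, which contributes no residue to the protein.
codons, remainder = divmod(span, 3)
coded_residues = codons - 1

print(span, codons, remainder, coded_residues)  # 1971 657 0 656
```

Under those assumptions the span matches the stated Gene Length and the residue count matches the stated Protein Length. The Strand value of - does not change the arithmetic; it only indicates that the gene is read from the reverse strand of the replicon.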
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATT CGAAAATAAT TGACAAGAAA AAACTAATAC TAGGATTTTT TGATATATTA ATTATCAATT TAGCTTATTT TCTAGCCCTG TATTTCAGAT TTGATATGAA TTTCAGGGCC ATACCTTTGG ATTTCCTTGA TGCTTTCAAG ACCTATGCTA TTTTTAATAC CATCCTTACG ATAGCTATTT ACGGTCTACA CAAGATGTAT AACATGATAC TGGAGTATGC TTCCTACAAG GAGCTTATTA ATATCACTCA AGCTGTTATT CTATCTCTTA TAAGTCACGC TGTATTAATG ACAATTTTTG TCTATAGAAT GCCAATATCA TACTATATAT TCGGATTTAT GATTCAATAT GTTTTGACTG TGGCATCAAG ATTTATCAAG AAGATGATGA TTCACAAAAG CTACAAGACT GACAAGAAGA ACCAGTCTTA CAAGAGAGCC CTAATCGTTG GGGCAGGATC TGCAGGACAG ACTCTAAAAA GAGATATATC TAATTCCAAC AATGGTGGCA AGACTAATAT TCTTGTTGTT GGCTTCATAG ATGATGATCC AAAGAAGAAA AATCAATACA TAGATAATAG TAGAATCTTC GGTGGAAGGG ATATGATAAA GGAAATTGTC GAGGAAGAAG CCATCGATGT TATTCTTGTA GCTATTCCTT CTGTAGAGGA AGTCGAGAAG AGAGAAATCC TAAGAATATG TAACGAAACA GGATGCGAGG TCAAAGTTCT TCCTGGTATT TACCAACTTG TTTCAGGAAA AGTTACCATG TCTACTATGA AGGATATCCA AATCGAAGAC CTTTTAGGAC GTGATCCGGT AAAGATTTTC TCCAACGAAA CTTTCGATTA CCTCAATGAC AAGGTAGTCC TAGTTACAGG AGGGGGAGGA TCCATAGGAT CTGAACTTTG TAGGCAAATA GCTCAATATG GTCCAAGACT ACTTATAATA TTCGATATAT ACGAGAACAA TGCCTATGAA ATAGAGCAAG AACTTAAGAG AAACAACAAG GACCTCAACT TTATAACCCT AATAGGCTCT GTAAGAGACT ACAAGAGAGT GGAGAAGGTA TTTAAGACCT ATAAACCAGA CATAGTATTT CACGCAGCAG CCCACAAGCA CGTACCGTTG ATGGAAGTAA GTCCAGTGGA AGCTATCAAG AACAATGTCA GAGGAACATA CAATGTCGCC CTACTTTCCC TAATCTATGA CGTTCAAAGA TTTGTTCTCA TATCAACAGA CAAGGCAGTC AACCCAACAT CAGTTATGGG AGCAACCAAG AGAGTTTGTG AGAAGATAAT CCAAGGAATA AATGACATAA GAGATAGCAA AGAATATAAT AATCTTGCAA AGGTAATTGT CCAAGATGGG GACAGAAATA TTACAATTAA TCCTGAAGAC TTGCTAGAAG GGAAAAATCC AGGAACCGAA TTTGTTGCAG TTCGTTTTGG TAATGTATTG GGATCGAATG GTTCTGTAAT ACCACTATTT AAAAAGCAAA TCGCAGCGGG TGGACCTGTT ACAGTTACCC ACCCAGAGAT AATCAGATAC TTTATGACAA TTAAGGAAGC TGTAAAGTTA GTCCTTCAAG CTGGATCCAT GGCCCAAGGC GGTGAAATAT TCGTCCTAGA TATGGGAGAA CCAGTCAAGA TTGACGACCT GGCAAGACAG CTAATAAGAC TATCAGGTTA TCAACCAGAC ATAGATATGC CTGTAGTCTA TACGGGACTT AGACCTGGAG AGAAGCTCTA CGAGGAAAGA CTGATGGACG AGGAAGCCCT CACAGATACC 
CACATTGAGG GAATATCTGT AGGCCGACCA CTAGACTTCT CTAGGGGAGA ATTTTTCGGG AAACTTGATA AAGTAATCAA TCAAGAAAAT ATTGATGAAC TTGATATCAT AGCAACGATT AATGAGTTAA TTACTACATT TATAGGAAAG GATAATTCGA AGGGAGAATA G
|
Protein sequence | MKNSKIIDKK KLILGFFDIL IINLAYFLAL YFRFDMNFRA IPLDFLDAFK TYAIFNTILT IAIYGLHKMY NMILEYASYK ELINITQAVI LSLISHAVLM TIFVYRMPIS YYIFGFMIQY VLTVASRFIK KMMIHKSYKT DKKNQSYKRA LIVGAGSAGQ TLKRDISNSN NGGKTNILVV GFIDDDPKKK NQYIDNSRIF GGRDMIKEIV EEEAIDVILV AIPSVEEVEK REILRICNET GCEVKVLPGI YQLVSGKVTM STMKDIQIED LLGRDPVKIF SNETFDYLND KVVLVTGGGG SIGSELCRQI AQYGPRLLII FDIYENNAYE IEQELKRNNK DLNFITLIGS VRDYKRVEKV FKTYKPDIVF HAAAHKHVPL MEVSPVEAIK NNVRGTYNVA LLSLIYDVQR FVLISTDKAV NPTSVMGATK RVCEKIIQGI NDIRDSKEYN NLAKVIVQDG DRNITINPED LLEGKNPGTE FVAVRFGNVL GSNGSVIPLF KKQIAAGGPV TVTHPEIIRY FMTIKEAVKL VLQAGSMAQG GEIFVLDMGE PVKIDDLARQ LIRLSGYQPD IDMPVVYTGL RPGEKLYEER LMDEEALTDT HIEGISVGRP LDFSRGEFFG KLDKVINQEN IDELDIIATI NELITTFIGK DNSKGE
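The relationship between the two sequences above can be sketched in code. This is a minimal illustration, not the database's own pipeline: it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons, and that does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last three blocks of the gene sequence, as printed above.
first_blocks = "ATGAAGAATT CGAAAATAAT TGACAAGAAA"
last_blocks = "GATAATTCGA AGGGAGAATA G"

print(translate(first_blocks))  # MKNSKIIDKK
print(translate(last_blocks))   # DNSKGE*
```

The two outputs agree with the start and end of the listed protein sequence, with the trailing `*` standing for the stop codon TAG. Passing the full gene sequence to `gc_fraction` would give the value that the GC content field reports as a rounded percentage.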