Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apre_1114 |
Symbol | |
ID | 8397901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | - |
Start bp | 1197242 |
End bp | 1198537 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644995461 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003152862 |
Protein GI | 257066606 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000374822 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTATA AGACACAGAT GGAAGCTGCA AAGAATGGTT TTGTAACTGA AGAGATGAAG ATTGTTGCCA AGAAGGAAAA TGTTAGCGAA GAATTCTTGC TAGAGAAAAT AGCCAAGGGC GAAATTGTTA TTCCTAGAAA CAAGAACCAC AATTCAATTT CTCCAGAAGG TATTGGAACA GGCCTTAGGA CTAAGATTAA TGTAAACTTG GGTATTTCTA AGGACATCAA TGACCTTGAT CTTGAGATGC AAAAGGTAGA TATGGCCCTT GATATGGGAG CTGAATCTAT AATGGACCTT TCAAACTATG GAAAGACTCA AGAGTTTAGG AAAAGACTTA TAGAAAAATC TACAGCAATG ATTGGAACTG TACCAATGTA TGATGCGGTA GGATTTTTAG ATAAGGGGCT TAGCTTTATC AAGGCCCAAG AGTTCCTTGA CGTTGTTAGA AACCATGCGG AAAACGGCGT AGATTTTGTA ACAATCCATT GTGGAATTAA TAGAGCAAAT GCTGAGATTT TTATGAGAAA TAGAAGGGTT AACGAGATTG TTTCCCGTGG TGGATCCTTG TTATTTGGAT GGATGATGAT GAATGATGCT GAAAATCCTT TCTATGAATA TTATGATGAA CTTTTGGATA TTTTAAGAGA ATATGACGTA ACCTTATCAC TGGGAGATTC ACTAAGACCA GGAGGCATCC ACGATGCAAC AGATCCTGCC CAAATAGCTG AGCTAATCAC CCTAGGTGAG CTTACCAAAA GGGCCTGGGA GAAGGACGTT CAAGTAATTA TCGAAGGACC AGGCCATGTT CCAATAAACG ACATAGAAAT GAATATGAAG CTTGAGAAGA AACTCTGTCA CAACGCACCA TTCTATGTAT TAGGACCTTT AGTTTGTGAT GTGGCGCCAG GTTATGATCA TATCACAAGC GCAATCGGTG GAGCAATTGC TGCAAGTCAT GGGGCAGACT TCTTATGTTA TGTGACACCA GCAGAGCATT TGAGACTTCC TGATGTAGAA GATGTGCGTG AGGGAATAGT CGCAGCCAAA ATTGCAGCTC ATGCTGGAGA TATCGCTAAG CTAAAGGATG CTAGAAAATG GGACCTTGAG ATGAGTAAGA GAAGACAAAA ACTCGACTGG GAGGGGATGT TTGAACTTGC CATAGATCCA GAAAAGTGTA GAGCCTATAG GGCGGCTTCA GCTCCAGAAG AGGAAGATAC CTGTACTATG TGCGGGGCAA TGTGTTCTGC AAGAAATATG AATCTTATCC TTGAAGGTAA GGATATTGTC CTATAA
|
Protein sequence | MKYKTQMEAA KNGFVTEEMK IVAKKENVSE EFLLEKIAKG EIVIPRNKNH NSISPEGIGT GLRTKINVNL GISKDINDLD LEMQKVDMAL DMGAESIMDL SNYGKTQEFR KRLIEKSTAM IGTVPMYDAV GFLDKGLSFI KAQEFLDVVR NHAENGVDFV TIHCGINRAN AEIFMRNRRV NEIVSRGGSL LFGWMMMNDA ENPFYEYYDE LLDILREYDV TLSLGDSLRP GGIHDATDPA QIAELITLGE LTKRAWEKDV QVIIEGPGHV PINDIEMNMK LEKKLCHNAP FYVLGPLVCD VAPGYDHITS AIGGAIAASH GADFLCYVTP AEHLRLPDVE DVREGIVAAK IAAHAGDIAK LKDARKWDLE MSKRRQKLDW EGMFELAIDP EKCRAYRAAS APEEEDTCTM CGAMCSARNM NLILEGKDIV L
|
| |