Gene Apre_1114 details

Gene Information

Locus tag: Apre_1114
Symbol:
ID: 8397901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1197242
End bp: 1198537
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 41%
IMG OID: 644995461
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_003152862
Protein GI: 257066606
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
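
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1197242
end_bp = 1198537
gene_length_bp = 1296
protein_length_aa = 431


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks agree with the listed values under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```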


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000374822
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTATA AGACACAGAT GGAAGCTGCA AAGAATGGTT TTGTAACTGA AGAGATGAAG 
ATTGTTGCCA AGAAGGAAAA TGTTAGCGAA GAATTCTTGC TAGAGAAAAT AGCCAAGGGC
GAAATTGTTA TTCCTAGAAA CAAGAACCAC AATTCAATTT CTCCAGAAGG TATTGGAACA
GGCCTTAGGA CTAAGATTAA TGTAAACTTG GGTATTTCTA AGGACATCAA TGACCTTGAT
CTTGAGATGC AAAAGGTAGA TATGGCCCTT GATATGGGAG CTGAATCTAT AATGGACCTT
TCAAACTATG GAAAGACTCA AGAGTTTAGG AAAAGACTTA TAGAAAAATC TACAGCAATG
ATTGGAACTG TACCAATGTA TGATGCGGTA GGATTTTTAG ATAAGGGGCT TAGCTTTATC
AAGGCCCAAG AGTTCCTTGA CGTTGTTAGA AACCATGCGG AAAACGGCGT AGATTTTGTA
ACAATCCATT GTGGAATTAA TAGAGCAAAT GCTGAGATTT TTATGAGAAA TAGAAGGGTT
AACGAGATTG TTTCCCGTGG TGGATCCTTG TTATTTGGAT GGATGATGAT GAATGATGCT
GAAAATCCTT TCTATGAATA TTATGATGAA CTTTTGGATA TTTTAAGAGA ATATGACGTA
ACCTTATCAC TGGGAGATTC ACTAAGACCA GGAGGCATCC ACGATGCAAC AGATCCTGCC
CAAATAGCTG AGCTAATCAC CCTAGGTGAG CTTACCAAAA GGGCCTGGGA GAAGGACGTT
CAAGTAATTA TCGAAGGACC AGGCCATGTT CCAATAAACG ACATAGAAAT GAATATGAAG
CTTGAGAAGA AACTCTGTCA CAACGCACCA TTCTATGTAT TAGGACCTTT AGTTTGTGAT
GTGGCGCCAG GTTATGATCA TATCACAAGC GCAATCGGTG GAGCAATTGC TGCAAGTCAT
GGGGCAGACT TCTTATGTTA TGTGACACCA GCAGAGCATT TGAGACTTCC TGATGTAGAA
GATGTGCGTG AGGGAATAGT CGCAGCCAAA ATTGCAGCTC ATGCTGGAGA TATCGCTAAG
CTAAAGGATG CTAGAAAATG GGACCTTGAG ATGAGTAAGA GAAGACAAAA ACTCGACTGG
GAGGGGATGT TTGAACTTGC CATAGATCCA GAAAAGTGTA GAGCCTATAG GGCGGCTTCA
GCTCCAGAAG AGGAAGATAC CTGTACTATG TGCGGGGCAA TGTGTTCTGC AAGAAATATG
AATCTTATCC TTGAAGGTAA GGATATTGTC CTATAA
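
The gene sequence is displayed in space-separated blocks of ten bases. Below is a small sketch of how GC content could be computed from text in that layout, assuming GC content means the percentage of G and C bases over the full sequence length. The helper names `clean_sequence` and `gc_content` are hypothetical, and the example runs only on the first ten-base block, not on the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten-base block of the gene sequence: 2 G, 0 C out of 10 bases.
first_block = clean_sequence("ATGAAGTATA")
print(gc_content(first_block))  # 20.0
```

To apply this to the whole gene, pass all sequence lines, joined into one string, through `clean_sequence` first.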
 
Protein sequence
MKYKTQMEAA KNGFVTEEMK IVAKKENVSE EFLLEKIAKG EIVIPRNKNH NSISPEGIGT 
GLRTKINVNL GISKDINDLD LEMQKVDMAL DMGAESIMDL SNYGKTQEFR KRLIEKSTAM
IGTVPMYDAV GFLDKGLSFI KAQEFLDVVR NHAENGVDFV TIHCGINRAN AEIFMRNRRV
NEIVSRGGSL LFGWMMMNDA ENPFYEYYDE LLDILREYDV TLSLGDSLRP GGIHDATDPA
QIAELITLGE LTKRAWEKDV QVIIEGPGHV PINDIEMNMK LEKKLCHNAP FYVLGPLVCD
VAPGYDHITS AIGGAIAASH GADFLCYVTP AEHLRLPDVE DVREGIVAAK IAAHAGDIAK
LKDARKWDLE MSKRRQKLDW EGMFELAIDP EKCRAYRAAS APEEEDTCTM CGAMCSARNM
NLILEGKDIV L
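
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this on the first and last lines of the gene sequence. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares; table 11 differs in which codons may act as start codons, and this sketch does not handle alternative starts. The names `CODON_TABLE`, `clean_sequence` and `translate` are illustrative assumptions, not part of the record.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene: matches the first ten residues of the protein.
print(translate(clean_sequence("ATGAAGTATA AGACACAGAT GGAAGCTGCA")))  # MKYKTQMEAA

# Last line of the gene (in frame, since the preceding 1260 bases are a multiple of 3).
print(translate(clean_sequence("AATCTTATCC TTGAAGGTAA GGATATTGTC CTATAA")))  # NLILEGKDIVL
```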