Gene Apre_1104 details


Gene Information

Locus tag: Apre_1104
Symbol:
ID: 8397891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1181133
End bp: 1182152
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 39%
IMG OID: 644995451
Product: phosphoribosylformylglycinamidine cyclo-ligase
Protein accession: YP_003152852
Protein GI: 257066596
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
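
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (neither is stated in the record). The function names are hypothetical, chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1181133
end_bp = 1182152
gene_length_bp = 1020
protein_length_aa = 339


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the assumptions above are right.
coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_consistent, protein_consistent)
```

Under these assumptions, 1182152 - 1181133 + 1 gives 1020 bp, and 1020 / 3 - 1 gives 339 aa, matching the listed values.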


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAC TAACTTATAA GGATAGCGGA GTCGATAAGG AAAAAGGCTA TAAGGAAGTT 
GAATTAATCA AAAAAATCGT AAAAGAAACA CATAGTAAAG AAGTCCTAAC AGATATAGGA
GGCTTTGCTG GGGCCTTTGC CCCAGATCTT ACAGGAATTG ATAATCCAGT TTTCCTAAGC
GGAACAGATG GGGTTGGGAC AAAGATTAAA CTTGCTATGG AGATGGATAA GCACGATACA
GTAGGGATTG ATTGTGTAGC CATGTGTGTC AATGACATCC TCTGCCAGGG AGGAAGGCCC
CTATTTTTCC TAGATTATAT AGCGACTGGT AAGCTTAATC CAGAAAAAAT GGCAAAGCTT
GTCGAAGGAG TTGCAAGAGG ATGTAAAGAA GCTTCAGCAA GTCTAATAGG TGGAGAGACT
GCCGAGATGC CAGGTATCTA TAAGGAAGAT GATTATGACC TGGCAGGCTT TGCTGTAGGA
ATTTGTGATA GGGATAAGTT AATTGATGGG AAAAGTCTAA AAGAAGGAGA TATAGCCATA
GGACTTTACT CATCAGGAGT TCACAGCAAC GGCTTTTCTC TAGTAAGGGC CAGCATGGAA
CAAGGAGGAG TCTCACTAGA TGATAGATTT AGTGAAGAAG AAAGTATTGG AGAAAAACTC
CTAAGACCTA CCAAAATCTA TGCTAAAGAA ATCAAATCCT TACAAGAAAA TATTGATCTA
AAAGCAATCG CCCATATAAC AGGAGGAGGT TTTTATGAAA ATGTCCCTAG AGTTTTAGGA
GATGAGTTGG GAGTAGACTT CGACCTAAGT AGGCTCAATC TCGATCCAAT CTTTACTAAG
ATTCAAGAAT GGGGCAACAT AGATACGGAT GAAATGTATC ATACCTTCAA TATGGGAGTA
GGAATGGTAG TATTTGTAGA CGAGAATGAC AAAGATTTGG CCCTAGACCT CCTAGAAGGC
AAGGCTCAAG TAATTGGTAA AGTAAGAAGC GGTAATAAGG ATATCAAAAT TAATTTATAA
 
Protein sequence
MAKLTYKDSG VDKEKGYKEV ELIKKIVKET HSKEVLTDIG GFAGAFAPDL TGIDNPVFLS 
GTDGVGTKIK LAMEMDKHDT VGIDCVAMCV NDILCQGGRP LFFLDYIATG KLNPEKMAKL
VEGVARGCKE ASASLIGGET AEMPGIYKED DYDLAGFAVG ICDRDKLIDG KSLKEGDIAI
GLYSSGVHSN GFSLVRASME QGGVSLDDRF SEEESIGEKL LRPTKIYAKE IKSLQENIDL
KAIAHITGGG FYENVPRVLG DELGVDFDLS RLNLDPIFTK IQEWGNIDTD EMYHTFNMGV
GMVVFVDEND KDLALDLLEG KAQVIGKVRS GNKDIKINL
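
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed value describes that line, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGGCAAAAC TAACTTATAA GGATAGCGGA GTCGATAAGG AAAAAGGCTA TAAGGAAGTT"
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 2))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare with the 39% listed in the record.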