Gene Apre_1051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1051 
Symbol 
ID8397838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1125300 
End bp1126577 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content33% 
IMG OID644995399 
ProductFolC bifunctional protein 
Protein accessionYP_003152800 
Protein GI257066544 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATT TTAATCAATA CTACGAATTT ATATTAAATA GGGGAAGTAC AAGTGGGGGT 
CATAGTCTAG AGAAGATAAA AAATCTCTTA GAATATTTTG ATAATCCCCA GGATAAAATA
AAGGTGATTC ATATTGCAGG AACTAATGGC AAGGGATCTA CTGCCAATAT GATCGCCAAT
ACACTTTCAA GGGAAAATAG GGTCGGCCTA TTTACTTCGC CATATATGAC CAAGATAAAT
GAAGCTATAT CAATTTCTGG AGTTGATATA AGCGATAGTG ATTTTGCAGA AATTATTGAT
AGGTTGAAGA AGCCTTTGGA AGAACTTGAT AAAAAAGGAC TCCACAATTC TTATTTTGAA
GTCTTGACAG CTATAATGTA TATTTATTTT TATGAAAAAA AGGTAGATGT CGCCGTTGTA
GAAGTAGGAC TAGGAGGAAG TCTTGATTCA ACTAATATTA TTAAAAGTCC TCTAGCCTGT
GTAATAACTA CTATTTCAAA AGACCACATC CAAATCCTAG GTGATAGCCT AGAAGAAATA
GCCCAAAACA AGGCGGGAAT TATAAAAGAT AAGTCAGAAG TTTTTCTGTA TCCAAAGGAA
GGTACAGTGA AGGAAGTTTT CATAAAAAAA ATAGAAAATA CTTCTAGCAG ACTCCATACT
TTTGATAAGG AAGAGATTAA TATAATAAAA ACAGGACCTG ATTATAATGA GTTTTCATTT
AGGTCTTATA AGAATATTAA GACAAGGCTT GTTGGAATCC ATCAGATATA TAATGCAGTG
ACAGCTCTTA TAACTTTGGA TTTCCTAAAA GATGAGTTTT CTATCTGTGA AAGAGATATT
TATGAAGGCT TATTAACTAC TAGAAATCCT GGTCGACTTG AACTTATAAA TAAGAATCCA
AGAGTTTTAG TCGATGGATC TCATAATAGA GAAGCCATAG ATGCCTTGAT TGACTCTATA
TCTTCTTACA AATATAGAAA ACTTATCGTT GGATTTTCTA TTTTAAAGGA TAAAGATTAT
GATTATGTAA TTGATAGTCT GGCTAAGATT GCCGATGAAA TCATAGTTAC AAAAATAAAA
GACAACCCAA GGGCCTTTGA TACAGATGAG CTTTATAGCC TAGTGAAAGA TAAAGCTAAA
AAGGCTATAG AGATTGTAGA CTTAGTAAAA GCTTATGAGT ATTCTAAAGA GCTTGCACAT
GAAGATGACC TAGTTCTTTG GTGTGGATCC TTGTATCTTG TGGGAGATAT ATTAAAAAAC
GAAAAAGCTC CTCGATAA
 
Protein sequence
MENFNQYYEF ILNRGSTSGG HSLEKIKNLL EYFDNPQDKI KVIHIAGTNG KGSTANMIAN 
TLSRENRVGL FTSPYMTKIN EAISISGVDI SDSDFAEIID RLKKPLEELD KKGLHNSYFE
VLTAIMYIYF YEKKVDVAVV EVGLGGSLDS TNIIKSPLAC VITTISKDHI QILGDSLEEI
AQNKAGIIKD KSEVFLYPKE GTVKEVFIKK IENTSSRLHT FDKEEINIIK TGPDYNEFSF
RSYKNIKTRL VGIHQIYNAV TALITLDFLK DEFSICERDI YEGLLTTRNP GRLELINKNP
RVLVDGSHNR EAIDALIDSI SSYKYRKLIV GFSILKDKDY DYVIDSLAKI ADEIIVTKIK
DNPRAFDTDE LYSLVKDKAK KAIEIVDLVK AYEYSKELAH EDDLVLWCGS LYLVGDILKN
EKAPR