Gene Apre_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1111 
Symbol 
ID8397898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1192694 
End bp1194349 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content41% 
IMG OID644995458 
ProductFormate--tetrahydrofolate ligase 
Protein accessionYP_003152859 
Protein GI257066603 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.596432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACTG ATATTGAAAT AGCGAAGGAA GCCCAACTAA AAGACATCGA TGAGATCTGT 
AAGGATTTGG GGATCGATGA TTATGAGAAA TACGGAAATT ACAAGGCTAA ACTGCCGTTA
GCCTATGCGG GTAAGATGAA GAAAGACTCT AAGCTAATCC TTGTTACTGC AACAAATCCA
ACCCCAAGTG GAGAAGGAAA AACTACCCTA AACATAGGCC TATCCATGGC CTTAAATAAA
ATCGGCAAGA AGGCAATTTC TGTCCTAAGA GAGCCATCTA TGGGACCTTC CTTCGGAAGA
AAGGGAGGAG CTGCTGGTGG AGGCTACAGC CAAGTTCTCC CAATGGATGA GATCAACCTA
CATTTTACAG GAGACTTCCA CGCCATAACT AGTGCGGTAA ATCTTGTTGC TGCTATTTTG
GATAATCATA TCTACCAGGG GAACGAAAAA AGAATAGATC CAAAAAGAAT CGTCTGGAGA
AGGTGTGTCG ACCTAAACGA CAGGGCCCTA AGAAATGTAG TGATAGGACT AGGCAATAGG
ACAGATGGTG TAAGTCGTGA GGATAAATTT GATATAACAG TTGCTACAGA GATGATGGCA
GTTTTATGTC TTGCTACAAG CATCGAAGAC TTTAGAGAAA AAGTAAGCAA GATGATCGTA
GCCTACGACT ATGACAATAA TCCTGTAACA GTCGATGATA TCAAGGCAAC GGGATCTGTC
GCTGTAGTTA TGAAGGAAGC CCTTAAGCCA AACCTAGTTC AAACAATAGA GCATACCCCA
GCCCTAATCC ATGGAGGACC TTTCGCAAAT ATTGCCCACG GTTGTAACTC CCTCCTTGCG
ACAAAAACCG GCCTTGGCAT AGCAGATTAT GTAGTAACAG AAGCAGGTTT CGGGGCAGAC
CTTGGAGCCG AGAAGTTTAA CGATATCAAA TGCAGGCTAG GAGGACTCAC TCCTTCCGCA
AGTGTAATAG TAACATCAAT AAGAAGTCTA AAATATCACG CTGGAGTTGA CTTTGAAAAC
TTGAAGGAAG AAAATCTAGA AAAACTTGAA CTAGGTTTTA AGAACCTCAA GATCCATATA
GAAAATATGA GAAAATTCGG TAAAAATATA ATCGTCGCTA TCAACAAGTT CGATACCGAC
ACCGACAAGG AAATAGAACT CGTAAAGAAG ATGACAGAAA AGCTTGGAGT AAAGGCAGTC
GAGACAAGTG TCTTTACTGA TGGTGGAGCA GGTGGAAAAG AGCTTGCCCA AAATCTAGTT
GAGCTTTGTG AAAATGACAA TGACTTTAAC TACCTCTATG ACCTAGACCA AGGTGTAAAG
GAAAAAATAG AGACTATAGC TAAGGAAATC TATAGGGCGA AGGGTGTAGT TTATAGCAAA
AAGTGCGAGA AGGATATCAA AAAAATCGAA GACTTAGGCT ATCAAAACCT ACCAATTTGT
GTAGCCAAAA CACCTTATTC CCTATCAGAT GATGGAAATA TCAATATCAC AGAAGACGAC
TACGATATAA CGATTAGAGA AATTAGGATC AATGCTGGAG CAGGATTTTT GGTTGCCTAC
ACTGGAAATA TTTTGACCAT GCCGGGACTT CCAAAGGCTG CTAATGCCTA TAAAATTGAT
TTAGATGAAA ATAACGAAGT AGTTGGATTG TTCTAA
 
Protein sequence
MKTDIEIAKE AQLKDIDEIC KDLGIDDYEK YGNYKAKLPL AYAGKMKKDS KLILVTATNP 
TPSGEGKTTL NIGLSMALNK IGKKAISVLR EPSMGPSFGR KGGAAGGGYS QVLPMDEINL
HFTGDFHAIT SAVNLVAAIL DNHIYQGNEK RIDPKRIVWR RCVDLNDRAL RNVVIGLGNR
TDGVSREDKF DITVATEMMA VLCLATSIED FREKVSKMIV AYDYDNNPVT VDDIKATGSV
AVVMKEALKP NLVQTIEHTP ALIHGGPFAN IAHGCNSLLA TKTGLGIADY VVTEAGFGAD
LGAEKFNDIK CRLGGLTPSA SVIVTSIRSL KYHAGVDFEN LKEENLEKLE LGFKNLKIHI
ENMRKFGKNI IVAINKFDTD TDKEIELVKK MTEKLGVKAV ETSVFTDGGA GGKELAQNLV
ELCENDNDFN YLYDLDQGVK EKIETIAKEI YRAKGVVYSK KCEKDIKKIE DLGYQNLPIC
VAKTPYSLSD DGNINITEDD YDITIREIRI NAGAGFLVAY TGNILTMPGL PKAANAYKID
LDENNEVVGL F