Gene Apre_1504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1504 
Symbol 
ID8398316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1638605 
End bp1639897 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content41% 
IMG OID644995868 
Productdihydrodipicolinate reductase 
Protein accessionYP_003153246 
Protein GI257066990 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4091] Predicted homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000626794 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAAGA TAAATAGAAA ACTAAAAGAA ATGAAAGATA ACGGAGAAAG CATCAAGCTT 
GCCATAATCG GTTGCGGCAA GATGGGAGCT TCCTTAATCA GCCAACTATC AAAGATGGAT
GCAATGGAAG TTAAGCTTGT AGTAGATAGA ACTCCACAAA AAGCTATCAA GGCCCTAGTA
AATGCAGGAA TCTCTGAAGA TAAAATAATA TTTACAGATG ACTATAATGA AGGATACGAA
GTTTTAGAAA AAGGCTTCGT TTGTGTATCT ACTAACTACA GATTAGCCTA TAAGCTCATG
CAAATAAATG CGGTAATTGA CTGTACAGGC AACCCTCCTT TTGGTGCAGT TATAGCAAGA
AAGACTATCC AATACCACAA ACACATGATC ACCTTCAATG TAGAATGTGA CGCTGTAGTT
GGACCTGTCC TTCACGATAT GGCCAAGAAG GCAGGAGTGG TTTACACAGG AATCCTTGGC
GATGAGCCAG GAGCTATAAT CGACCTTGTA GAATACGCTT ACGGAATGGG ACTTGAAGTC
TTGGTCGCTG CCAAGGGAAA GAACAACCCA CTAGACCGTG ATGCAACACC AGAAAGTTTG
GCAGAAAAAG CCAAAGAAAA GGGTCTATCA GCAAAGATGC TTACAAGCTT TGTAGATGGA
ACAAATACCA TGCTAGAGCT TAACTCTGTA TCAAACGCCC TAGGCTTCCT GCCAGATGTA
TTTGGCTGCC ATGGAATTGA TACAAGCCCT GAAACTGCTG TAGAAGATTT TAGACTAAAG
GCAGACGGAG GCAAGCTATC AAGATACGGA GTTGTAGAAT TCTCTCGTGG AATGGCTCCA
GGAGTATTTA TCATAGTAAC AAGTGATCAA GAAGACGTTA GAGATCTAAT GAAGTTCTTA
GGCTTTGGAG ATGGTCCAAA CTACTTGATG TACAGACCAT ACCACCTAAC AAGTCTTGAA
ACTCCAATTA CTATCTATAA GGCTGTAGTA GAAAACGAAG CGACAATAGT TCCTCTTCAC
GGCCAAGTAG CTGACACAGT AACTATTGCC AAAAGAGACA TCAAGGCTGG AGAAAGACTC
GAAGGAGTAG GATCAAAGAC AGTTTACGGT AAGCTTACAA GCCACGATAG AAGCCTTGCC
GAAGACCTCT TGCCAATAGC TTTGATTACA GATAAGACAA AGGCAGTAAA AGATATAGAA
AAGGGAACAG TAATAGATAT GTCCATGGTA GAGCTTGACG AAAAGGCAAC TATCACAAGA
CTAAGAAGAA GACAAAATTC CATGAAATTG TAA
 
Protein sequence
MFKINRKLKE MKDNGESIKL AIIGCGKMGA SLISQLSKMD AMEVKLVVDR TPQKAIKALV 
NAGISEDKII FTDDYNEGYE VLEKGFVCVS TNYRLAYKLM QINAVIDCTG NPPFGAVIAR
KTIQYHKHMI TFNVECDAVV GPVLHDMAKK AGVVYTGILG DEPGAIIDLV EYAYGMGLEV
LVAAKGKNNP LDRDATPESL AEKAKEKGLS AKMLTSFVDG TNTMLELNSV SNALGFLPDV
FGCHGIDTSP ETAVEDFRLK ADGGKLSRYG VVEFSRGMAP GVFIIVTSDQ EDVRDLMKFL
GFGDGPNYLM YRPYHLTSLE TPITIYKAVV ENEATIVPLH GQVADTVTIA KRDIKAGERL
EGVGSKTVYG KLTSHDRSLA EDLLPIALIT DKTKAVKDIE KGTVIDMSMV ELDEKATITR
LRRRQNSMKL