Gene Apre_1594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1594 
Symbol 
ID8398406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1734291 
End bp1735370 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content38% 
IMG OID644995958 
Productoxidoreductase domain protein 
Protein accessionYP_003153336 
Protein GI257067080 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.788368 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATTA ACAAAAAGGA TGGAATGAAC TATGCACCTA AGGGCTCTGC TAATAAAGTA 
GTTGATCCTG GCGAATTTAA GGTCGGAGTA GGAGCCCTAG ATCATGGCCA TATTTATGGC
ATGTGCAACG GACTTAAGGA AGCTGGTGCA GAGATAGTTT ATGTCTATGA TAAAGACCCT
AAAAAAATCG AAGAATTTCA AGAGAAATTT CCAGAAGCCA AAGCTGTAAA TTCAGTAGAT
GAAATCTTAA ATGATCCTGA GATAAAGATG CTTGCGGCAG CAGCAATCCC AAATCTAAGA
TCAGCCTTAG GTAACAAGGC TATGAAAGCC GGAAAGGATT ATTTTACCGA CAAAACTGGC
TTTACTAGCC TAGATCAGCT AGAAGAAACG AAAAAAGTTG TAGAAGAGAC TGGAAGAAGA
TATTTTGTTT ACTTTTCAGA AAGACTTCAT GTTGAAGGAG CAATATTTGC TGGAAAGCTT
ATTGAAGAAG GAAGAATCGG AAGAGTCATC CAAGTAACAG GCTTTGGACC TCATAGTCTC
AACAAGGAAA GTAGACCAGA CTGGTTTTTC AAAAAAGATC AATATGGCGG AATTTTGACA
GATATAGGAA GCCATCAAAT AGAACAATTC CTATATTACA CTGGAAATAA AGATGCAAAA
GTTCTTCACA GCAAGGTCGC AAACTACGAC AACCCTGATA CACCAGAGCT AGAAGATTAT
GGTGATGCTA CCCTTGTTGG AGAAAATGGA GCAACATTCT TCTACAAGGT GCATTGGTTC
TCACCAAAGG GTCTTTCAAC ATGGGGTGAT GGTAGAACAT TTATAACAGG GACTAAGGGT
TCAATCGAAG TAAGAAAATA TACCAATGTT GCAACGAATG CTACAGGAGA TCATGTATTT
TTGGTAGATG AAAACGGAGA AGAGCATTTT GAAGTAAATG GCAAAGTAGG ATTTCCATTT
TTCGGAGAGA TGATACTCGA TTGTATCAAT GGCACAGAAA ATGCAATGAC ACAAGAGCAT
ATATTCAAGG CACAAGAGCT ATGCTTAAAA TGCCAAGAAG AAGCTATAGT AATAGATTAA
 
Protein sequence
MAINKKDGMN YAPKGSANKV VDPGEFKVGV GALDHGHIYG MCNGLKEAGA EIVYVYDKDP 
KKIEEFQEKF PEAKAVNSVD EILNDPEIKM LAAAAIPNLR SALGNKAMKA GKDYFTDKTG
FTSLDQLEET KKVVEETGRR YFVYFSERLH VEGAIFAGKL IEEGRIGRVI QVTGFGPHSL
NKESRPDWFF KKDQYGGILT DIGSHQIEQF LYYTGNKDAK VLHSKVANYD NPDTPELEDY
GDATLVGENG ATFFYKVHWF SPKGLSTWGD GRTFITGTKG SIEVRKYTNV ATNATGDHVF
LVDENGEEHF EVNGKVGFPF FGEMILDCIN GTENAMTQEH IFKAQELCLK CQEEAIVID