Gene Apre_1244 details


Gene Information

Locus tag: Apre_1244
Symbol:
ID: 8398033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1329643
End bp: 1331115
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 41%
IMG OID: 644995589
Product: Aldehyde Dehydrogenase
Protein accession: YP_003152989
Protein GI: 257066733
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
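
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene span includes the terminal stop codon (both assumptions agree with the values listed in this record); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1329643
end_bp = 1331115

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the span includes the stop codon, which does not
# contribute an amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1473 490
```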


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00471898
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATTC AAGATAGATA TGATTTATAT ATTAATGGTG AATGGATTAA GCCAGCATCA 
GGCGAGTATC TCGATGCAAT AAATCCTGCT ACAGGCGAGA AGATTGCTGA GTTTGCTACT
GCAAATAATG AGGATGTAGA CAGAGCGATC GAGGCTGCTA GAGCATGTTT TGATGGAGAA
TACGGATCTT TCTCTAAGGA AGAACGTGCA AACCTCCTAT TTAAAATAGC CGATAGGATA
GAAGAAAACT TAGAAGATCT TGCCACAATC GAGACTATGG ATAATGGTAA GGCAATTCGT
GAAACAAGGA CAGTTGACCT TCCTTGGGTT GTAGATCATT TTAGATATTT CGCTTCCCTC
TTAAGAGCTG ATGAAGATGA AATTTCTAAG CTTGATGGTA GATTTGTATC AATCAGAAAA
AGAGAGCCAC TTGGGGTTGT AGCTCAAATG ATTCCTTGGA ACTTCCCACT CCTTATGGCT
GCTTGGAAGC TTGCACCAGC TATAGCAGGG GGAAATACTA TAGTAATCTC CCCTTCTTCA
AACACATCTA TAGGGCTTCT TGAGATGATC AGAAGAATAG AAGACCTCCT TCCAAAGGGA
CTTATCAATG TTGTAAGTGG TAGGGGTTCT GTAACTGGAG AATACCTCCA ACATCACAAA
GGCGTGGATA AGCTTGCCTT TACAGGTTCT ACAAGTGTTG GTCGTCACAT TGGTATTTCT
GCAGCAGAAA ACTTAATTCC TTCAACCTTA GAGCTTGGTG GTAAGTCAGC TCATATCATT
TTTGACGATG CTGATATAGA AAAAGCCCTA GAAGGCGCCC AAGTTGGAAT CCTATTCAAT
CAAGGAGAAG TTTGCTCAGC AGGATCTAGG CTCTTTATCC AAGAAGGAAT CTACGATGAA
TTTGTAGAAA AACTTGTTGA AGCTTTCAAT AAGGTAAAAG TCGGTAACCC TCTAGAAGAG
GACACACAAA TGGGTGCCCT AAGAGATGAG AAGAGAATCC CAGTTATAGA AGAATTCATC
AAAAAAGCAA CAGATGCAGG TGCAAAGGTC CTTGCTGGTG GTAAGAGACT TACAGAAAAC
GGACTCGACA AGGGAGCCTT CTTCGCACCA ACTATGCTTG CTGATGTTCC AGAAGATAAC
GACGCCTACA GAGAAGAAAT CTTTGGACCA GTTGTAGTAA TAAAGAAATT CAAGGACGAG
GACGATGTTA TAAGAATGGC AAATGACTCC CACTACGGCC TTGGTGGAGG AATCTACTCC
AACGACCTAT ATAGGATAAT GGATGTTTCA AATAGACTAA AGACAGGAAG AATTTGGGTT
AACACCTACA ACCAATTCCC AGCAGGTGCA TCATTCGGTG GCTACAAGGA TTCTGGTATA
GGTAGGGAAA CAGACAAACT TGCCCTTGAA GCCTACACTC AAGTTAAAAA TATTATCATT
GATTCCTCAA AAGAAAAATT AGGTTTCTAT TAA
 
Protein sequence
MKIQDRYDLY INGEWIKPAS GEYLDAINPA TGEKIAEFAT ANNEDVDRAI EAARACFDGE 
YGSFSKEERA NLLFKIADRI EENLEDLATI ETMDNGKAIR ETRTVDLPWV VDHFRYFASL
LRADEDEISK LDGRFVSIRK REPLGVVAQM IPWNFPLLMA AWKLAPAIAG GNTIVISPSS
NTSIGLLEMI RRIEDLLPKG LINVVSGRGS VTGEYLQHHK GVDKLAFTGS TSVGRHIGIS
AAENLIPSTL ELGGKSAHII FDDADIEKAL EGAQVGILFN QGEVCSAGSR LFIQEGIYDE
FVEKLVEAFN KVKVGNPLEE DTQMGALRDE KRIPVIEEFI KKATDAGAKV LAGGKRLTEN
GLDKGAFFAP TMLADVPEDN DAYREEIFGP VVVIKKFKDE DDVIRMANDS HYGLGGGIYS
NDLYRIMDVS NRLKTGRIWV NTYNQFPAGA SFGGYKDSGI GRETDKLALE AYTQVKNIII
DSSKEKLGFY
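
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be compared. The following is a minimal sketch of that check, not the database's own method: it translates a block-formatted DNA string and compares the result with the listed protein. The helper names `clean` and `translate` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons; because this gene starts with ATG, a plain codon lookup is sufficient here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Amino acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block_text):
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(block_text.split()).upper()


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein,
# copied from this record.
first_dna_line = "ATGAAAATTC AAGATAGATA TGATTTATAT ATTAATGGTG AATGGATTAA GCCAGCATCA"
protein_start = "MKIQDRYDLY INGEWIKPAS"

assert translate(first_dna_line) == clean(protein_start)
```

Passing the full gene sequence to `translate` in the same way would allow the whole protein sequence to be compared.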