Gene Apre_1086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1086 
Symbol 
ID8397873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1163646 
End bp1165226 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content38% 
IMG OID644995433 
ProductNADH dehydrogenase (quinone) 
Protein accessionYP_003152834 
Protein GI257066578 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGA AAGAATTATT AGATCTCAAA GAAAAGTCTC TAGCAGAGCT TAGGGAAAAA 
ATAGACTATA GCAAAATCTA TGAAGATGAT AAAGGTGATT TTAAGCTTAA GGGAGATTTC
TTTGAAAATC AAAAAAGACT AGTGCTTAAA AACTGTGGTA TAATCGATCC ATCATCAATC
GAAGATTACA TAGGATTGGA TGGATATAAG GCTCTTTATA AGGCTATTTT TGAATTAGAT
AGAAAAGAGA TAATTGATAT AGTAAAAGAT TCTGGTCTTA GGGGAAGAGG CGGAGCAGGT
TTTCCTACTG GAAGAAAATG GGAAGCGGCT TTCCTTCAAG ATACTGATAT TAAATACATA
ATATGCAATG CAGACGAAGG AGATCCCGGA GCTTTTATGG ATAGGTCAGT CCTTGAACTA
GACCCACACT CAGTACTCGA GGCTATGGCC ATATGTGCGA GGGCCATAGG TTCAAATAAA
GGATTTATAT ATGTTAGGGC TGAATATCCA AAGGCGGTTA GAGCTCTTGA AATAGCAATT
GATCAGGCTA AAAAATATAA TCTTTTAGGA GATAATATAT TAGGATCAGA TTTTTCTTTT
GATATAGAGC TAAGACTTGG AGCAGGTGCT TTTGTTTGTG GTGAGGGAAC TGCACTAATG
GAGTCAATAG AAGGAAGGCG AGGCATGCCT CGTAACAAGG AATACAGGAC GACTGTAAGA
GGGCTATGGG GTAAGCCTAC TGTAATAAAT AATGTAGAAA CTTTCGCCAA TATCGCCCAA
ATTATTAATA AGGGCTCAGC TTGGTTTAGG TCCTTTGGAA CAGAAAAATC TCCAGGTACT
AAAGTCTTCG CCCTATCCGG CAAGGTTAAA AATGCAGGTC TTGTCGAAGT GGAGATGGGA
ACTAGCATAG ATCAAATAGT TTATGATATA GGAAAAGGTA TTCAGAATGA TAAGGATGCT
AAAGCAGTAC AGACTGGAGG TCCTTCTGGA GGTTGTATAC CTAAAAGGCT CTTCGATACA
GCTTGTGATT TCGAATCACT GGGAGCTATA GGTTCCATAA TGGGATCAGG CGGCATGGTA
GTTATGGATG AGGATGACTG CATGGTTGAT GTTGCTAGGT TTTTCCTAGA ATTTTCTGTA
GACGAGTCAT GTGGTAAATG TACTCCTTGT AGAATTGGCA ATAAGAGATT ATTTGAAATG
CTCGATGATA TTACTAAAGG TAAAGCTAAT CATGAAACAC TCGATAAGCT AGAAGAATTA
TCAGAAATAG TATCCGAAGC TTCACTTTGC GGACTTGGCA AATCTAGTCC TAACCCGATT
ATTTCTACAA TGAGATATTT TTATGATGAA TATGAGGCCC ATGTAAATGA AAATAAAACT
TGTCCATCCA AAAGATGTAT TAGCCTTTTA AATTATACCA TAGGAGAAGA TTGTATAGGA
TGTGGTAAGT GTAAGAGACT ATGCCCTAAT GAGGCTATAG CTGGAGAAGC TCGCAAAAAA
CATGAAATAA ATCAAGACAA ATGTATTAAA TGTGGCCAGT GTAAAGATAA TTGCCCAATA
AATGCTATAG CTTTGGCCTA G
 
Protein sequence
MNKKELLDLK EKSLAELREK IDYSKIYEDD KGDFKLKGDF FENQKRLVLK NCGIIDPSSI 
EDYIGLDGYK ALYKAIFELD RKEIIDIVKD SGLRGRGGAG FPTGRKWEAA FLQDTDIKYI
ICNADEGDPG AFMDRSVLEL DPHSVLEAMA ICARAIGSNK GFIYVRAEYP KAVRALEIAI
DQAKKYNLLG DNILGSDFSF DIELRLGAGA FVCGEGTALM ESIEGRRGMP RNKEYRTTVR
GLWGKPTVIN NVETFANIAQ IINKGSAWFR SFGTEKSPGT KVFALSGKVK NAGLVEVEMG
TSIDQIVYDI GKGIQNDKDA KAVQTGGPSG GCIPKRLFDT ACDFESLGAI GSIMGSGGMV
VMDEDDCMVD VARFFLEFSV DESCGKCTPC RIGNKRLFEM LDDITKGKAN HETLDKLEEL
SEIVSEASLC GLGKSSPNPI ISTMRYFYDE YEAHVNENKT CPSKRCISLL NYTIGEDCIG
CGKCKRLCPN EAIAGEARKK HEINQDKCIK CGQCKDNCPI NAIALA