Gene Apre_1083 details

Gene Information

Locus tag: Apre_1083
Symbol:
ID: 8397870
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1159799
End bp: 1161409
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 36%
IMG OID: 644995430
Product: DEAD/DEAH box helicase domain protein
Protein accession: YP_003152831
Protein GI: 257066575
COG category: [J] Translation, ribosomal structure and biogenesis
              [K] Transcription
              [L] Replication, recombination and repair
COG ID: [COG0513] Superfamily II DNA and RNA helicases
TIGRFAM ID:
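
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1159799
end_bp = 1161409

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the listed Gene Length (1611 bp) and Protein Length (536 aa).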


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTTA ATGAATTAAA TATAGGAAAT GAAATTTTAA GGGCAATTGA TGATTTAGGC 
TACGAAAAGC CAAGTCCCAT CCAAGAAGAG TCTATCCCCC ACCTTTTGGA AGGGAATGAT
CTAATAGGAA AATCTCAAAC AGGAAGCGGG AAAACTGCGG CCTTCGCCAT ACCAATTATA
GAAAATATCG AAGCAAATGG AATAACTCAA GCCCTCATCC TCTGTCCAAC AAGAGAGCTT
TGTATACAAG TATCAAAAGA AATCGAAAAG CTTTATAAAT ACAAGAAAGA AATCAAAATC
CTTTCTGTCT ATGGTGGAAG TCATATAGTA AGGCAAATCA AGAGCCTAAA AAAAGGAGTA
GAAATTGTAG TTGGTACTCC TGGAAGACTG ATGGATCTTA TGAGAAGAAA GGTCTTAAAG
CTCGACCAAC TTAAAACAGT TGTCTTGGAT GAGGCTGACG AAATGTTTGA TATGGGCTTT
AGGGATGATA TGAAATTCAT CCTTGATAGG ACAAATCCCA ATAGACAGAC TTGTTTCTTT
TCAGCAACTA TGGGACCTGA AATCCAAGAA TTTTCTAAGC TTTATCAAAC TAATCCCTAC
GAAGTCAAAA TAAAATCTAA AGAAGTAACT GTTGATAGGA TCGACCAATA TTATATTAAA
CTTAAAGAAT CTATGAAGGA AGAAGCCTTG ATGCGACTTC TAGAAATCCA TAAGGCAAAT
CTTGCTATCG TATTTTGTAA TACCAAGAGA AAAGTCGACA GGCTTGTAGA AAGTCTCACC
AAGAAAAACT ACCTAGTAGA CGGCCTTCAT GGAGATCTCA AACAAAGTAG TCGTGACCAA
GTCATGAAGA AGTTTAGAAA TAAAACCATC CAAATCCTTG TAGCGACAGA TATAGCTGCT
CGTGGTCTTG ATGTGGATGA TGTAGATATT GTCTTTAACT ATGACCTACC TCAGCTTGAC
GAATACTATG TCCACAGGAT TGGAAGAACA GCCAGAGCTG GCAAAAGCGG TCTAAGCTTT
TCCCTAATCT CAGGTCGTGA TAATAATAGG CTAAGGCAAA TCGAAAATTA TACCAAGGCC
AATATAAAAC AGATGCCCGT CCCAACCCTT GTCCAAATGG ATAGGCAAAG CGACCTTCGT
CTTATAGAAG ATATTTCATC TAAGCTTGAT AAAAATGAAG ACCTCAGTAG AGAAAAAGAT
ATACTAATAA GACTTATGGA AAAAGGCTAT GATCCATTTA TGATCTGCCA GGTCTTACTT
AAAGATAAGC TAGATATTAA TAATAACCAC GAAAAGCTTG AAGGAATTGA CCTTAAATCG
GAGAAGAAAT CTAAGAGTAA GTCAAATAAT AAAACTAAAA AGAAATACGA CAAGGAAATG
ACAACTCTCT TCATGAACAG GGGTAAGATT GATAATTTCA CCAAGGATAA GATAATAAAG
GCCCTAGCAA GAATGGCCAA AGTCCCAAAG GATAAAATAG GCCAAATCAG GATTCAAAAA
ACCTATAGTT TTATAGATAT AGAAAAGCCT GAAGCCTTTA AGGCAATTAG GGCAATTGAT
AATAAGAAAA TATCTAGCAA AAAAGTTAAA ATAGAAGAAT CAAATAAATA G
 
Protein sequence
MKFNELNIGN EILRAIDDLG YEKPSPIQEE SIPHLLEGND LIGKSQTGSG KTAAFAIPII 
ENIEANGITQ ALILCPTREL CIQVSKEIEK LYKYKKEIKI LSVYGGSHIV RQIKSLKKGV
EIVVGTPGRL MDLMRRKVLK LDQLKTVVLD EADEMFDMGF RDDMKFILDR TNPNRQTCFF
SATMGPEIQE FSKLYQTNPY EVKIKSKEVT VDRIDQYYIK LKESMKEEAL MRLLEIHKAN
LAIVFCNTKR KVDRLVESLT KKNYLVDGLH GDLKQSSRDQ VMKKFRNKTI QILVATDIAA
RGLDVDDVDI VFNYDLPQLD EYYVHRIGRT ARAGKSGLSF SLISGRDNNR LRQIENYTKA
NIKQMPVPTL VQMDRQSDLR LIEDISSKLD KNEDLSREKD ILIRLMEKGY DPFMICQVLL
KDKLDINNNH EKLEGIDLKS EKKSKSKSNN KTKKKYDKEM TTLFMNRGKI DNFTKDKIIK
ALARMAKVPK DKIGQIRIQK TYSFIDIEKP EAFKAIRAID NKKISSKKVK IEESNK
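
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 bases of the gene and comparing the result with the first 20 residues of the protein. This is a minimal sketch, not the pipeline that produced the record: the helper name `translate` and the table layout are illustrative, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code.

```python
# Codon table in the usual compact layout: bases ordered T, C, A, G,
# with the first base varying slowest and the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Drop the spaces that separate the 10-base groups in the record.
    dna = dna.replace(" ", "").upper()
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, as printed in the record.
first_line = "ATGAAATTTA ATGAATTAAA TATAGGAAAT GAAATTTTAA GGGCAATTGA TGATTTAGGC"
peptide = translate(first_line)
print(peptide)  # MKFNELNIGNEILRAIDDLG
```

The printed peptide matches the first two 10-residue groups of the protein sequence above (MKFNELNIGN EILRAIDDLG).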