Gene Apre_1090 details

Gene Information

Locus tag: Apre_1090
Symbol:
ID: 8397877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1166432
End bp: 1168405
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 33%
IMG OID: 644995437
Product: protein of unknown function DUF214
Protein accession: YP_003152838
Protein GI: 257066582
COG category:
COG ID:
TIGRFAM ID:
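
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only illustrative: it assumes the coordinates are 1-based and inclusive (an assumption that reproduces the stated 1974 bp) and that the protein length excludes the terminal stop codon. The variable names are made up for this example.

```python
# Coordinates copied from the record above.
start_bp = 1166432
end_bp = 1168405

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the final stop codon is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1974 657
```

Both values match the "Gene Length" and "Protein Length" fields of the record.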


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAGA AAATCGCCCA CAAATTTGCC CTAAAAAACC TCAGGGCACA CAGACTAGTC 
TATCTTCCCT TTATCCTATC ATCTGGGATA ATGCTTATGG TATTTAATAT TATGGCAAGT
CTTTCTGCTA ACACTTATGT AAGAGAAAGA CATGCCTCTC TTCCGACTGT TATTAATATA
GGAATTGTTA TAATAGGTCT ACTAACTTTT ATCTTTTTAT TATATAGTAC AAATTTCCTA
AACAAGAGGA GAAATAAGGA ATTTGCTCTT TATGGAATTC TAGGACTTGA GAAAAGACAT
ATAAGAAAAA TTATTTTCTT AGAGTTATTG ATTGCTTTTG TCATAATAGG AGTGATTGGT
CTTATAGGAG GATATATATT TGGTAAGCTA TCCTTCCTTG CCTTAAATAA ACTAATGAAA
GATGTTGCGG GAGGCTTGAT GGATTATCCT TTCTCGTTAA AAGCGATGAC AGAAACTATT
CTTCTTCTTC TCCTAGCTCT TTTAACAAGT TTATTTGCAA CGAGTCTAAA AATATATAAT
TCTACTCCTG TAGAGCTTTT AGCTGATCAA AAAAGTGGGG AGGGAGAGCC TAAATCAAGA
TATATTTTGA TGGTTTTAGG TTTTGTCTTG CTTTTATCAG GCTATTACAT AGCTATTACG
ACACAAGGTA TCCTAAAAAG TCTTGGATAT TTCTTCCTTG CAAGTCTTAT AGTAATGCTT
GCGACATATA TTCTGTTTAT GTCATTTTCT GTAATTTATC TAAAAAGACA GAAAGAAAAG
AAGTCCTACT ACAAGAAGGA AAGATTCCTA GCTGTATCTG GACTTTTGTA TAGAATTAAG
TCAAATGCTA TATCCCTTGC TAGTATTTCT ATAATGAGCG TTGGCGTAAT CATAGCCCTA
TCCGCATCAT TATCTATATA CGATCGAATA GAGTATTCAG CTACAAATGT TGTTCCAAGA
GAGTACAATC TGGATAGTCC AGTAGAGATT GATAAAGACA ATCTAGAAGA AGAAAAAGAA
AAACTTAGAA GTATTGTTCA GGCCACTACA GCTAAGGGTG GAAAAATCAC CAATGACTTC
ATATCCTATG GACTATTTAC CGGTATCCAA GTAGACGGAG ATAAGTTTGA GACCTTTATG
GGAGATGGTA AACATAAACC ATATTTTCTA GTGGCTTGTG ACCTAGATGG CTACAATAAG
AGAGTCGGTA AAGAATATAA GTTAGCTGAT GATGAGATAC TCCTTACAGC TAATCAGAAA
TTTATGTTGG ATAAAGATAA AATTGAAATT GCAGATAGGA CTTATAAGGT TAAGATTATC
GATAACTTTA TCCCTTCTAA TGTCGCTATC GAGACTTATG GCATAGTTGT AAAAGATTTT
GATACTATTA AATTTCTATC TAGCGAATTT AAGATATTTG ATAGGGAGAC GTCAAGCTAT
AAGGACTCAA TTATTTCTCT TTCAGCAAAC TGGGACGTCA CTAGTATAGA TAAGAATATT
TATAGGAAAA ACCTAGAAGA ATATTCTAAG GATAAGGATA TAAATATCGA ATACGCAGAT
AAGTATCTGG AAGATGCCTA TGAGCTTTAC GGAGGGTTCG TATTCCTGGG AACAATAATC
GCCATAATCT TTCTCATTTC AACTATTTTG ATAAGTTATT ACAAGCAAAT AAAAGAAGGC
TTTGAAGATA GAAAGAAGTA TGAGATTATG AAAAAGATTG GTCTAGAAGA TAAGCTAATC
AAAAAGACGT CTGCTTCTCA GATAAGCTAT CTATTTGCAG CTCCTCTGGT CTTTGCTATA
ATTAATTCTC TAGTAGCTTC CAAAATAGTC TACCAGCTTC TAGCACTTTT TGGAGTGATG
ACCTTTATGC AATACGGAAA ATACTTCTTC ATCATGATAG GAGTCTTTGT AGTAATATAT
TATATGATAT TTAAAATAAC AAACAGAGCC TACTACAGGA TAGTAAGTAG GTAA
 
Protein sequence
MNKKIAHKFA LKNLRAHRLV YLPFILSSGI MLMVFNIMAS LSANTYVRER HASLPTVINI 
GIVIIGLLTF IFLLYSTNFL NKRRNKEFAL YGILGLEKRH IRKIIFLELL IAFVIIGVIG
LIGGYIFGKL SFLALNKLMK DVAGGLMDYP FSLKAMTETI LLLLLALLTS LFATSLKIYN
STPVELLADQ KSGEGEPKSR YILMVLGFVL LLSGYYIAIT TQGILKSLGY FFLASLIVML
ATYILFMSFS VIYLKRQKEK KSYYKKERFL AVSGLLYRIK SNAISLASIS IMSVGVIIAL
SASLSIYDRI EYSATNVVPR EYNLDSPVEI DKDNLEEEKE KLRSIVQATT AKGGKITNDF
ISYGLFTGIQ VDGDKFETFM GDGKHKPYFL VACDLDGYNK RVGKEYKLAD DEILLTANQK
FMLDKDKIEI ADRTYKVKII DNFIPSNVAI ETYGIVVKDF DTIKFLSSEF KIFDRETSSY
KDSIISLSAN WDVTSIDKNI YRKNLEEYSK DKDINIEYAD KYLEDAYELY GGFVFLGTII
AIIFLISTIL ISYYKQIKEG FEDRKKYEIM KKIGLEDKLI KKTSASQISY LFAAPLVFAI
INSLVASKIV YQLLALFGVM TFMQYGKYFF IMIGVFVVIY YMIFKITNRA YYRIVSR
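
Both sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not part of the original record, of how the gene sequence could be cleaned, checked for GC content, and translated. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard one, which assigns the same amino acids as translation table 11 for ordinary (non-start) codons. Only the first line of the gene sequence is used here to keep the example short.

```python
from itertools import product

# Standard codon table built in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAATAAGA AAATCGCCCA CAAATTTGCC CTAAAAAACC TCAGGGCACA CAGACTAGTC"
dna = clean(first_line)
print(len(dna))        # 60
print(translate(dna))  # MNKKIAHKFALKNLRAHRLV
```

The translated fragment matches the first two blocks of the protein sequence (MNKKIAHKFA LKNLRAHRLV). Passing the full gene sequence through `clean` and `gc_percent` would allow the "GC content" field to be checked in the same way.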