Gene Apre_0347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0347 
Symbol 
ID8397121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp393784 
End bp396381 
Gene Length2598 bp 
Protein Length865 aa 
Translation table11 
GC content29% 
IMG OID644994705 
Producttype III restriction protein res subunit 
Protein accessionYP_003152117 
Protein GI257065861 
COG category[S] Function unknown 
COG ID[COG3421] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGACC AATATTTATA CCAAGAGTTA GACGTTATTA AAAAAATAGG AGCTTTGAAA 
GAATTGCCTT CCTATATAGA GTCAAACTTA AATCCTAATA TAATTTTGCG TGATTATCAA
GAAGATGCTT TTAGGTATTT TATAACCTAT GTTGAAAGTA ATTTGATAAA AAATAAACAA
ATCCATAATT TATTTCATAT GGCAACTGGT TCTGGTAAGA CTGTTATTAT GGCAGGTTTG
ATTTTATATC TATATACCAA GGGCTATAGG AAGTTTTTGT TTTTTGTAAA TCAGAATAAT
ATTATAAAAA AGACTAAAGG AAATTTTCTG AATCCATCTT CTATTAAGTA TCTCTTTGCT
GATAATATTG AATTAATGGG TGAGCAAGTT CAAGTAAAAA AAGTAAATAA TTTTGCTTTT
TATGATAAAA ATGCTATAAA TATTTGTTTT ACTTCCACAC ATAAACTTCA TCTGGACATG
AATTTCACAA AGGAGAATTC TCCTACTTTT GAAGATTTTG AGGATGATAA AATTGTCTTA
ATATCTGATG AATCCCACCA TATCAATACT GTGACCAAAG GCCTAACAAA AACTGAAAAA
AGAAACTTAG ATGAAAATGC CAAAAGTTGG GAATATACAA TTGAAAGAAT ATTTAGAGCA
AATAGAGATA ATGTTTTGCT AGAATTTACA GCAACTGCAG ACTTAAAAGA CCCTAATGTT
GAGAAGAAAT ATTTGGATAA AATTGTCTAT GATTATACCT TATCTAAGTT TAGAGAGAGT
GGATACACCA AAGACTTTAA TAATATGCAA GGTGATTATG ACAGGTGGTC AAGAACTCTT
CTTGCCCTTG TTATAAGTGA ATATAGGAGA CATTTGTTTG GAGACAATGG TCAAAATATA
AAGCCAGTTG TTTTATTGAA GTCAAAAATT ATTAAAGATT CAAAAGAATT TTATGATGAG
TTTTATCAAA AATTAAATAA TCTTAAGGCT GAAGAAATTC TAAAATATAA GGATTCTGAT
AATGAGTATT TGACAAATGC AATTGAATAT TTTTTTAAAA AAGACTCAAG TTTGAATTCA
TTAGTATCTG ATATTAAATT AGGATTTTCT GTTGAGAATT CTATACTTTT AGATTCGAAA
ACAATATCTG AGGATAAACA AATATACATA AACTCTCTTG AAGCAAAAGA TAATCCATAT
AGGATAATTT TTACTGTAGA TATGTTAAAT GAGGGCTGGG ATGTATTAAA TCTTTTTGAT
ATTGTAAGAT TATATGAAAC AAGAGATGGA AAAAATGGGA AGCCGGGAAA AACAACTATT
AGTGAAGCTC AACTAATAGG TCGTGGTGCA AGGTATTGTC CATTTAAAAT TGAGGATGGT
CAGCCAAGAA ATAAGCGTAA GTATGATTTT GATATGAATA ATGAGAATAG AATACTTGAA
ACGCTTCTAT ATCATTCTAT GCAAGACTCA AGGTATATTA GTGAATTACG ATATGCACTA
AAACAAACCG GTTTATTAGC AGATGCTTCG ATGGAAATAA ATTATATATT GAAAGATGAA
TTTAAACAAA CAGATTTTTT CAGGGAAGCC TATGTATTTT CAAATAGAAA AGTTGAAAAG
TCAAGAAAAT CAGTTGCAGG AATAGATAAA AAAATACGAA ATGGATATTA CCAGCATAAA
GTATCCACAG GAAGTTCTTT TATATATGGA TTGTTTGATG AGGGAAAAGT AAAGGCTAAT
GAGATGACTA ATACTTTTAA CTATAAATTT AAAAATATAC CTTTAAATAT TGCTGAAGGT
GCAATGTCAA ATTTTGAGGT CTTAAAATTC AATACACTTA AGTCATATTT TCCTAACTTA
AAATCAAAGA AAGAATTTTT GAAATCTGGA AATTATTTGG GAAATATAAG CTTACAAATA
GAAAGCCCAT ACAAGAAATT AAAAGCCAAA GATCTTTATG ATGGAACAAT GAAAATACTT
AAAGAAATAT CATTATATTT ACAAAAGATA GAAACTGAGT ATGAAGGAAC CAAGGAATTC
TACGCTAAGA GAGTCTACGA GGTATTAAAA GATAAAAAAA TATATATTGA CAATCCACAT
GGAGATGGAG TAGGAGTGTC CCAATCCATG ATTGCCAATG AAGACGTCTT GGACTTATCA
TACGAGCAAT GGTATGTCTA CAATGATAAC TATGGAACAG GTGAAGAAAA AGCCTTTGTC
AAATATTTCA AGGGGATAGT TAAGGATTTG AGAAGTAAAT ATGATGATAT TTATTTGGTA
AGAAACGAGA GAATTCCGGC CTTGGCTATT TATGAATTTG ATACAGGAGA AAGATTTGAG
CCAGACTTCT TATTGTTCTT ACAAAAGAAA GGAACTGATG GATATTTGCA AGAACAAATT
TATATTGAAC CTAAAGGTAG CCATCTACTA AAAAAAGACA AATGGAAAGA AGATTTCCTA
CTAAAAATTG AAGAACAGGG CATACCTACA AAAACTTATG TGGATGATAA CAAGTATAAG
ATTATAGGAC TTCCATTCTT TAATAGGGAG TGTAGGATGG AAGAATTTGA GGAAGAGATT
AGAAAAAACT TTAGTTAA
 
Protein sequence
MTDQYLYQEL DVIKKIGALK ELPSYIESNL NPNIILRDYQ EDAFRYFITY VESNLIKNKQ 
IHNLFHMATG SGKTVIMAGL ILYLYTKGYR KFLFFVNQNN IIKKTKGNFL NPSSIKYLFA
DNIELMGEQV QVKKVNNFAF YDKNAINICF TSTHKLHLDM NFTKENSPTF EDFEDDKIVL
ISDESHHINT VTKGLTKTEK RNLDENAKSW EYTIERIFRA NRDNVLLEFT ATADLKDPNV
EKKYLDKIVY DYTLSKFRES GYTKDFNNMQ GDYDRWSRTL LALVISEYRR HLFGDNGQNI
KPVVLLKSKI IKDSKEFYDE FYQKLNNLKA EEILKYKDSD NEYLTNAIEY FFKKDSSLNS
LVSDIKLGFS VENSILLDSK TISEDKQIYI NSLEAKDNPY RIIFTVDMLN EGWDVLNLFD
IVRLYETRDG KNGKPGKTTI SEAQLIGRGA RYCPFKIEDG QPRNKRKYDF DMNNENRILE
TLLYHSMQDS RYISELRYAL KQTGLLADAS MEINYILKDE FKQTDFFREA YVFSNRKVEK
SRKSVAGIDK KIRNGYYQHK VSTGSSFIYG LFDEGKVKAN EMTNTFNYKF KNIPLNIAEG
AMSNFEVLKF NTLKSYFPNL KSKKEFLKSG NYLGNISLQI ESPYKKLKAK DLYDGTMKIL
KEISLYLQKI ETEYEGTKEF YAKRVYEVLK DKKIYIDNPH GDGVGVSQSM IANEDVLDLS
YEQWYVYNDN YGTGEEKAFV KYFKGIVKDL RSKYDDIYLV RNERIPALAI YEFDTGERFE
PDFLLFLQKK GTDGYLQEQI YIEPKGSHLL KKDKWKEDFL LKIEEQGIPT KTYVDDNKYK
IIGLPFFNRE CRMEEFEEEI RKNFS