Gene Apre_0689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0689 
Symbol 
ID8397470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp788350 
End bp789984 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content31% 
IMG OID644995044 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003152449 
Protein GI257066193 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR02294] nickel ABC transporter, periplasmic nickel-binding protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAAAC TTAGAAAAAA TATGATGGTT TTATTGGCAC TTGTATTTAG TTTTTCAATT 
TTACTAGGAG GTTGCCAAAA GAATCAAGAA AAAGCACAAG AAAATTCAAA TGTAGCAAGT
GAGAACACAA AAGATGAAAG CAAGAAGGTT TTAACCTTAT GTACTGCTAA AGAACTGACA
AACCTTACGA CTTTAACAAT GAATAAGGAA AATAATATGG CTTGTGGTTT AATTTACGAA
ACCCTAGTAG CTTATGAAAA CGGTGAAATT GTACCAAAAC TTGCTGAAAG CTTTGAATAT
AAAGATGATG GCAAAACTTT AGTCTTTAAA TTAAAGGACG GGGTAAAGTT TTCAGATGGT
GAAGATTTTA ATGCTGATGC AGTAAAAAAG ATATTAGATT TTGATAAGTC TAACCCTAAT
TTTGCAGGCA TAAGGGCTGT TGCAGAGATA AAATCAACAG AAGTTATTGA TGATAATACG
ATTGCTGTTC ACTATGAAAA TCCGTCTAAA TTTTATATAA ATGGCTTTTG TTTCCAAAAT
GTATTGGGGA TGCCATCTCC AAAGTCATTT ACTGAGGGAA ACTTTGAGAA ATTTAACAAA
AATATAGGAA CAGGTCCTTA TGTATATGAA GAATTTAAAT CTGGAGAATA TACAAAATTT
GTTAGAAATG AAAATTATCA CGGGGAAAAA CCTTATTATG ATGAAGTTAT TGTCAAATAT
ATTCCCGATG CTTCATCAAG ACTTCAAGCC TTAAATAAGG GGGAAATAGA TTTAATTTAT
GGAGCAGATT TAATAAATTA TGATGACTTC AAAAAAGGTT CTGAAATTAA GGATGTTACT
GGAGAAGTCA ATAAAAATAG GACTTTGACT AAGAATCTAA TTTTAAATCC AAGTAAAAAA
GAATTAGAAG ATTTAAAAGT TCGCCAAGCA ATTAATTATG CAATTAACAA AAAAGACATT
GTCGACAGTT TAACATACTC ATATGAAGAT GTAGCTGAAA CTTTATTCCC TAAAGATGTG
GCTTATTGTG ATGCAAATTA TCCAACTATT TATAGTTATG CTCCTGAAAA GGCAAATGGC
TTGCTAGATG AAGCGGGTTG GAAACTCAAT AAAGATACAG GAATTAGAGA AAAGGACGGA
AGTCCATTAA AGCTTCAATA TGTTTATTGG TCAGATTTAG TACTTGCTAA GGAAACTGCA
CTTGCAATAA AGACACAATT AAAAGAAGTA GGTATAGATG TTGACCTAGT TGAAAAAGAT
CAAATGTCAT GGTGGACAGA TGGTATAAAG GGAGAATTCC ATTTGACAAC CTGGAATACA
GAGGGTTCTT ATACTGAACC TCATAAGTTC TTACAAGAAT CAATCACTGA AATGGATCCA
CATTTGATGC CGTTAAAAGC ACTTTCTGAT TCAAACATAT ATATTGATGC AATAAAGAAG
GCTTCCACTT CTACAAACGA AGGGGAGATT AAAGATAATA TACAAAAAGC TATAGTATAT
TCAAACGAAA ACGCTATGGA TTTGCCTCTT TCTTATTCAA AAGAAATGAT TTTGTATAGA
AATGACAAAA TTGGTGGATA TGACTTTACA AGTACACCAC AATTTTTCAA TATTTACAGT
GTAAAAGCTA AATGA
 
Protein sequence
MIKLRKNMMV LLALVFSFSI LLGGCQKNQE KAQENSNVAS ENTKDESKKV LTLCTAKELT 
NLTTLTMNKE NNMACGLIYE TLVAYENGEI VPKLAESFEY KDDGKTLVFK LKDGVKFSDG
EDFNADAVKK ILDFDKSNPN FAGIRAVAEI KSTEVIDDNT IAVHYENPSK FYINGFCFQN
VLGMPSPKSF TEGNFEKFNK NIGTGPYVYE EFKSGEYTKF VRNENYHGEK PYYDEVIVKY
IPDASSRLQA LNKGEIDLIY GADLINYDDF KKGSEIKDVT GEVNKNRTLT KNLILNPSKK
ELEDLKVRQA INYAINKKDI VDSLTYSYED VAETLFPKDV AYCDANYPTI YSYAPEKANG
LLDEAGWKLN KDTGIREKDG SPLKLQYVYW SDLVLAKETA LAIKTQLKEV GIDVDLVEKD
QMSWWTDGIK GEFHLTTWNT EGSYTEPHKF LQESITEMDP HLMPLKALSD SNIYIDAIKK
ASTSTNEGEI KDNIQKAIVY SNENAMDLPL SYSKEMILYR NDKIGGYDFT STPQFFNIYS
VKAK