Gene Apre_0255 details


Gene Information

Locus tag: Apre_0255
Symbol:
ID: 8397029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 287952
End bp: 289271
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 37%
IMG OID: 644994616
Product: extracellular solute-binding protein family 1
Protein accession: YP_003152028
Protein GI: 257065772
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
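
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(287952, 289271)
print(length)                  # 1320
print(protein_length(length))  # 439
```

Under these assumptions the results agree with the Gene Length and Protein Length fields of the record.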


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGA AATTATCGTT GTTAATGGCT CTAGTTTTGT CACTAGGAGT ATTCACAGCA 
TGCGGATCTA ATGAAAACAC AAGTGAAGAA AAGGCAGACA GCAAGGTAGA AGAAAGTGCT
GATCAAGTTA AAGATGACAA GAAGGAAGAA GGAAAAGAAG AAGATAAGGC AGGAGGTCAA
ACAGAAATAG TATTCTGGCA CGCTATGGGT GGAGGTCAAG GAGAAGCTCT TGAAAATTTG
ACCAAGAAGT TTGAAGAAGA AAATCCAAAC ATCAAGGTTA CTCTTCAAGG TCAAGGAAAA
TACGGAGACT TAAACCAAAT CCTCGTTGCA TCAATGCAAT CACCTAAAGA CCTTCCAACA
ATAACTCAAG CTTATCCTGA CTGGATGCTT CAATTCAAAG ACGCTAATAT GATAGCAGAT
CTTACAGACT ATGTTAAAAA AGACATGGAT GATTATGATG ATATCTTGCC AGGAGTAAGA
GATGAGTTAG AAAAAGATGG CAAGATTGAA GCTCTACCAT TTAACAAATC AACAGAAGTA
TTCTGGTATA ACAAAAATCT TTATGATGAA CTAGGACTAA AAGAGCCTAC AAGTTTTGAA
GAGCTAAAAG AAAATGCTAA GAAAATTTAC GAAGCAAAAG GAATCCCTGG GGCAGGATTT
GATTCACTAT CAAACTTCTA CCTAACATAT CTAAAAAACA AGGGAATCGA ATTTGATGAA
AATCTAGACC CTGCTTCAGC TGAATCTATC GAAGCTGTTG AATATTATCT AGAAGGTATC
AAAGAAGGAT ACTTTAGAAT AGCAGGGACA GACCAACACT TATCTGGTCC GTTTGCAAAC
GAACAAGTTG GTTCATTTGT AGGATCAAAC GCTGGTGAAG TATATGTAAA AGAAGCTCTA
AACGATAAAT TCGAATATGC TGCAGCTCCT TACCCAGCAA AAGAAGCCTT CCAACAAGGT
ACAAATATTT ACATGTTTGA CAAGGCAAGT GATGAAGAAA AACAAGCAGC ATTTAAATAC
ATGCAATTTT TAGCTAGCAA GGATTCACAA GTTGAATTTG CTATAGCTAC AGGTTACATG
CCAGCAAGAA AATCTGCAGT TGAAGATGAA ACTTACAAAT CATCTGATTC AAAAATCGCA
CCAATTCTTG ATAAGGCAAG TGAAAAACTA TTCTCTAGAC CACTTGCTCC AGGCAGCCAA
CAAGCCTACA ATGACGTAGC AAGTTTACTT GAAAGCATCC TTTCAAATCC AAATGCAGAC
GTTAAGGCAG AGCTTGAAGC ATTCGCGCCA CAATTTAAGG CAGACTTTGA AGCTCAATAA
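
The record reports a GC content for this gene. A value of that kind could be recomputed from the sequence as printed, in space-separated blocks of ten bases. The sketch below assumes GC content is defined as the fraction of G and C bases over all bases; `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases, ignoring spaces and line breaks in the input."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Only the first ten-base block of the gene sequence is used here.
first_block = "ATGAAAAAGA"
print(f"{gc_content(first_block):.0%}")  # 20% for this block alone
```

Passing the full gene sequence, with its spaces and line breaks left in place, gives the gene-wide value.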
 
Protein sequence
MKKKLSLLMA LVLSLGVFTA CGSNENTSEE KADSKVEESA DQVKDDKKEE GKEEDKAGGQ 
TEIVFWHAMG GGQGEALENL TKKFEEENPN IKVTLQGQGK YGDLNQILVA SMQSPKDLPT
ITQAYPDWML QFKDANMIAD LTDYVKKDMD DYDDILPGVR DELEKDGKIE ALPFNKSTEV
FWYNKNLYDE LGLKEPTSFE ELKENAKKIY EAKGIPGAGF DSLSNFYLTY LKNKGIEFDE
NLDPASAESI EAVEYYLEGI KEGYFRIAGT DQHLSGPFAN EQVGSFVGSN AGEVYVKEAL
NDKFEYAAAP YPAKEAFQQG TNIYMFDKAS DEEKQAAFKY MQFLASKDSQ VEFAIATGYM
PARKSAVEDE TYKSSDSKIA PILDKASEKL FSRPLAPGSQ QAYNDVASLL ESILSNPNAD
VKAELEAFAP QFKADFEAQ
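
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal Python sketch of that translation, assuming the standard codon assignments, which translation table 11 shares with the standard code for elongation codons. It does not handle the alternative start codons that table 11 permits; this gene starts with ATG, so that case does not arise here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGAAAAAGA AATTATCGTT GTTAATGGCT"))  # MKKKLSLLMA
```

The output matches the first ten residues of the protein sequence shown above.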