Gene Apre_0146 details


Gene Information

Locus tag: Apre_0146
Symbol:
ID: 8396897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 172457
End bp: 174013
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 37%
IMG OID: 644994484
Product: extracellular solute-binding protein family 1
Protein accession: YP_003151919
Protein GI: 257065663
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
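
The coordinate and length fields above agree with each other if the start and end positions are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal Python sketch of the arithmetic follows; the helper names `gene_length` and `protein_length` are hypothetical and exist only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(172457, 174013)
print(length)                  # 1557
print(protein_length(length))  # 518
```

Both results match the reported Gene Length (1557 bp) and Protein Length (518 aa).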


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAATTTAG TTAAAAAATT TGCTTACACA TCAATGGCTC TAATGCTTGC TCTAGGCATT 
TCAGCTTGCG GAAACAATGA CGGAAATGTA GAAGATGAAA ACAAAACCGA AACTAGTGAA
AGCGTAGAAA ACAAAGAAAA CAAAGACGAT AAAAAAGATA AAAATGCTAA AGATGACGAA
GTTCCTACCT TAACTTATCT TAATATAGGA ACCCCGCAAG CAGGAACGGA AGAGACAGTA
AGTAAGATTA ACGAATACCT TGATGAAAAA GAAGCTGGTT ATCACTTAAA TCTAATCTTC
TATGATTGGG GAGATTATGA ACAAAGACTT CAACTTGCTT CATCAACAGG AGAAGATTGG
GATCTAGCCT TTACTGCAAG CTGGGCAGGA CCTTACAAGA CTCTAGTATC ACAAAACGCC
TTGATGGATC TAACTGACTT AATGGAAGGC AAGGAATTTG TTGATTTAAT CAATCCTGAT
ATGCTAAAAG GTGTTAGTGT AGACGGCAAA GTTTACGGAA TCCCTGCAGC ATATCCTGGA
GTTGTTGCAG CTAACCAATT TGTATGGACA AAGAGCATGG TTGATAAATA TAATATCGAC
TACAAGAATA TCCAAAATAT AGACCAATTA GAGCCTATCT TTGAAGAAGT AAAAGAAAAA
GAAGGTATGC AATATCCATT TGGCGTATCT AAGGACTTCT TGTTCGCTAT GCCAGAGCCT
GTATATCAAG TTACAGATGG GGTAGCTGTA AGAGAAGAAG ATGGCAAGCT TAAGGCTTAC
AACTTATATG CGGATGATCT TTATAAAGAA CAAATCATGA AGATGAAAGA TTATATGGAC
AAAGGATACA TCTCACCATC AGCTCCACAA GTTGAGCCAG GAACAGTAAT GCCAGAAAAC
GAAGTTCTAC TAACAGAAGG TGAAGGAGAA CCAGGATCTG CCGCTATATG GTCAGTTGCA
CCACGTAACG AAGTAGTATC AAATATAATT GGAGATAAGG TTCTTATATC TAACGACAAG
GCTACAGGAA AGATGATTTC CATCAACTCA CAAACTGATA AGGCAGAACT TGCAATGGAC
TTTATCAACA GAATGTTCAC AGATAAAAAC CTACAAGACA TGCTATCTTA CGGTGTTGAA
GGCAAAAACT TCGAGTACAA AGACGGAAAA GTTGTAAAAC ACAATAAGGA TTCTGATGGA
ATAGATAACG ATTACGATGT ACCATCATTT ACCTTCCTAT GTGCATTTAA CAGAACTCCA
CTAGAGGGAG CTCCTGGAAT TGGCGATGAA GAATTCGATA AGGAAGCAAA AGAATTTGAA
GATAAACTAG TAGCATCTCC AGTATTAGGA TTTACCCTAG ACGAATCAAA GATAGCAACA
GAAATAGCAA ATGTTCAACA AACACAAAGC GAATACACCA TCAACCTTAA GACAGGTGCC
TTTGACGAAA GCTACTATCA AGAATTCTTA GATAAACTAA ATACAGCAGG AATCGAAAAG
GTAATAGAAG AAGTACAAGC ACAATTAGAT AATTGGGAAG GCGCTGATAA GCAATAA
 
Protein sequence
MNLVKKFAYT SMALMLALGI SACGNNDGNV EDENKTETSE SVENKENKDD KKDKNAKDDE 
VPTLTYLNIG TPQAGTEETV SKINEYLDEK EAGYHLNLIF YDWGDYEQRL QLASSTGEDW
DLAFTASWAG PYKTLVSQNA LMDLTDLMEG KEFVDLINPD MLKGVSVDGK VYGIPAAYPG
VVAANQFVWT KSMVDKYNID YKNIQNIDQL EPIFEEVKEK EGMQYPFGVS KDFLFAMPEP
VYQVTDGVAV REEDGKLKAY NLYADDLYKE QIMKMKDYMD KGYISPSAPQ VEPGTVMPEN
EVLLTEGEGE PGSAAIWSVA PRNEVVSNII GDKVLISNDK ATGKMISINS QTDKAELAMD
FINRMFTDKN LQDMLSYGVE GKNFEYKDGK VVKHNKDSDG IDNDYDVPSF TFLCAFNRTP
LEGAPGIGDE EFDKEAKEFE DKLVASPVLG FTLDESKIAT EIANVQQTQS EYTINLKTGA
FDESYYQEFL DKLNTAGIEK VIEEVQAQLD NWEGADKQ
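
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only and is not the database's own pipeline. It relies on NCBI translation table 11 assigning the same amino acid to each codon as the standard code, since the two tables differ only in which codons may act as starts. The function names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG order.
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAATTTAG TTAAAAAATT TGCTTACACA"))  # MNLVKKFAYT
```

Passing the full gene sequence to `translate` should give a string comparable to the listed protein sequence once its spaces are removed, and `gc_content` on the same input can be compared with the reported GC content.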