Gene Apre_0147 details


Gene Information

Locus tag: Apre_0147
Symbol:
ID: 8396898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 174029
End bp: 175501
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 38%
IMG OID: 644994485
Product: extracellular solute-binding protein family 1
Protein accession: YP_003151920
Protein GI: 257065664
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
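
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The record does not state either convention, and the variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 174029
end_bp = 175501

# Assumption: 1-based, inclusive coordinates, so both end points are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1473
print(protein_length)  # 490
```

Under these assumptions the result matches the listed values of 1473 bp and 490 aa.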


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTAC TAAAAAAACT TAGCATAAGC ACACTCCTAG TTGGACTTAG CTTAAGTCTT 
TCTAGCTGCA CAGCTTCCGA TAATAAAGAA AAGTCCAAAA AAGCTTCAGA AGAGCTTCCT
ACCATAACTT ATTATGATGT AGGAACTCCA CAAGAAGATA CAGGAGAGGT GGTAGAAGCT
ATAAATGAAT ATCTAGATAA ATCTGATGCA GGATATCACC TAAACCTACA ATTTTTCGAT
TGGGGCGAAT ATGAACAAAG ACTTCAGCTT GCGTCAAATG CGGGAGATGA CTGGGATATA
GCCTTTACAG CTAACTGGTC TGGTCCTTAT AAAAACTTGG TAGAGAAGGG AGCCTTTGCT
GACATAACGG ATCTTATAGA TGAAAAAGGT CAAGCTATAA AAGATTCTCT TTCAGAAGAC
GTATTAAAGG GAGCTTCTAT TGAAGGAAGA CTATATGGTG CGCCAGCAGC TGCTAAAAAT
GTTGTTCCAG GCAATTATTT CGTTTGGAAT AAAGCTTATG TAGATAAATA TAAGATTGAT
ATAGATAGTG TAAAGACTAT AAAAGACCTT GAACCTTATC TTAAAGAGGT CAAGGAAAAT
GAAGCGAGCG TCGATTATCC TTTCAACATA GTAAGTGATT TCCTCCTTCA AACACCAACT
CCACAGTCTG AAGCGACACC AGGAGTTGCC GTAAAAGAAG AAAATGGAAA GCTCATCGCC
TACAATAGCT GGGCTGACCC AGAGCTTAAG AAACAATTAG ACGTCCTAAA AGACTACATG
GACAAGGGTT ATATTAACCC ATCAGCTCCT CAGATGAATG CGGGAGATGG AGAAGAAGGT
GATAGGTGGC TAGTAACAAA AGCCGAAGGA GGTCCAGATT CGGATGGGAT TTGGTCTAAC
TCCTTCAAGA GCGAAGTCAT ATCCTCCCCA GCAGGTAACA AAACAATAGT AACAAATCAA
AAAGCTACTG GTTCTCTTGC CGCTATTAAC TCCCAATCTG AGCACAAAGA ATTGGCTATG
GATTTTCTAA ATAGAATGTA TAGCGATAAG GAGCTTATGA GATATCTAAC CTATGGAATA
GAAGGCAAGC ACTATGATTT AGTAGATGGA AAGGTTGAAA AGTACGAAGA TACTAAATAT
GACGTACCAG CCTTTACCTT CCTAGCCTCT GAAAATATGA CACCACTTAC TACATCTGAA
GATTCTGACA CGCCAGAAGC TAAGGAAAAA CTAGATAAGT TTTTAGAGAA TTTAGAGCCT
TCTCCTATAC TAGGTTTCAA CTTTGATAGG AAAAGCGTTG AAAGCGAGGC AGGAAATGTT
GAACAGACAA TTTTCGAATA CGAGAAAAAT CTCAAAACAG GTGCCTTTGA TGAAGACTAT
TATCAAGAAT TCTTAGATAA GCTTAATACT GCTGGAATTG ATAAGTTGAT AGAAGAAGTC
CAAAACCAAT TAGATAATTG GGATAGGAAA TAG
 
Protein sequence
MNLLKKLSIS TLLVGLSLSL SSCTASDNKE KSKKASEELP TITYYDVGTP QEDTGEVVEA 
INEYLDKSDA GYHLNLQFFD WGEYEQRLQL ASNAGDDWDI AFTANWSGPY KNLVEKGAFA
DITDLIDEKG QAIKDSLSED VLKGASIEGR LYGAPAAAKN VVPGNYFVWN KAYVDKYKID
IDSVKTIKDL EPYLKEVKEN EASVDYPFNI VSDFLLQTPT PQSEATPGVA VKEENGKLIA
YNSWADPELK KQLDVLKDYM DKGYINPSAP QMNAGDGEEG DRWLVTKAEG GPDSDGIWSN
SFKSEVISSP AGNKTIVTNQ KATGSLAAIN SQSEHKELAM DFLNRMYSDK ELMRYLTYGI
EGKHYDLVDG KVEKYEDTKY DVPAFTFLAS ENMTPLTTSE DSDTPEAKEK LDKFLENLEP
SPILGFNFDR KSVESEAGNV EQTIFEYEKN LKTGAFDEDY YQEFLDKLNT AGIDKLIEEV
QNQLDNWDRK
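
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is not the tool used to produce this record. It assumes a plain codon lookup, which is sufficient here because translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last blocks of the gene sequence are used.

```python
# Codon table built from the conventional TCAG ordering.
# Translation table 11 uses the same codon-to-amino-acid assignments
# as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; raises KeyError on non-ACGT characters."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last four display blocks of the gene sequence above.
first_blocks = "ATGAATTTAC TAAAAAAACT TAGCATAAGC"
last_blocks = "CAAAACCAAT TAGATAATTG GGATAGGAAA TAG"

print(translate(first_blocks))  # MNLLKKLSIS
print(translate(last_blocks))   # QNQLDNWDRK*
```

The two outputs correspond to the first and last ten residues of the protein sequence, with the trailing `*` marking the TAG stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare with the listed GC content.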