Gene Apre_0469 details

Gene Information

Locus tag: Apre_0469
Symbol:
ID: 8397244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 535247
End bp: 536905
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 40%
IMG OID: 644994826
Product: extracellular solute-binding protein family 5
Protein accession: YP_003152237
Protein GI: 257065981
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
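
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 535247
end_bp = 536905
gene_length_bp = 1659
protein_length_aa = 552

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks hold for Apre_0469.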


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCAA AGAGTAAATT ATTAATAGGC TTACTGAGCT TGTCTATGAT ATTTTCTGCC 
TGCGGAAATA ACGATAAGGC AGATAATGGA AATGGAAACA ATCAAGAGGT AGTAAATGAA
GAAGAAGGCA AGAAGGACGG AGTCCTAGAT ATAAATATAG CAAGCGAGCC TGACTCAATA
GACCCAGCCC TAAACACTTC AGTGGATGGA GCTATAATGA TATCCCACCT ATTCGAATCC
CTAATCAGAT GGGACGACGA TGGGGAAGGT AATGCAGTCC TTAAACCTGG TATAGCGGAA
AGCTGGGAAG TATCAGATGA CGGTTTGACT TGGACCTTCA AGCTAAGAGA TGCTAAGTGG
TCTGATGGAA AGGAAATCAC AGCAGATGAC TTCGTGTATT CTTGGAACAG ACTAGTAGAT
CCTGCAACAG GAGCAGACTA TGAGTATATG CTAGATATGG TAAAGGGCTA TGATGAGAAA
AAGCTCGATA TCTCTGCACC AGATCCAAAA ACATTTGTAG TTAATCTAAA TGTAAAATGT
CCATACTTCG AAGAAATATG TGCCTTCCCT GCAGTAATGC CAGTAAGAAA AGACATCATC
GAAGCTAACA AGACTTGGAC AAATAGCCCA GAAACATTAG TATCAAACGG AGCTTACAAG
CTAGAAAAGT GGGACCACAA CTCTACTTTA TCTATGGTCA AAAACCCAGA ATACTATGAT
CAAGACTCAG TTAAGGCAGA AAAGTTAGCC TTCCACCTCC AAGATGACCA AAACGCAATC
TATGCATCAT ATAGGTCAGG AGACTTAGAC TTTATTAACT CAGTTCCACA AGAAGAAATC
CAAAAACTTC TAGATACCAA AGAACTAAAG ATAAAACCAT ATGTAGGGAC ATATTTCGTA
TGCTTCAATA CTGAAAAAGA ACCATTCAAC GATCCAAAAG TTAGAAAGGC CTTCTCTCTA
GCCATAGATA GAAACTTCAT CGTAAACCAA GTTACAGGTC AAGGCCAAGA GCCAGCTACA
GCTTACGTTC CATCTGGAGT ATATGATGCC AAGGGAGCTG AAGGTGATGA CTTTAGAACT
GTTGGTGGAG ATTATTACTC TATAAATGAC GAAGATTACG AAAAAAATAT CGAAGAAGCT
AAAAAGCTAA TGGAAGAAGC AGGCTATAAA GATGGCGAAG GCTTCCCACA AATCGATTAC
TTGTACAACA CCGACGAAAA CCACAAGGCT ATAGCAGAAG CCCTACAAAA CATGTGGCAA
GAAAACTTAG GCGTACAAGT TAGCCTACAA AACCAAGACT GGAATGTATT CCTCAAAGAA
AGAAAAGAAG GAAACTACAA CATAGCAAGA CACGGTTGGA TAGCAGATTA CAACGATCCA
ATGAGCTTTA TAGATATGTG GCTAACAGGC GGTGGAAATA ACGATGCCCA ATACAAGAAC
CCAGAGTTCG ACAAGTTCGT AAAAGCTGCC AAGGCTACAT CAGATCCAGA CGAAAGAATG
GAAAATATGC ACAAGGCTGA AGATATTCTT ATAGGAGAAG ATAACGTAGT TGCACCATTG
TACTTCTACA ACAACTCTTA TATGATGAAA CCAAACATCA AAGGCCTATA CTACACACCA
CTAGGATACT TCTTCTACAA AGGTGCAGAA GGATTCTAA
 
Protein sequence
MKAKSKLLIG LLSLSMIFSA CGNNDKADNG NGNNQEVVNE EEGKKDGVLD INIASEPDSI 
DPALNTSVDG AIMISHLFES LIRWDDDGEG NAVLKPGIAE SWEVSDDGLT WTFKLRDAKW
SDGKEITADD FVYSWNRLVD PATGADYEYM LDMVKGYDEK KLDISAPDPK TFVVNLNVKC
PYFEEICAFP AVMPVRKDII EANKTWTNSP ETLVSNGAYK LEKWDHNSTL SMVKNPEYYD
QDSVKAEKLA FHLQDDQNAI YASYRSGDLD FINSVPQEEI QKLLDTKELK IKPYVGTYFV
CFNTEKEPFN DPKVRKAFSL AIDRNFIVNQ VTGQGQEPAT AYVPSGVYDA KGAEGDDFRT
VGGDYYSIND EDYEKNIEEA KKLMEEAGYK DGEGFPQIDY LYNTDENHKA IAEALQNMWQ
ENLGVQVSLQ NQDWNVFLKE RKEGNYNIAR HGWIADYNDP MSFIDMWLTG GGNNDAQYKN
PEFDKFVKAA KATSDPDERM ENMHKAEDIL IGEDNVVAPL YFYNNSYMMK PNIKGLYYTP
LGYFFYKGAE GF
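
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the database's own pipeline: the helper names `translate` and `gc_fraction` are hypothetical, and the codon table is built from the standard amino acid assignments in TCAG order, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The space-separated 10-base groups used in this record are stripped before processing.

```python
from itertools import product

# Amino acid assignments for codons in TCAG order (first base varies slowest).
# Assumption: table 11 uses the same assignments as the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(dna: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence, copied from this record.
head = "ATGAAAGCAA AGAGTAAATT ATTAATAGGC"
print(translate(head))

# Last nine bases of the gene sequence, ending in the TAA stop codon.
tail = "GGATTCTAA"
print(translate(tail))
```

The first call yields the opening residues of the protein sequence listed above (MKAKSKLLIG), and the second yields its final two residues (GF). Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.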