Gene Apre_1226 details


Gene Information

Locus tag: Apre_1226
Symbol:
ID: 8398015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1309274
End bp: 1310353
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 40%
IMG OID: 644995571
Product: extracellular solute-binding protein family 1
Protein accession: YP_003152971
Protein GI: 257066715
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
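
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1309274, 1310353)
print(length)                     # 1080
print(protein_length_aa(length))  # 359
```

Both results agree with the Gene Length and Protein Length values listed in the record.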


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000135939
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAGG TATATAAATT TTTCCTAGCC TGTATTGTGG GAATTTTCCT CCTCCTAGGC 
CTAAAGTCCA TTCTTCTTGG AGGCAAGGAA ATAGCAGAGG CAAATACCTT TTACTTATAC
AATTGGGGAG ATTATATAGA TCCTGAGCTT TTGGATAAGT TTGAGGAAGA GACTGGTTTT
AATGTAGTTA TGGAGACTTT CGATTCAAAT GAGGCCATGA TTACCAAGAT TAAACAAAAA
TCAACCGACT TTGATATCTG TATTCCCAGT GAATATGCAG TGGAGATGAT GAGAGACCAG
GGGCTACTAG AAAAGCTTGA CCATTCGAAA ATCGAGGGCC TTTCTAATAT CGACGAGAGA
TTCCTAGATA GGGAATATGA TCCGGGAAAT GAATATTCGA TTCCTTATCT TTGGGGGACC
TTCGGTATCT TGTATAATAC TAAAAAATAC CAGGCTAGCG ACTTTGATTC CTGGAAGAAC
TTGTGGGATC CTAAGTTTGA GGGAGAAATC CTAAGCTTTG ATGGAGCTCG TGAGACAATG
GGAATAGGAC TTCTCGCAAA TAACTTAAGT CTAAATACAG AAGATCCAAA AAAGCTTACA
GAAATAAGGA ATGAGCTTAT AGGTCTTATG GGCAATGTCA AGGCCATCCT TGCCGATGAG
ATAAGGATGT ATATGGCCCT AGAAGAGGCC AATGTCGGCC TGACATTTTC GGGAGATGCC
TCAAGTGCCA TAGAATCTAA CGAAGATCTT TCCTATGTCA TACCAAAGGA GGGTTCTAAC
ATTTGGTTTG ATACCATGGT TATCCCAAAA ACTAGCAAGA ACAAAAAGGC TGCCTATGCC
TTTATCAACT TCATGTTAGA GCCAGAAAAT GCTGCCCAAA ATGCAGACTA TATTTGGTAT
GCGACTCCAA ACAAGAAGGC CATGGATTTA ATAGATTCTG AGGCTCGAAA TGACAAGACC
CTTTATCCAG ATGATGAAAT TATAAATAAA CTAGAAGTCT TCAAGGCCCT AGATAAGGAA
AGCACTATTT TATATAATGA CCTTTTCCTA GACCTTAAGA TTTCACCACA AGCGGAGTGA
 
Protein sequence
MNKVYKFFLA CIVGIFLLLG LKSILLGGKE IAEANTFYLY NWGDYIDPEL LDKFEEETGF 
NVVMETFDSN EAMITKIKQK STDFDICIPS EYAVEMMRDQ GLLEKLDHSK IEGLSNIDER
FLDREYDPGN EYSIPYLWGT FGILYNTKKY QASDFDSWKN LWDPKFEGEI LSFDGARETM
GIGLLANNLS LNTEDPKKLT EIRNELIGLM GNVKAILADE IRMYMALEEA NVGLTFSGDA
SSAIESNEDL SYVIPKEGSN IWFDTMVIPK TSKNKKAAYA FINFMLEPEN AAQNADYIWY
ATPNKKAMDL IDSEARNDKT LYPDDEIINK LEVFKALDKE STILYNDLFL DLKISPQAE
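
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below strips the 10-base grouping, translates codons, and computes a GC fraction. It is a minimal example, not the method used to produce this record. The codon table is built from the standard amino-acid assignment, which NCBI translation table 11 shares for all sense and stop codons (table 11 differs only in its permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G with the
# first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence shown in this record.
first_row = "ATGAATAAGG TATATAAATT TTTCCTAGCC TGTATTGTGG GAATTTTCCT CCTCCTAGGC"
print(translate(first_row))  # MNKVYKFFLACIVGIFLLLG
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.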