Gene Apre_0387 details

Gene Information

Locus tag: Apre_0387
Symbol:
ID: 8397161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 438242
End bp: 439753
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 39%
IMG OID: 644994745
Product: extracellular solute-binding protein family 5
Protein accession: YP_003152157
Protein GI: 257065901
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
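
The listed lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length agrees with that reading) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 438242
end_bp = 439753

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length)     # 1512
print(remainder)       # 0
print(protein_length)  # 503
```

Both values agree with the Gene Length and Protein Length fields in the record.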


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000219856
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGA GAAACATATT CGTAAGCCTT GCCCTAGCAG GTCTACTGCT AACTTCTTGC 
GGCAATCAGG ATAAGAAGGA AAATACCAAA ACTAGCGATG CGCCACTTAA GGTAGCCATG
TATACTGAAA TCGATTCGCT CGATCCTTTC AATGCTACAG CAGGAGATAC CAAGACAATC
ATGGATCAAG TATTTGATGG ACTCTTCGAT GTGGATGAAG ATGGAAATCT AGTTCCTGAC
CTTTGTGAAT CTTACGAGAT AAGTGAAGAT GGCCTTACTT ATGATTTCAA ATTAAAAGAA
GGGGTCAAAT TCCACAATGA TAAGGACTTT ACTGCAGATG ATGTCTACTA CACTTATGAT
ATCCTAGCAG GACTAACAAG TGGAGAGCCA AAGTCTTCTA AGTTTGCCCA AATAGAATCA
ATGGAAGTAG CCTCTCCTAC AGAAATTAAA ATCAAATTAA AGGAAAAATC TAATTCCTTT
ATCTATCTAA ATACTCAACC AATAGTTCAA AAAGACTATG AAGACAATCA GACAAAGCCA
ATAGGAACTG GTCCTTTCGA GTTTGTTTCC TACACACCAG GTGAGGGCAT GAAGCTAAAA
AGATTTGACG ACTATCACAG AAAAGACCAT ATTGCGAAGT TTGCTGACGT AGAGATTCTA
AGAATTGCCG ACAGACAAAC CCTAATCATG GCCCTAAACA ACAAGGACGT TGACCTTGCT
ACAGGACTTA CTAATGATGA GCTAAGCCAA ATAGAAGAAA CTTGCGATAT CCATTCCTTC
CCACAAAACT TAGTTCAAGT CCTAGGTCTT AATAACGATG TTAAGCCTTT CGATGATATG
AAGGTAAGAC AAGCTATTGC TTACGCAATC GATAAGGATG AAATCATAAA CACAGCGGCA
GGTGGCAGGG CAAGTAAGCT AGTATCAAAC TTCTCCCCAG CCCTTAAAGA ATATTACAAT
GATATGGAAG AAAAATACCC TTACAATCCG GAAAAGGCCA AGGAACTTTT AAAAGAAGCA
GGCCTTGAGG ACGGATTTTC TGTAAAACTA ACTGTGCCAA GTGATTACAA ATATCATATG
GACACAGCAG AGCTAATCCA AGCCCAACTA GGTAAGGTAG GAATTGATGT AACACTTGAT
CCAATCGAGT TTTCTACTTG GCTAAGCAAG GTATACAAGG ATAAGGACTA TGAGGCTACA
GTTTCAGGCT TTGTAGGTTA TGTTGATCCA ATCAGAGTAA TCGATAGATA TGTATCAACT
AATGACAAAA ACTTCATCAA CTACAAGTCA GAAGGTTATG ACGAGGCTAT AAAGGCAGCC
CAAAGTGCAG ATAATAAGGA AGATATAATC CAAAATGTAA AAGATGCCCA AGAATTTATG
GCAGAAGATG CAGGATCAGT CTTCCTAACA GACCCTGACA ACAACCAAGC CCTAAACAAG
GACCTCACAG GACTTAAGTC CTATCCAGTA CAAAAGATAA ATCTAGAGGA TATAGAAAAG
AAAAATGACT AA
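
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch, not part of the original record: `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example uses only the first and last lines of the listing rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence listing (not the full gene).
first_line = "ATGAAAAAGA GAAACATATT CGTAAGCCTT GCCCTAGCAG GTCTACTGCT AACTTCTTGC"
last_line = "AAAAATGACT AA"

fragment = clean_sequence(first_line + "\n" + last_line)
print(len(fragment))                # 72
print(fragment[:3], fragment[-3:])  # ATG TAA
```

Applying `gc_fraction` to the full cleaned sequence gives a value that can be compared with the GC content field of the record.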
 
Protein sequence
MKKRNIFVSL ALAGLLLTSC GNQDKKENTK TSDAPLKVAM YTEIDSLDPF NATAGDTKTI 
MDQVFDGLFD VDEDGNLVPD LCESYEISED GLTYDFKLKE GVKFHNDKDF TADDVYYTYD
ILAGLTSGEP KSSKFAQIES MEVASPTEIK IKLKEKSNSF IYLNTQPIVQ KDYEDNQTKP
IGTGPFEFVS YTPGEGMKLK RFDDYHRKDH IAKFADVEIL RIADRQTLIM ALNNKDVDLA
TGLTNDELSQ IEETCDIHSF PQNLVQVLGL NNDVKPFDDM KVRQAIAYAI DKDEIINTAA
GGRASKLVSN FSPALKEYYN DMEEKYPYNP EKAKELLKEA GLEDGFSVKL TVPSDYKYHM
DTAELIQAQL GKVGIDVTLD PIEFSTWLSK VYKDKDYEAT VSGFVGYVDP IRVIDRYVST
NDKNFINYKS EGYDEAIKAA QSADNKEDII QNVKDAQEFM AEDAGSVFLT DPDNNQALNK
DLTGLKSYPV QKINLEDIEK KND
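
The protein sequence can be related to the gene sequence by translating codon by codon. The sketch below is an illustration under stated assumptions, not the database's own method: it builds the codon table from the standard amino acid assignment string, which translation table 11 shares for internal codons. Table 11 also permits alternative start codons that are read as methionine at the initiation site; that case is not handled here, and this gene begins with ATG. `translate` is a hypothetical helper name.

```python
from itertools import product

# Codons in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listing.
first_line = "ATGAAAAAGA GAAACATATT CGTAAGCCTT GCCCTAGCAG GTCTACTGCT AACTTCTTGC"
print(translate(first_line))       # MKKRNIFVSLALAGLLLTSC

# Last line of the gene sequence listing, ending in the TAA stop codon.
print(translate("AAAAATGACT AA"))  # KND
```

These two fragments match the first twenty and last three residues of the protein sequence above.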