Gene Apre_0945 details

Gene Information

Locus tag: Apre_0945
Symbol:
ID: 8397731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1008113
End bp: 1009333
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 33%
IMG OID: 644995292
Product: peptidase T
Protein accession: YP_003152694
Protein GI: 257066438
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01882] peptidase T
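
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the variable names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the Gene Information fields.
start_bp = 1008113
end_bp = 1009333

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1221
print(protein_length)   # 406
```

Under these assumptions the computed values agree with the listed Gene Length (1221 bp) and Protein Length (406 aa).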


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAATA TAACGAAAAG ATTTCTTAAG TATATATCTT TTGATACAAA AAGTGACCCA 
GAGTTAGGAA AAACTCAAAA ACCATCAACT CCAGGTCAAC TTCTACTTGC GAAAGAACTT
AAAAAAGAAC TAGAGGAATT GGGACTAGAA GCTAGTATAA ATAAAGAAGG ATTTGTATTT
GCAAAATTAG CTTCAAATAC AGACAAAGAA ATACCAATTG TTGGGTTTTT ATCTCATATG
GATACATCTC CTGAAATGTA TGGTAAGATT GATGATCCTC AAATAATTAA TTACGAAGGC
GGAGACATTA AGCTAAATGA CGAGAGATCT ATTAAGGTTA GTGAGTTTCC AATTCTTGAT
AAACTTAAGG GCTTAACGCT TATTACAACT AGAGGAGAAA GCTTATTAGG CGCTGATGAT
AAAAGCGGTC TAGCTTCAAT AATGAATGCT GTTGAGTACC TCGTAAACAA TCCTGATATA
GAGCATGGAG ATGTTATGAT AGCCTTTACT CCAGATGAGG AGATTGGTAC TGGATGCGAT
ACTTTCGATG TAGAATCTTT TGGAGCAGAC TTTGCTTATA CAGTTGATGG TGGCTATCTT
GGAGAATTAG AATATGAGTC TTTTAATGCT GCAAGTGGTC TTGTAAATAT AAAAGGTAAA
TCAATCCACC CTGGTTCTGC AAAAAATACT ATGGTTAACT CTATGAGCCT TGCTCGTGAA
TTTGATAGCT TACTAGGTGA TGTAAGAAGA CCTGAACATA CTGAAGGTTA CGAAGGATTC
TTCCATTTAT TAAGTATTAA TGGAGATATT GAAAATACAA AAATGGAATA TATAATTAGG
GAACATGATA GGGAAAAATT TGAAGCTATG AAAAAAGAAT TCTCCGATAA TGCTTCTTAT
TTAAACAAAA AATACGGAGA TTATATTAGT GTAGATATTT CAGATTCTTA TTATAATATG
GGTGAAGTAA TCGAGAAAAA TATGAAGATT GTTTATTATG CCAAAACAGC CATGGAAAAT
CTTGGGATAA AACCTATCAT TGAGCCAATC AGAGGGGGAA CAGATGGGTC TAAACTTTCA
TTCATGAATC TACCTACCCC AAATATATTT ACTGGTGGAA TGAACTATCA TGGAGTCTAT
GAAATTATAC CAATAGAGCA TATGAAAAAG GCTAGCGAAA CGGTTATAGA AATAATTAAG
CTAATTGCAA ACGATAATTA A
 
Protein sequence
MDNITKRFLK YISFDTKSDP ELGKTQKPST PGQLLLAKEL KKELEELGLE ASINKEGFVF 
AKLASNTDKE IPIVGFLSHM DTSPEMYGKI DDPQIINYEG GDIKLNDERS IKVSEFPILD
KLKGLTLITT RGESLLGADD KSGLASIMNA VEYLVNNPDI EHGDVMIAFT PDEEIGTGCD
TFDVESFGAD FAYTVDGGYL GELEYESFNA ASGLVNIKGK SIHPGSAKNT MVNSMSLARE
FDSLLGDVRR PEHTEGYEGF FHLLSINGDI ENTKMEYIIR EHDREKFEAM KKEFSDNASY
LNKKYGDYIS VDISDSYYNM GEVIEKNMKI VYYAKTAMEN LGIKPIIEPI RGGTDGSKLS
FMNLPTPNIF TGGMNYHGVY EIIPIEHMKK ASETVIEIIK LIANDN
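
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 60 and last 21 bases of the gene; it is an illustration, not the tool used to produce this record. The function and variable names are hypothetical. It assumes the codon assignments of translation table 11 match the standard genetic code for elongation (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the gene sequence, copied from the record.
first_60 = "ATGGATAATA TAACGAAAAG ATTTCTTAAG TATATATCTT TTGATACAAA AAGTGACCCA"
last_21 = "CTAATTGCAA ACGATAATTA A"

print(translate(first_60))  # MDNITKRFLKYISFDTKSDP
print(translate(last_21))   # LIANDN
```

The two results match the first 20 and last 6 residues of the protein sequence shown above.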