Gene Apre_1306 details

Gene Information

Locus tag: Apre_1306
Symbol:
ID: 8398096
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1398874
End bp: 1400121
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 36%
IMG OID: 644995651
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_003153050
Protein GI: 257066794
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
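
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; neither is stated in the record, but both agree with the listed Gene Length and Protein Length. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1398874, 1400121)
print(length)                     # 1248
print(protein_length_aa(length))  # 415
```

Both values match the Gene Length (1248 bp) and Protein Length (415 aa) fields.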


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000692323
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGAA ATAAATCTGG TGGAGGATTT TCACATTTTA TTTCAGCCCT TTTGGGAGCT 
GTTATCGGTG GTTTTGTTAT TTATTTTCTC CTAGGTGGAG CAGCTAATAA TGATAATAAC
ATTGCAGAAA GTGAAAATAA TCCAAAACAA ATAGAAAGTA AGGAAAATAA GGAAAGTACA
AAGAAAGATA TAAATATAAA TGCGGATGAT TCAATTGAAT CTGTTGTTGT GAAAAAATCA
ATTGACTCTG TAGTTGGAAT CAATACGATT TCTGAAGTTA CCCAAAATAC CTTCTTCGGC
CAACAATCTG GTTATGTCGA AGGAATCGGT TCTGGTTCAA TCGTCACAGA AGATGGTTAT
ATAGTAACCA ATTCTCACGT TGTAAGCAAT GGCGATGTGG CCCAGATTAA TGTTCTTTTC
TCCAATGGAG AGACTAGTGA GGCAGAGCTT GTGTGGAACG ATGCAGCTTT AGATCTTGCT
ATAATTAAGG TCAAAAAGGA CAATCTTCCA GCAATTGAGC TAGGAGATAG TGACAAGGTT
GGTATAGGGG ATAAGGCTAT AGCTATCGGA AACCCTTTAG GCTTTGAACT TCAATCAACC
GTAACTAGTG GTATTATTTC AGGTCTTAAT AGAACTGTTA AGTTTAATAC AGGAGTAAGC
ATGGATGGTT TAATGCAAAC AGATGCTGCC ATAAATGCAG GAAACTCTGG TGGTGCCCTA
CTTAATACCA AAGGAGAGCT TATAGGAATT AATACTGCCA AGGCAGGTAA TAGCGATGGG
ATCGGCTTTG CTATTCCTAT TAATATTGTA AAACCTGTTA TAGAAAAAGT AAGAAAGACA
GGTAAGTTTA ATTCAGTTTA TCTTGGAATT ACTGGCCAAT CTATTGATTA TATAAAACAA
ATTCCAAACT TCAATACCGA AAAACTTGGA ACTGATTACG GTGTATATGT TGTATCAAGC
TTTGATAGAA ATAGTGATAT CGAAGAAGGT GACGTGATAA CCGCCATAGA TGGAGTGGCT
GTAAAGGATA TGAATAGTCT AAGAAAGGCC CTCCTATCCT ATGCAGTAGG AGATAAGGCG
AAACTTACTG TCTATAGGGA TGGAGTCAAG AAAGAAATTG ATGTAGAATT TAATATAGAT
TCTTCAAATA TAGACGAGTT CGATAAGGCT CAACCTAGCG AAGATAATAC AAGCGAAGAT
AATAAAGGAA GTAAAGATAG ATTAAATCCA TTCTTTAACC TTCCATAA
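
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the full gene length; the record does not say how the 36% figure was derived or rounded. The helper name `gc_content_percent` is hypothetical.

```python
def gc_content_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base blocks and line breaks used in this record)
    is removed before counting.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First 10-base block of the gene sequence: 3 G, 0 C out of 10 bases.
print(gc_content_percent("ATGAGAAGAA"))
```

Passing the whole gene sequence as one string gives a value that can be compared with the listed GC content.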
 
Protein sequence
MRRNKSGGGF SHFISALLGA VIGGFVIYFL LGGAANNDNN IAESENNPKQ IESKENKEST 
KKDININADD SIESVVVKKS IDSVVGINTI SEVTQNTFFG QQSGYVEGIG SGSIVTEDGY
IVTNSHVVSN GDVAQINVLF SNGETSEAEL VWNDAALDLA IIKVKKDNLP AIELGDSDKV
GIGDKAIAIG NPLGFELQST VTSGIISGLN RTVKFNTGVS MDGLMQTDAA INAGNSGGAL
LNTKGELIGI NTAKAGNSDG IGFAIPINIV KPVIEKVRKT GKFNSVYLGI TGQSIDYIKQ
IPNFNTEKLG TDYGVYVVSS FDRNSDIEEG DVITAIDGVA VKDMNSLRKA LLSYAVGDKA
KLTVYRDGVK KEIDVEFNID SSNIDEFDKA QPSEDNTSED NKGSKDRLNP FFNLP
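
The protein sequence above can be reproduced from the gene sequence by codon translation. The sketch below is plain Python with no external libraries. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the tables differ in which codons may act as start codons), and it assumes the sequence is given 5' to 3' on the coding strand, which matches the ATG start and TAA stop seen here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE.get(cleaned[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene and the last 18 (including the TAA stop).
print(translate("ATGAGAAGAA ATAAATCTGG TGGAGGATTT"))  # MRRNKSGGGF
print(translate("TTCTTTAACC TTCCATAA"))               # FFNLP
```

The two outputs correspond to the first and last residues of the listed protein sequence.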