Gene Apre_1337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1337 
Symbol 
ID8398144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1438719 
End bp1440929 
Gene Length2211 bp 
Protein Length736 aa 
Translation table11 
GC content38% 
IMG OID644995699 
Productpeptidase U32 
Protein accessionYP_003153081 
Protein GI257066825 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAAT CTGAAATCCT AGCTCCAGTT GGTAATGAAG AGATGCTTTA TGCAGGCCTT 
GCCGCTGGGG CTTCTAGCTT TTATATGGCA GTGGACGACT TTGGAGCCCG TGCCTATGCC
AAAAACTTTG ATATAGAAAA TGTGGGAAGC TTTATCGACC TCATCCACCT ATTTGGAAAG
AAGGTATTTG TGACTATGAA TATCTTAATC AAGGATGAGG AGATGGAAAA GGCGGTATAC
TACGCCAAAA AGCTCTACGA ATGCGGGGCA GATGCCCTCA TAATCCAAGA TCTGGGCCTA
TTTACCATCC TAAAAGACCA AGTGCCTGGA ATGGATCTAC ATGCTTCAAC CCAAATGGCC
GTAAGAGACT ATTACGGAGC GAAAAGCCTC ATGGATATGG GTTTTGATAG GGTAGTTATT
GCTAGAGAGA CCCCAATCGA GGAGCTTAGA AAGATTGCTA CCCTTCCTGT AGAAAAGGAA
GTCTTCGTCC ACGGGTCCTT GTGCGTATCT TACTCTGGAG AATGTTTGAT GAGCTCATAC
TTTGGAGCTC GTTCTGCCAA TAGGGGGAGA TGTGCTGGGA TTTGTAGGCA AAAATACTCC
CTTATAGCAG ATGGGAAAAC TTTGGCTGAT GATTATTTCC TTAATATGAG AGACCTGAAT
GTCATAGATC AAATCGATCA GTTAGTAGAC CTCGGCATAG ATTGCTTCAA GATCGAAGGC
AGGATGAAAA GTCCAGAATA TGTCTATGCT AGTGTAAAAA GCTATAAGGA TAAGATAGAT
AAAAATTATT ACGACAAAAA TGATTTAAGA GATATATCAA ACAGAGGCTA CACCAAGGGC
TTTATCTTTG GACAAAAGAG TGATTATGTA AGACTTTCAT CTGATGCCAA GCATAGGTCT
GTGGGGAAAG TCATCAAAGA AGGAAATAAA AAGTACTTCA TTAATTCATC CGAACTTCTT
CTAGGAGATA ATCTAGAGAT AATCACAGAT AAGGGCAAGA AATTACCCTA TACCTTGACC
GAAAATCTAA AGAAAGAGTC TAAGATTTAT CTCGACCAAT ATCCAGACGC CAAGGAAGGC
TCTGACGTAT ATATCTTAAA TTCAAAAAAG ATAGGACTTA ACTTAGAAAA GGCTCTAGGT
GAATATAAAA ATCTTCCGAT AAGAATTGAC TTTAGGGCCA AGGTAGGCGA GCCTGCAGAG
ATTACCATGA CCTATGAGGA TAAGTCAATT AGTCTAAGTA CAGATGACAA TCTAGAAAGG
GCTAAGAAGA TTTCTCTGAC AGAAGAAGAC TTAAGGGAAA ACTTAAGTAA GTTCGGAGAC
GATATCTATA AGGCAAGGCA AATTAATATA GTCATGGATC CAGATGTCTT CATTAGAAAA
AAGGACATCA ACAGATTAAG AAGAGAAGGA TCTGCTAAGC TTAAAGAAGA AATCCTCAAA
TCCTTTAGGA GAGATGAGAT AGATATAGAA ATCCCTGAAG TTGGTAAAAA TAAAAACCAC
AAGAAGGAAG TAAATGCAGA GCTTAAAAAT ACTAATATAA TCCCAAGTCT CCTAAAAGAC
TTCGATAATA TTTACCTAGA AGAATATGAC GAGAAATATG CTGGCCTTAG TCTTTATCTA
ATCCTTAATT CCCACACAGA CTACGATATA GATGAGCTTA TAGCCTTCAT TAAGGAAAAA
TCTATCAAGG GTGTAGTCTT CAATAACTAT AGAGATCTTG CCTTTGTTGA TAAATTCAGG
GAAAATAATA TAAAAATTAG GATAGGCAGA TACCTAAATG TCTTCAATAA ATTTACCTAC
GATTTTTACG ATAAGTTCGC TGAGATGACT TCATCATCGG TAGAAGCGAC ATTCGATTCT
ATAAATCAAA ACGGCAGAGA CTTTGATGTA GAAGCCCTAG CCTTCGGAAG AATTGAGCTT
ATGAATATGG TCCATTGTCC ATTTTCTACA ATCAAGAAGT GTGGCCTAAA GGGGTGTGCT
ACTTGTAAGT TTAGAAATGG AGAAATGGTA AATGAAAATG GCGATAGGCT TAAGATAATA
AGAAGAGAAG GGATAAGTAG GATTTATCCA TCTGAGGCAT CTAAGATAGA TAGGAGAAAC
TTTTCTACTG ATATATCCCT ACTAGTCCAG GTCTTTTCGG ATGAAGATAT AAGAGATTAC
CAAGACAGAA GAGAAACTAA CAATCTGAAT TACGATAGAG GTGTTATTTA A
 
Protein sequence
MAKSEILAPV GNEEMLYAGL AAGASSFYMA VDDFGARAYA KNFDIENVGS FIDLIHLFGK 
KVFVTMNILI KDEEMEKAVY YAKKLYECGA DALIIQDLGL FTILKDQVPG MDLHASTQMA
VRDYYGAKSL MDMGFDRVVI ARETPIEELR KIATLPVEKE VFVHGSLCVS YSGECLMSSY
FGARSANRGR CAGICRQKYS LIADGKTLAD DYFLNMRDLN VIDQIDQLVD LGIDCFKIEG
RMKSPEYVYA SVKSYKDKID KNYYDKNDLR DISNRGYTKG FIFGQKSDYV RLSSDAKHRS
VGKVIKEGNK KYFINSSELL LGDNLEIITD KGKKLPYTLT ENLKKESKIY LDQYPDAKEG
SDVYILNSKK IGLNLEKALG EYKNLPIRID FRAKVGEPAE ITMTYEDKSI SLSTDDNLER
AKKISLTEED LRENLSKFGD DIYKARQINI VMDPDVFIRK KDINRLRREG SAKLKEEILK
SFRRDEIDIE IPEVGKNKNH KKEVNAELKN TNIIPSLLKD FDNIYLEEYD EKYAGLSLYL
ILNSHTDYDI DELIAFIKEK SIKGVVFNNY RDLAFVDKFR ENNIKIRIGR YLNVFNKFTY
DFYDKFAEMT SSSVEATFDS INQNGRDFDV EALAFGRIEL MNMVHCPFST IKKCGLKGCA
TCKFRNGEMV NENGDRLKII RREGISRIYP SEASKIDRRN FSTDISLLVQ VFSDEDIRDY
QDRRETNNLN YDRGVI