Gene Apar_0712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0712 
Symbol 
ID8413573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp790209 
End bp791579 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content44% 
IMG OID645022290 
Productpeptidase M50 
Protein accessionYP_003179732 
Protein GI257784515 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.41926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000525349 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAATTTGT TAGCCATTCT TTCACCAATT TTTTGGGGAT TGCTTGTTCT CTCGCTTCTT 
GTATTTGTTC ACGAGGCTGG TCATTATGGC ATGGCTCGTC TGTGTGGCGT TCGTGTTACT
GAGTTTTTCC TGGGCATGCC GTTCAGGTAT AAGCTTTCTC ACAAGAGCAA GAAATACGGA
ACCGAGGTTG GAGTAACACC GCTTCTTTTT GGCGGCTATA CTCGTATCTG CGGCATGGAA
GGTGAGCTTG ACGAGCTATT GCCGCAGGCT CTGATGAGCG TTCAAGAGGC AGGTATGCTT
GAGGTTGGCG CTCTTGCTGC AAAACTAAAC TGCGAGGATG AACGTGCATT TAGCCTGCTG
TACACTCTTG CAGATTGGGG CTCTATTGAG CTTGTTAAAG AGTCAGAATC AGAGGAGCTG
GCTCACACTG TGTTCCAGAC CCTTGCACGA GATGCTGAAC TTCGCTGTGA GTATGACCGT
GGTCATAATT TTGAGTTGGA AGGCTCAACT GCAGCGGGAG AGCCTCGTGT TCCGCAAATG
TCTACAGAGG AATTTTTTGC TCAAGAGAAG GCTCATACCT ATGTCGGCGT AAATGTCCCC
AAACGCCTGC TTATGATCTT GGGTGGACCC TTGGTCAACA TTGCGCTTGC TTTTTTATTA
GTTGTTGGTT CTTTGATGTT TGTTGGTGTT CCTACCGCTC AAAATAAGGC ACAGTTGGGG
TCTGTTGAGT CAAACTCTTT AGCTGCTATT TCAGGTCTTA ATCCGGGAGA TACCATTCTT
ACCTTTAACG GAGTAGAGGT ACATACCTGG GAAGAACTCA CTGTAGCTAT TAAAGAAGCA
ATGTCTGCTG ATGGAAAGGA TATTCCTGTC ACCTATGATC GTGGTGGCAT TCAGCTTGAA
ACTACAATCA AACCTGTTTT GCGTCCTGAC GATAAAATCA TTGGCGTGTC ACCTGTGATG
ATCACGTATC ACTTTTCGTT TATTGATGCA TCTGCTGCAG CTGTATCGTA TGCCGCACAG
GTTGGTCAAT TTGCTCTAAG ATTGCTCATT CCCACACAGA CTATGGAAGT TCTCAATCAG
TCATCTTCTG TTGTAGGAAT TTCTGTTATG GCTTCAAAAG CTGCGGCAGA GGGTTTCTCT
ACCCTTATTA TGCTAGTAGC AGCTATTTCT ATGTCTCTTG GGTTTATGAA TCTTTTGCCC
ATTCCACCTC TTGATGGTGG AAAGATTCTG ATTGAGGTTA TTCAGATTAT TGTTAGAAAG
CCACTGTCTA TCAAAGTTCA AAATATTTTG TCTTATATTG GACTGGCATT TTTCCTGTTT
GTATTTGTTG TTGCACTTCG AAACGATATT CTTCATTTAC TTTTTAGGTA G
 
Protein sequence
MNLLAILSPI FWGLLVLSLL VFVHEAGHYG MARLCGVRVT EFFLGMPFRY KLSHKSKKYG 
TEVGVTPLLF GGYTRICGME GELDELLPQA LMSVQEAGML EVGALAAKLN CEDERAFSLL
YTLADWGSIE LVKESESEEL AHTVFQTLAR DAELRCEYDR GHNFELEGST AAGEPRVPQM
STEEFFAQEK AHTYVGVNVP KRLLMILGGP LVNIALAFLL VVGSLMFVGV PTAQNKAQLG
SVESNSLAAI SGLNPGDTIL TFNGVEVHTW EELTVAIKEA MSADGKDIPV TYDRGGIQLE
TTIKPVLRPD DKIIGVSPVM ITYHFSFIDA SAAAVSYAAQ VGQFALRLLI PTQTMEVLNQ
SSSVVGISVM ASKAAAEGFS TLIMLVAAIS MSLGFMNLLP IPPLDGGKIL IEVIQIIVRK
PLSIKVQNIL SYIGLAFFLF VFVVALRNDI LHLLFR