Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0378 |
Symbol | |
ID | 8413227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 437076 |
End bp | 440108 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 645021946 |
Product | Peptidase M16C associated domain protein |
Protein accession | YP_003179400 |
Protein GI | 257784183 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.858415 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCTGG AAGGGACAAG ATGGTCCAAA AACGGCATTA CTGCTCACGT AAGTTTTCGT ATAGATTGGC AGCTTATGAC ATCCAAATAT TTACAGATGG ACAGCGCGTT TTCTGTTGGC ACGCTTCATG CAGAAGACAA AAGCTTTGAG ATAATCTCGG CTGAGTGGGT TAACGAGATT TCTGGCTATG CATACATTTT CAAGCACGTT CCCACGGGCG GACGCCTTAT GTGGTTTGCC TGTGACGACG ACAATCGCTC ATTTGCTATT GCGTTTAAGA CACCTCCTGT AGATCACACG GGCGTCTTCC ACATCCTGGA GCACTCGGTT CTCTGCGGAT CCGATGCCTA TCCTGTTAAG GAGCCTTTTG TTAACCTGCT CAAAACCTCT ATGCAGACAT TCTTGAACGC GATGACCTAC CCTGACAAAA CGGTTTACCC TGTGGCTAGC ACTAACGTAG CTGACCTGGA AAACCTTATG AGTGTGTACC TAGACGCCGT CTTGCACCCT GCAATCTATA AACGCAAGCG CATCTTTGAA CAGGAAGGCT GGCACCTAGA GGCTGATGAC CAGGGAAATC TGAGTTATAA CGGTGTTGTC TTTAACGAGA TGAAAGGCGC ACTCTCTAAT CCTGATCGCG TGCTCTATGA TTCTGTCAGC GAAGCTCTCT TCCCTGACAC TGCCTACGGT AAAGAGTCCG GCGGCAAGCC TCGCGCAATC CCTAAGCTCA CCTACGAGAA CTTCCTGGAC GCCCACGCTC GCCACTATGA CCTCTCTAAC AGCTACACCT TCCTCTATGG CGATCTTGAT TGCGAGCGTG AGCTCTCTTT TATTGCCCAG AGGTTTGCAG CTGCCGAGAA ACGCGATGCA GGTGCTCCAA ATCCGCTTAA CCTACAGACA CCCGTGCTGC CCAAGCCTTG CCAGATCCAC ATGAACACCA CAGCCGACAA CTCCAGCGTT GGCCTAGGCT ACGTACTTGG AACACCAGAC CAGCGCAACA AAATGATGGC AGCAGATATC CTGTTTGATA CCCTCATGGG CTCCAATGAA TCTCCGCTCA AACGCGCAAT TCTTGACGCA GAGCTAGGTG ATGATTTTAG CTACTACCTT TCCGATGATC TTGCACAACC TATGCTCTTC TTGCAGCTCA AAGGTCTTAA AAAGGGTGCA GCTCAAAAGT TCCGTGAGCT TGTAGAATCA ACCTGCCAAA AAATTGCTAC CGAAGGCATT AATCAAAAGA AGCTCAGCGC TTCCATTGCA CTTGCAGAGT TTAACCTGCG CGAAAATGAT CAGCCTTACT CAAACGGTAT TGAATACACG CTGCGTTCGC TTTCAAGCTG GCTCTACGAC GATGCGCGCC CTCTAGACTA CATCCGCTAC GAGGACGCCA TTGCTTACGT CAAAGAACTT GCAGCTCAAA GGGGCTTTGA GAAGCTGCTG CTAGAGCTTA TCTGCAATAG CAAACATGCA GCTCAGGTCG AGCTTGTTCC CACAGACGAG GGAGATGCCC AAGAAGAAGC CACCGAGCTG GAACAGCTTC GCTCCACACT TACCGATAAA GACGTCGAGA AGATTCGTGC AGAAGTTGAG GCGCTCCGTC TGGAGCAAGA AACACCTGAC GCTCCAGAAG ATCTTGCCAA GCTTCCGTCC CTCTCGCTCA GTGATATTGG TGCAGGTAGA GAGCGCCCTG CTGGCTTTGA AGTTAAGGCT CCCCTGCCTT GCGTTGCACA CGAGCTGGAC ACTCACGGCA TTGACTACGT CTACCATTAT TTTGATTTGA CTCACGCGGT TACCTTTGAG GAACTGCCGC TTGTTGGTGT TCTTGCTGAG GTACTGGGCA AGCTTGATAC AGCTGCTCAC ACTGCATCTG AGCTGGATAT TCTTATTGAG AGCAACCTTG GTCATCTCTC GTTCTTCACA GATATCTACG ATCAAGACAC ACTTGACCAG GCATACCCTG CTTTTATTGT TGCAGCTAGT GCACTCACCG AGAAAACCGA AGAACTTGCA AGCATTCCTT CTGAGGTCTG GTCCAGCACG CGTTTTGATG ACCTAAACCG TCTAAAAAAC ATCCTGACTC AGCGTCGTAT TGCGCAAGAG CAGTACTTTG TAGGTGCTGG TCATACAGCT GCGCAAAATA AGGCTTTGAC CTCGTATTCT GCCGCTAGTC GCGTAAATGA TGCGCTGGCG GGTGTTGGCT TCTACGAGTA CCTAAAAAAC CTGCTTTCCA ACTGGAATCA GCGTGCTCCT CAGCTTGCAA AAGACTTAGA TGCACTAACC CACAAAATTT TCCGCGTAGA TAACGTTACC GTCAGCTTTA CTGGCTCCAT GCAAAGTCGA GACGCATTCT GGAAAGTGGC AGGAGATCTC AACCTCAAGA AAAGTAATGA ATCGCAAGCC GACAGCGCAA GGTCAACGCT TGTTGTCCCT GAGGGCAAAC TACAGCGCGT GGCATATATC ATTCCATCAA ATGTCTCTTA TGTTGGTCTC TCCTATCCAA ACGTTGCCCA TGCAACCAAT GAACAGCAGG GTGACTGGCT TATTGCTACC AAAGTTTTGG GCCTTGACTA CCTGTGGAAC GAGGTCCGCG TCAAGGGCGG CGCTTATGGC GTCATGTTCA GAAACTCCAT CGCTGGCTTG CAGAGCTTTG TCTCCTATCG AGATCCCTCG CTTGATGCAA CCCTTGACCG CTATGTTGGT GCGGGTAGTT GGCTCTCTAA ATGGACTCCA GACCAGGACG AGTTTGAGGG CTACGTAGTT GCCTCTGTTG CTGGCGTTGA CGCTCCTGTT CCCGCTCGTA TGCTTGCTCG CAGACAGGAC ATTGAATACT TCAACCATCG TGATCCAGAG CGTCTTCTTA AACTTCGCGA GAAGATCCTT CACGCCCAGG TTGAAGACAT CAAAGAGCTA GGAAACACTA TACCTCAAAG CCATGATGAC CTCTCGGTTG TTGTCTTTGG TGCAAAAGAC GCCATTGAAG CTTCCAAGCT TGATCTTAAG GTTGTTGACC TCTTTGGCGA TCAGGTGAAT TAA
|
Protein sequence | MALEGTRWSK NGITAHVSFR IDWQLMTSKY LQMDSAFSVG TLHAEDKSFE IISAEWVNEI SGYAYIFKHV PTGGRLMWFA CDDDNRSFAI AFKTPPVDHT GVFHILEHSV LCGSDAYPVK EPFVNLLKTS MQTFLNAMTY PDKTVYPVAS TNVADLENLM SVYLDAVLHP AIYKRKRIFE QEGWHLEADD QGNLSYNGVV FNEMKGALSN PDRVLYDSVS EALFPDTAYG KESGGKPRAI PKLTYENFLD AHARHYDLSN SYTFLYGDLD CERELSFIAQ RFAAAEKRDA GAPNPLNLQT PVLPKPCQIH MNTTADNSSV GLGYVLGTPD QRNKMMAADI LFDTLMGSNE SPLKRAILDA ELGDDFSYYL SDDLAQPMLF LQLKGLKKGA AQKFRELVES TCQKIATEGI NQKKLSASIA LAEFNLREND QPYSNGIEYT LRSLSSWLYD DARPLDYIRY EDAIAYVKEL AAQRGFEKLL LELICNSKHA AQVELVPTDE GDAQEEATEL EQLRSTLTDK DVEKIRAEVE ALRLEQETPD APEDLAKLPS LSLSDIGAGR ERPAGFEVKA PLPCVAHELD THGIDYVYHY FDLTHAVTFE ELPLVGVLAE VLGKLDTAAH TASELDILIE SNLGHLSFFT DIYDQDTLDQ AYPAFIVAAS ALTEKTEELA SIPSEVWSST RFDDLNRLKN ILTQRRIAQE QYFVGAGHTA AQNKALTSYS AASRVNDALA GVGFYEYLKN LLSNWNQRAP QLAKDLDALT HKIFRVDNVT VSFTGSMQSR DAFWKVAGDL NLKKSNESQA DSARSTLVVP EGKLQRVAYI IPSNVSYVGL SYPNVAHATN EQQGDWLIAT KVLGLDYLWN EVRVKGGAYG VMFRNSIAGL QSFVSYRDPS LDATLDRYVG AGSWLSKWTP DQDEFEGYVV ASVAGVDAPV PARMLARRQD IEYFNHRDPE RLLKLREKIL HAQVEDIKEL GNTIPQSHDD LSVVVFGAKD AIEASKLDLK VVDLFGDQVN
|
| |