Gene Apar_0378 details

Gene Information

Locus tag: Apar_0378
Symbol:
ID: 8413227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 437076
End bp: 440108
Gene Length: 3033 bp
Protein Length: 1010 aa
Translation table: 11
GC content: 50%
IMG OID: 645021946
Product: Peptidase M16C associated domain protein
Protein accession: YP_003179400
Protein GI: 257784183
COG category: [R] General function prediction only
COG ID: [COG1026] Predicted Zn-dependent peptidases, insulinase-like
TIGRFAM ID:
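
The length fields in the record above can be cross-checked against the coordinates. The sketch below is illustrative and not part of the source database: the helper names `span_length` and `expected_protein_length` are hypothetical, and it assumes the Start/End coordinates are 1-based and inclusive and that the CDS is complete with a terminal stop codon (the gene sequence in this record ends in TAA).

```python
# Values copied from the gene record.
START_BP = 437076
END_BP = 440108
GENE_LENGTH_BP = 3033
PROTEIN_LENGTH_AA = 1010


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of three.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks agree with the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 440108 - 437076 + 1 gives 3033 bp, and 3033 / 3 - 1 gives 1010 aa, matching the listed Gene Length and Protein Length.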


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.858415
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCTGG AAGGGACAAG ATGGTCCAAA AACGGCATTA CTGCTCACGT AAGTTTTCGT 
ATAGATTGGC AGCTTATGAC ATCCAAATAT TTACAGATGG ACAGCGCGTT TTCTGTTGGC
ACGCTTCATG CAGAAGACAA AAGCTTTGAG ATAATCTCGG CTGAGTGGGT TAACGAGATT
TCTGGCTATG CATACATTTT CAAGCACGTT CCCACGGGCG GACGCCTTAT GTGGTTTGCC
TGTGACGACG ACAATCGCTC ATTTGCTATT GCGTTTAAGA CACCTCCTGT AGATCACACG
GGCGTCTTCC ACATCCTGGA GCACTCGGTT CTCTGCGGAT CCGATGCCTA TCCTGTTAAG
GAGCCTTTTG TTAACCTGCT CAAAACCTCT ATGCAGACAT TCTTGAACGC GATGACCTAC
CCTGACAAAA CGGTTTACCC TGTGGCTAGC ACTAACGTAG CTGACCTGGA AAACCTTATG
AGTGTGTACC TAGACGCCGT CTTGCACCCT GCAATCTATA AACGCAAGCG CATCTTTGAA
CAGGAAGGCT GGCACCTAGA GGCTGATGAC CAGGGAAATC TGAGTTATAA CGGTGTTGTC
TTTAACGAGA TGAAAGGCGC ACTCTCTAAT CCTGATCGCG TGCTCTATGA TTCTGTCAGC
GAAGCTCTCT TCCCTGACAC TGCCTACGGT AAAGAGTCCG GCGGCAAGCC TCGCGCAATC
CCTAAGCTCA CCTACGAGAA CTTCCTGGAC GCCCACGCTC GCCACTATGA CCTCTCTAAC
AGCTACACCT TCCTCTATGG CGATCTTGAT TGCGAGCGTG AGCTCTCTTT TATTGCCCAG
AGGTTTGCAG CTGCCGAGAA ACGCGATGCA GGTGCTCCAA ATCCGCTTAA CCTACAGACA
CCCGTGCTGC CCAAGCCTTG CCAGATCCAC ATGAACACCA CAGCCGACAA CTCCAGCGTT
GGCCTAGGCT ACGTACTTGG AACACCAGAC CAGCGCAACA AAATGATGGC AGCAGATATC
CTGTTTGATA CCCTCATGGG CTCCAATGAA TCTCCGCTCA AACGCGCAAT TCTTGACGCA
GAGCTAGGTG ATGATTTTAG CTACTACCTT TCCGATGATC TTGCACAACC TATGCTCTTC
TTGCAGCTCA AAGGTCTTAA AAAGGGTGCA GCTCAAAAGT TCCGTGAGCT TGTAGAATCA
ACCTGCCAAA AAATTGCTAC CGAAGGCATT AATCAAAAGA AGCTCAGCGC TTCCATTGCA
CTTGCAGAGT TTAACCTGCG CGAAAATGAT CAGCCTTACT CAAACGGTAT TGAATACACG
CTGCGTTCGC TTTCAAGCTG GCTCTACGAC GATGCGCGCC CTCTAGACTA CATCCGCTAC
GAGGACGCCA TTGCTTACGT CAAAGAACTT GCAGCTCAAA GGGGCTTTGA GAAGCTGCTG
CTAGAGCTTA TCTGCAATAG CAAACATGCA GCTCAGGTCG AGCTTGTTCC CACAGACGAG
GGAGATGCCC AAGAAGAAGC CACCGAGCTG GAACAGCTTC GCTCCACACT TACCGATAAA
GACGTCGAGA AGATTCGTGC AGAAGTTGAG GCGCTCCGTC TGGAGCAAGA AACACCTGAC
GCTCCAGAAG ATCTTGCCAA GCTTCCGTCC CTCTCGCTCA GTGATATTGG TGCAGGTAGA
GAGCGCCCTG CTGGCTTTGA AGTTAAGGCT CCCCTGCCTT GCGTTGCACA CGAGCTGGAC
ACTCACGGCA TTGACTACGT CTACCATTAT TTTGATTTGA CTCACGCGGT TACCTTTGAG
GAACTGCCGC TTGTTGGTGT TCTTGCTGAG GTACTGGGCA AGCTTGATAC AGCTGCTCAC
ACTGCATCTG AGCTGGATAT TCTTATTGAG AGCAACCTTG GTCATCTCTC GTTCTTCACA
GATATCTACG ATCAAGACAC ACTTGACCAG GCATACCCTG CTTTTATTGT TGCAGCTAGT
GCACTCACCG AGAAAACCGA AGAACTTGCA AGCATTCCTT CTGAGGTCTG GTCCAGCACG
CGTTTTGATG ACCTAAACCG TCTAAAAAAC ATCCTGACTC AGCGTCGTAT TGCGCAAGAG
CAGTACTTTG TAGGTGCTGG TCATACAGCT GCGCAAAATA AGGCTTTGAC CTCGTATTCT
GCCGCTAGTC GCGTAAATGA TGCGCTGGCG GGTGTTGGCT TCTACGAGTA CCTAAAAAAC
CTGCTTTCCA ACTGGAATCA GCGTGCTCCT CAGCTTGCAA AAGACTTAGA TGCACTAACC
CACAAAATTT TCCGCGTAGA TAACGTTACC GTCAGCTTTA CTGGCTCCAT GCAAAGTCGA
GACGCATTCT GGAAAGTGGC AGGAGATCTC AACCTCAAGA AAAGTAATGA ATCGCAAGCC
GACAGCGCAA GGTCAACGCT TGTTGTCCCT GAGGGCAAAC TACAGCGCGT GGCATATATC
ATTCCATCAA ATGTCTCTTA TGTTGGTCTC TCCTATCCAA ACGTTGCCCA TGCAACCAAT
GAACAGCAGG GTGACTGGCT TATTGCTACC AAAGTTTTGG GCCTTGACTA CCTGTGGAAC
GAGGTCCGCG TCAAGGGCGG CGCTTATGGC GTCATGTTCA GAAACTCCAT CGCTGGCTTG
CAGAGCTTTG TCTCCTATCG AGATCCCTCG CTTGATGCAA CCCTTGACCG CTATGTTGGT
GCGGGTAGTT GGCTCTCTAA ATGGACTCCA GACCAGGACG AGTTTGAGGG CTACGTAGTT
GCCTCTGTTG CTGGCGTTGA CGCTCCTGTT CCCGCTCGTA TGCTTGCTCG CAGACAGGAC
ATTGAATACT TCAACCATCG TGATCCAGAG CGTCTTCTTA AACTTCGCGA GAAGATCCTT
CACGCCCAGG TTGAAGACAT CAAAGAGCTA GGAAACACTA TACCTCAAAG CCATGATGAC
CTCTCGGTTG TTGTCTTTGG TGCAAAAGAC GCCATTGAAG CTTCCAAGCT TGATCTTAAG
GTTGTTGACC TCTTTGGCGA TCAGGTGAAT TAA
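
The gene sequence above is displayed in space-separated groups of ten bases, so the grouping has to be stripped before the sequence can be measured or compared with the GC content field. The following is a minimal sketch, not the database's own method: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence only (not the full gene).
FIRST_LINE = (
    "ATGGCTCTGG AAGGGACAAG ATGGTCCAAA AACGGCATTA CTGCTCACGT AAGTTTTCGT"
)

if __name__ == "__main__":
    seq = clean_sequence(FIRST_LINE)
    print(len(seq))          # six groups of ten bases
    print(gc_fraction(seq))  # GC fraction of this line alone
```

Applying the same two functions to the full sequence block would give the total length and overall GC fraction to compare against the Gene Length and GC content fields.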
 
Protein sequence
MALEGTRWSK NGITAHVSFR IDWQLMTSKY LQMDSAFSVG TLHAEDKSFE IISAEWVNEI 
SGYAYIFKHV PTGGRLMWFA CDDDNRSFAI AFKTPPVDHT GVFHILEHSV LCGSDAYPVK
EPFVNLLKTS MQTFLNAMTY PDKTVYPVAS TNVADLENLM SVYLDAVLHP AIYKRKRIFE
QEGWHLEADD QGNLSYNGVV FNEMKGALSN PDRVLYDSVS EALFPDTAYG KESGGKPRAI
PKLTYENFLD AHARHYDLSN SYTFLYGDLD CERELSFIAQ RFAAAEKRDA GAPNPLNLQT
PVLPKPCQIH MNTTADNSSV GLGYVLGTPD QRNKMMAADI LFDTLMGSNE SPLKRAILDA
ELGDDFSYYL SDDLAQPMLF LQLKGLKKGA AQKFRELVES TCQKIATEGI NQKKLSASIA
LAEFNLREND QPYSNGIEYT LRSLSSWLYD DARPLDYIRY EDAIAYVKEL AAQRGFEKLL
LELICNSKHA AQVELVPTDE GDAQEEATEL EQLRSTLTDK DVEKIRAEVE ALRLEQETPD
APEDLAKLPS LSLSDIGAGR ERPAGFEVKA PLPCVAHELD THGIDYVYHY FDLTHAVTFE
ELPLVGVLAE VLGKLDTAAH TASELDILIE SNLGHLSFFT DIYDQDTLDQ AYPAFIVAAS
ALTEKTEELA SIPSEVWSST RFDDLNRLKN ILTQRRIAQE QYFVGAGHTA AQNKALTSYS
AASRVNDALA GVGFYEYLKN LLSNWNQRAP QLAKDLDALT HKIFRVDNVT VSFTGSMQSR
DAFWKVAGDL NLKKSNESQA DSARSTLVVP EGKLQRVAYI IPSNVSYVGL SYPNVAHATN
EQQGDWLIAT KVLGLDYLWN EVRVKGGAYG VMFRNSIAGL QSFVSYRDPS LDATLDRYVG
AGSWLSKWTP DQDEFEGYVV ASVAGVDAPV PARMLARRQD IEYFNHRDPE RLLKLREKIL
HAQVEDIKEL GNTIPQSHDD LSVVVFGAKD AIEASKLDLK VVDLFGDQVN