Gene Apar_0188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0188 
Symbol 
ID8413036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp221587 
End bp223113 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content48% 
IMG OID645021760 
ProductLPXTG-motif cell wall anchor domain protein 
Protein accessionYP_003179215 
Protein GI257783998 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4932] Predicted outer membrane protein 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0846582 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAC GACTGTTAAA GCTTTTTAGT TTTGTCCTCG TTCTTGCTCT ATCGCTTCCT 
CTTGCAGTGA AATTAGCTTT TGCTGAGCCC ACAAGCGTTG AAGGCGCAAC GGGCGCCCCT
AAGGCGGTTG TTGCAACCAT TACAGACTTT AGAATTGAAG ACCGTTTTGG TAACCCAACC
ACTGTTGTCA ACAAGGCAAA TCTTTATGCC ATTGCAATGA ACTGGGACGC CTCTGCTAAT
GCTCATGTTG AGCCTGGTGA CTACTTTGAC GTCACCCTTC CTGACACTAT GAGGTTTACT
GCGGCTCATC CTGCATCTAC CTTCAACATC ACTGATGCAG CAACTGGTGA GGTTATGGCA
GTTGCTCACG TCAGCCCAGG TGCTGATGGT TTTGGCGGAA CCATGCGTGT TGTCTTTACC
GACTACGTTA ACAATCACAC TGACCTGCGT GGTAGTGTCC GTCTTGGCTT TACCATTAAT
TCAGAACGTA TTCAAACAGG TACCGGCAAG ACCTTTAACT TTACTGTTTC TGGCAACATT
GTTCCTGTAA CGTTTGACGT CACCGCTCTT GGCGTTATTG ATACCGAGTA TCTCTATAAG
TGGGGAAAGG TTATCGAAGG CAACGCAAAC GAGATTCAGT GGACTGGTCG TATCAACTTC
TCCCGCGGTA ACTTCAACAA TGTCCACCTT CACGATCAGC TCCTTGACTG GGGTGGCGGA
GACCTTCCTG CAGAGATTAC CTACGTTCCA GGCTCTTTTG AACTTTGGAC TGGTGTCTTT
GATGAGTACG GTTCAACCAT TCCAGGTACC GCTCACCGTG TTCCAATTAC CGACGACATG
ATTACCATTT CTCCTGATGG TAAGTCGTTT GATCTTGATC TTTCCGGCGT AGACTTCAGT
AATGGTCAGA GCTATAAGTT TATGTATCGA ACCACCTACG TTCCTGGCGT TGCCCTGCGT
AACCTCATCA AGATGTATTC CGATAAACCT GAGTACAGCT CTGACTGGGT CTGGCGTAAT
GCAACTTCCA GTGGAGAAGG TACTTCTACT ATCGTCGCTC GCATCCGCGT CATTAAGGTT
GATAAGGATG ACCACAACAC CAAGCTTGCG GGCGCTGTTT TTAAGGTAAC CAGCGTTGCT
GATCCAACCA AGACCTGGAC TATCACCACA GGTGAGGACG GCACTGCTAC TACCGAAAAG
CTTCCTGCAG GCACTTACAC TGTTCAAGAG ATTACCGCTC CTAACGGTTA CGAACTTAGT
ACTGATACCT ACAACTTAAC CGTCTCTGCA ACAAGCGGCG TTATCAAGAC TGTTGAGAAC
GAGAAAACCC CAACCGTTCC ACCAACCACC CCGCCGAACA CTCCACCTGC AGCTCCTCCA
TCAGAGACTC CAAAGAAACC TAAAAAGAAG CAATCTAAGC TTCCAGAGAC TGGCGAAGTT
TCCGAGATTG CTCTTGTTGC AGTAGCTACT GTTGGTAGCG TTGCTTCAAC TCTTGGCTAT
GCTCTAAAGA ATCGTCACCG TAAGTAA
 
Protein sequence
MNKRLLKLFS FVLVLALSLP LAVKLAFAEP TSVEGATGAP KAVVATITDF RIEDRFGNPT 
TVVNKANLYA IAMNWDASAN AHVEPGDYFD VTLPDTMRFT AAHPASTFNI TDAATGEVMA
VAHVSPGADG FGGTMRVVFT DYVNNHTDLR GSVRLGFTIN SERIQTGTGK TFNFTVSGNI
VPVTFDVTAL GVIDTEYLYK WGKVIEGNAN EIQWTGRINF SRGNFNNVHL HDQLLDWGGG
DLPAEITYVP GSFELWTGVF DEYGSTIPGT AHRVPITDDM ITISPDGKSF DLDLSGVDFS
NGQSYKFMYR TTYVPGVALR NLIKMYSDKP EYSSDWVWRN ATSSGEGTST IVARIRVIKV
DKDDHNTKLA GAVFKVTSVA DPTKTWTITT GEDGTATTEK LPAGTYTVQE ITAPNGYELS
TDTYNLTVSA TSGVIKTVEN EKTPTVPPTT PPNTPPAAPP SETPKKPKKK QSKLPETGEV
SEIALVAVAT VGSVASTLGY ALKNRHRK