Gene Apar_1262 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1262 
Symbol 
ID8414141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1415174 
End bp1416601 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content42% 
IMG OID645022854 
ProductLPXTG-motif cell wall anchor domain protein 
Protein accessionYP_003180278 
Protein GI257785061 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4932] Predicted outer membrane protein 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAA TAGCTAAGAG TATTTTTAGT AAGATTTGCT GCCTCTTTTC AGCAGCTCTT 
CTAATTTACT CACTGGTCCC GGTTCAGGCT TTTGCTAATC CAACAAGCGG CACTGATGTT
GCGGATAGCA AGGTTACTAT TAATGGTCTT GAGTCTGGCG ATGTTGTAAG TGCATATTTA
ATTGCTGATG CAGACATTGA TGCTGCAAAT AACTTAACGT ATAAGATGGC TGACGGTCTG
CCAAGTGCTT ACAATACAAT TGATAAGATT GCAGCTGTTG CTACTGACGG TTATACCTTT
ACTCAGGGTA CAGATATGCA AAATGCTGCC GCAGCAATTG CTGGTGCCGT TACAGCAAAT
CCAGCTGCTG CTACCGCAAC GGCTGGCAGT GATGGCTCCG CTAAGCTTAC TCTTGGAAGT
GGCTACTATC TCGTTCGTGT TACCACCGCA AGTGGCAAGA CTCGCGTTTA TCAGAATATG
GTAGTTGACG TAACTCCTAA GGTTGATGGC GGAACATATA AGTCTCGCGA TGTTGCTCCT
ATTGATGTTA AGAAGACTGA CGTAACTGTT AAGAAGACTG TTGGTTCTGA GTATAAAGAG
TCAACCGATA AGTACAGCGT TGGTGATAGC GTCCCATTCA AGATTAATAC TGCTGTTCCT
AATTATCCAA AAGATTCTAA GAACGCAACT TTTACTATTG GTGATACACC TTCTGCTGGT
TTAAAGATTA AGACTGACAC CATTAAAATT AATGGTCAGA AAGCTGTATC AGGTGCAGAC
TACACTTTGA CAGCTTCTGA AACTGGCTAC ACCATTGAGT ATTCTAAAGA CTATGTTCTT
GCTCATCCTG GTGAGGCAAT CGAGGTTACT TATGAGGCAG AGCTTACTTC CGATGCTTTC
TCTCATAGCG CTACTGATGT AACTGGAAAC ACTGCTACTG TTACTTTCAA CCCAAATCCA
TATGAGAATA AGACTGTTAC TCCTAACAGC AACACTACAG TAAAGACATA TGGTTACGTC
TTTAAGAAGA CTGATCCAGA GGGCAATCCT CTTCAGGGTG CTACATTTAC CCTTACGCTT
GATAACGGTA AAGTACTCAC TTCTACTTCT GATGCAAACG GTTATGTTTA CTTCTCTGGT
CTTGCAGCAG GTCACTATAA GATTTCAGAG ACTGGCGTTC CTTCTGGATA CACCAAAGTC
AACGACATTG AGTTTGATTT AAGCGATACT ACTGCTACTG CTGATAACCC AGCTACAACT
GATGTCGAGA ACAATTATCT GGTTAATTCA CAGAATGTTG TCGATAATAG GCAGCCAGTT
CTTCCAGTAA CCGGTGACGC TGGAACCTTT ATGTTCACTG CTATTGGAAC CGTTCTGCTT
GTTGCTGGTG TAGGCGCTAT TGTGTATTCT AGAAAGCAAC GCGCATAA
 
Protein sequence
MKSIAKSIFS KICCLFSAAL LIYSLVPVQA FANPTSGTDV ADSKVTINGL ESGDVVSAYL 
IADADIDAAN NLTYKMADGL PSAYNTIDKI AAVATDGYTF TQGTDMQNAA AAIAGAVTAN
PAAATATAGS DGSAKLTLGS GYYLVRVTTA SGKTRVYQNM VVDVTPKVDG GTYKSRDVAP
IDVKKTDVTV KKTVGSEYKE STDKYSVGDS VPFKINTAVP NYPKDSKNAT FTIGDTPSAG
LKIKTDTIKI NGQKAVSGAD YTLTASETGY TIEYSKDYVL AHPGEAIEVT YEAELTSDAF
SHSATDVTGN TATVTFNPNP YENKTVTPNS NTTVKTYGYV FKKTDPEGNP LQGATFTLTL
DNGKVLTSTS DANGYVYFSG LAAGHYKISE TGVPSGYTKV NDIEFDLSDT TATADNPATT
DVENNYLVNS QNVVDNRQPV LPVTGDAGTF MFTAIGTVLL VAGVGAIVYS RKQRA