Gene Apar_0887 details


Gene Information

Locus tag: Apar_0887
Symbol:
ID: 8413753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 989169
End bp: 991436
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 44%
IMG OID: 645022470
Product: hypothetical protein
Protein accession: YP_003179907
Protein GI: 257784690
COG category:
COG ID:
TIGRFAM ID:
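
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 989169
end_bp = 991436
gene_length_bp = 2268
protein_length_aa = 755


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one trailing stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions the record is internally consistent: 991436 - 989169 + 1 = 2268 bp, and 2268 / 3 = 756 codons, one of which is the stop codon, leaving 755 residues.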


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.069318
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCGTC AGGAAAATAA TCCAGAAGCT AAACAACCTG GAAGAAACAG GCTTGGAAAA 
CTCTATCCGT CTCTCGTCAA TGAAGACGTT CTGCCTGATG AGTCTCTAGA CGCTGCTCAA
AATAATCAAC ATGTTGGTTC TATTGACCAA ACTGGTTGGC ATACAAAACT AAATATTGAA
GCGCTTTCTC TTACTCTTAC GGAATCCACT TCTCCACTTG AAGTACTTAA ACATTTTGTA
AATACAGCAT TTTTGCGTGA AAAAGATTCA CACCGAAAAG GTGGTTCTGT TGCCATTCCG
CCAAGTGATT TTGAACTTCA TCTAGCTCAA AGACTAAAAG ATGCTGGTGT CCTTGAAGTG
TGTGGCATTA CCGTCCCCTG CCGTGTTGTG CGTCTGCGCA CCAATGGTCT TTTTTATCTG
CGTGCTAATC AAAAAGCACT TTCCTACAAG CAGAAAATTC AGCTTCTTAC TATTGAAAGC
GCATTAAACA CTGCCCTTTT TGCAGAGCGA TATTTTCTTG ATGCAAATAT GGCATCTGAA
CATGACTTAC TTGCCTTTGA ACAAAAGATT GTTTCTACCA TGATGAAACA ATCTAGGTCG
CGCTGCGCCA CAACCCAAAG CACTCAAGAA GATGGCGAGT GGGTTGTCCG CAAAGCCATT
TCCCTACACA TTGAGAGCCT GCGTCTATCT CAGCGTCTTA TCACTGAGTT TAGAACCAAC
GTACACAAGG GCATTGCTGC CTTTAGAGTA CATCTAGCGC TTCCTGAACA ATTTCCAAGA
AGCTTTCTTT ATGCAGATGG AAGCATCAAA GAAGCCAACT TTGATGACCT TTGTCGTGCA
GCCACTGCTT ACAACATGCA CCTTGGCATG CTCATTACCG CAAGTGCCTT TAGAAGCTCC
CTGAAGATCA ATGAGGTCTG GCTTTCTGGC ATTTGCGACA CAGACAAACT GCATGCATGT
CTTTTCTCGT TCCACATTAC ACGCAAGCAA TTTGAAGATA CGAACATTAC TGAAGAGACT
GATTCACTTT CTATCTACAA GCTATGGAAC GCACGAATTG ACGAGGTAGA CGGGATTCTT
CATGCAGTAC AACCTCTCTT TACCTTAAAC AATGAGATTT TCTGTCCACC AAGTAGATAC
GATTTTGTTG AGTCTTCACA AAAACGACTT GATAGCGTCG CAGCTCACGC ACTAGGCACT
GATGAGGTGT CAGGACTGTC TATTGATGAA GGTCAAGCGC GAGAAGAGAT TATCACCAAA
ATTCTGCGTA ACCTCTCTTC TTCCACAGAA GAAAACGTCC GCACCATATT GTCGTATACC
GCAAACTCCA CCGATCTTAC GGTGCGAGCA GCAGGAGAAC GCGTGGTTTC TCGCTTTATT
AAGGGTACCC TCGCAGAAGA TGATTTTGAG GCCATTTACG AAGAATTTGT TGATGGTGGT
GTTCTTTCCT TAGCGGTTAC CAAAGCTCAG CGTCTCATTG GCGAGAAAAA CTATCAAGAC
GCAGAGCTAC TCCTGCTTGA AGCACTTGAC TCCGTTGACG GAAATAATAC ATACACGGAT
ACAAGCACTA CGCAGTGGCG CGTGTTTATG AACTACGTCG ATCGCGTACT CTATAACAGA
ATTCTCTCTA ACCCAGAAAA AAATACTCGC CTGGCTCCTA TGTCCTACTT TGAAGCGCAC
CTTCTAGTTT CTATTGCACA AGCGCTCCAA GGACAAATTC AGAATGCTCT TCGGCACGCA
AGAAGAGCTG TTGAGATTGC GCCACTAAGC ATGAGTGCTA GGCTACACCT CTCCCACTGC
CTTGAAGAGT CACTGGATAT TAATGCTGCA GCAGATGAGC TAAAGAGGCT TCTGTTGCTT
GCACATGACC CTGAGAGCAT TGGATTTGGC TATTATCGTA TGGCATTCTT CCAATGGAGG
CTCAGACACC TGAAAGCCGC ACAAGCTTGT TATCAGTTTG CCTTGAAGTT TTTGCCAGGT
GCTACGTTCA TCATTGGCAC AGAAACAGCA ATTCTCTCGC AAACTGAGCC TGAGTTTAGT
TGTGACGATA TGAGCGACGA AGAAATCTAT TCCACACTGA TGCAAGAGCA AATTCCTATC
GCGCCAACCA ACGAAGTCTC TACTATTTTC TTAGATGGTA CACGTGCCTC ACTTGATGCA
GAACTCTTTA GTGTTGCTAA GAACTTCATT ATGAACCTGG GCATTATGAC TAAAGACGAT
ATCTACTTTG ACATGCTCAA CTCCATTGAA GGAGAGCCTG ATAGATAA
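
The GC content reported for this gene can be recomputed from the sequence above. The sketch below is one way to do that, assuming the sequence is pasted as plain text with the 10-base spacing shown here; `gc_content` is a hypothetical helper, not a function of the source database.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, copied from this record.
first_row = "ATGACTCGTC AGGAAAATAA TCCAGAAGCT AAACAACCTG GAAGAAACAG GCTTGGAAAA"
print(f"GC content of the first 60 bases: {gc_content(first_row):.1f}%")
```

Passing the full gene sequence instead of `first_row` gives a value that can be compared with the GC content field of the record.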
 
Protein sequence
MTRQENNPEA KQPGRNRLGK LYPSLVNEDV LPDESLDAAQ NNQHVGSIDQ TGWHTKLNIE 
ALSLTLTEST SPLEVLKHFV NTAFLREKDS HRKGGSVAIP PSDFELHLAQ RLKDAGVLEV
CGITVPCRVV RLRTNGLFYL RANQKALSYK QKIQLLTIES ALNTALFAER YFLDANMASE
HDLLAFEQKI VSTMMKQSRS RCATTQSTQE DGEWVVRKAI SLHIESLRLS QRLITEFRTN
VHKGIAAFRV HLALPEQFPR SFLYADGSIK EANFDDLCRA ATAYNMHLGM LITASAFRSS
LKINEVWLSG ICDTDKLHAC LFSFHITRKQ FEDTNITEET DSLSIYKLWN ARIDEVDGIL
HAVQPLFTLN NEIFCPPSRY DFVESSQKRL DSVAAHALGT DEVSGLSIDE GQAREEIITK
ILRNLSSSTE ENVRTILSYT ANSTDLTVRA AGERVVSRFI KGTLAEDDFE AIYEEFVDGG
VLSLAVTKAQ RLIGEKNYQD AELLLLEALD SVDGNNTYTD TSTTQWRVFM NYVDRVLYNR
ILSNPEKNTR LAPMSYFEAH LLVSIAQALQ GQIQNALRHA RRAVEIAPLS MSARLHLSHC
LEESLDINAA ADELKRLLLL AHDPESIGFG YYRMAFFQWR LRHLKAAQAC YQFALKFLPG
ATFIIGTETA ILSQTEPEFS CDDMSDEEIY STLMQEQIPI APTNEVSTIF LDGTRASLDA
ELFSVAKNFI MNLGIMTKDD IYFDMLNSIE GEPDR
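
The protein sequence can be checked against the gene sequence by translating the CDS. The sketch below builds the codon table by hand rather than relying on a library; it uses the amino acid assignments of NCBI translation table 11, which are identical to those of the standard code (the tables differ only in permitted start codons). The `translate` helper is hypothetical, and it assumes the sequence is given 5' to 3' in frame, as printed in this record.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third position varies fastest),
# paired with the amino acid assignments of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Whitespace is ignored. Alternative start codons are not special-cased,
    which does not matter here because this gene starts with ATG.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGACTCGTC AGGAAAATAA TCCAGAAGCT AAACAACCTG GAAGAAACAG GCTTGGAAAA"
last_row = "ATCTACTTTG ACATGCTCAA CTCCATTGAA GGAGAGCCTG ATAGATAA"
print(translate(first_row))  # prints: MTRQENNPEAKQPGRNRLGK
print(translate(last_row))   # prints: IYFDMLNSIEGEPDR
```

Both fragments match the start and end of the protein sequence listed above, and the final TAA codon of the gene is the stop codon that is not counted in the protein length.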