Gene Apar_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1042 
Symbol 
ID8413915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1177785 
End bp1179584 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content43% 
IMG OID645022631 
ProductABC transporter related 
Protein accessionYP_003180061 
Protein GI257784844 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAGC GTCAAAACCA AAATGTTCAA GCTACTGAGT CCAACCAAAA CACAAAACAG 
GATTCTCAAC AATCCGAAAT AAATCCTGTT ACTGTGCTGC TTGAGTGGTC AGATAGAAAA
GGTATGTATG CGCTGTCTGT CATTCTTGCC ATTATTGGTG TTTTTGGTGG TATGGTTCCC
TATGTTGCTG CAGGCAACAT GGTAGTTGGC ATCTTTAATG GAATTCAAGA ATGGCCGTTT
TACCTTCATT GGGGACTTAT AGCGGGCTTC AGCTATTTAG TTAAAATAAT TTTCCATCAT
CTTTCAACTA TCGTTTCTCA TAAGGCAACG TTTGCAACAA TTGCAAACAT GCGCACTCGA
GTAGCAAAGA AGCTGACGAG CATCCCAATG GGATATGTTC TTTCTACTCC TTCTGGTTCT
CTAAAACGCC TTTTAGTCGA GAAAATCGAT AGCATCGAAA CTACGCTTGC TCATGCAGTT
CCCGAGCTTA CTTCAAACCT TACTGCTGCA TTTGCAGTTT TTGTTTATTT GATTATTTTG
GATTGGCGTC TTGCCCTTGC TGCATTACTC ACCATTGTCG TAGGACTTGC ATGTTTATCA
GGCATGATGA AAGACTATGA GTACTGGTAT CAGAATACTA TTGTTACTGG TAGAGAGATG
AATAATGCCT CAGTTGAGTA TGTGTTAGGG ATTGAGGTAA TTAAAGCTTT TGGACAGTCG
GCAAGTTCTT ACGAGAAGTT CTCTAAAGCG GTTTACGCCT CTGCTCATGC TTTTATTGAT
TGGATGAGTC ACTGTCAAAT TTGGCAAGAT GCCATGCTAT CAATTGCCCC AGCAACATTA
GTAACGGTAT TGCCCTTAGG ATGCTTATTT GTACTTCAGG GAACTCTTTC TCCTTCTACC
TTTGTCATGT GTGCAGTGCT TTCTTTAGGC ATTTTTCAGC CGCTCTTTGA AGCAATGAGC
TTTATGGATT CTCTTGCACA GGTTAGCTCT GTTGTTAAGG AAATTTCTGA GGTTTTGAAT
TACCCAGATC AGATTAGAGC TGAGGTTCCT TGCACACTTA CGGGAACCGA AATTAAACTG
AGCGATGTTC GTTTCTCGTA TGGAAGCAAT GAGGTAATTC ATGGGATTAC CTTAGATATT
AGCCCAGGAC AAGTTACGGC GCTTGTTGGT CCAAGTGGAT CTGGTAAATC AACTGTTGCA
CGTCTTATTG CTGGTTTCTG GGATCCAAAT GAAGGTTCTG TATCTATTGG TGGACAAGAT
ATTCGCTCCT GTACTCCATC TCAACTTGCA GATCTTGTAG CTTACGTTGA TCAAGATAGT
TACCTTTTTG ATGAGACCAT TATGGAAAAC ATCAGAATGG GTAAAAAAGA CGCCACCGAT
GATGAGGTAA TTGCTTGTGC AAAAGCTTCT GGTTGTCATA ACTTTATTCA ATCGCTTCCT
CATGGCTATC AAACTGTCAT AGGAGGCTCA GGTGGGCATC TTTCTGGTGG AGAGCGTCAG
CGCGTGGCAA TTGCTCGAGC TATGCTCAAA GATGCGCCTA TCATCATGCT TGATGAAGCA
ACGGCCTATG CAGATCCAGA ATCGGAAGCA GAGGTTGAAT CTGCAGTTGC AAAGTTGGTA
GCTGGTAAAA CGCTGGTTGT TATCGCGCAC CGTCTATCAA CCGTACAAAA TGCCGACAAA
ATTGTTGTCG TTAATGATGG CTCTATTGAA GCTGAAGGTA CACACGAACA GCTTATGGAG
ACATGCCCGC TCTATGCAAC TATGTATCGC GCTCATATCG GCGCACTTGA TGAGGCTTAA
 
Protein sequence
MTKRQNQNVQ ATESNQNTKQ DSQQSEINPV TVLLEWSDRK GMYALSVILA IIGVFGGMVP 
YVAAGNMVVG IFNGIQEWPF YLHWGLIAGF SYLVKIIFHH LSTIVSHKAT FATIANMRTR
VAKKLTSIPM GYVLSTPSGS LKRLLVEKID SIETTLAHAV PELTSNLTAA FAVFVYLIIL
DWRLALAALL TIVVGLACLS GMMKDYEYWY QNTIVTGREM NNASVEYVLG IEVIKAFGQS
ASSYEKFSKA VYASAHAFID WMSHCQIWQD AMLSIAPATL VTVLPLGCLF VLQGTLSPST
FVMCAVLSLG IFQPLFEAMS FMDSLAQVSS VVKEISEVLN YPDQIRAEVP CTLTGTEIKL
SDVRFSYGSN EVIHGITLDI SPGQVTALVG PSGSGKSTVA RLIAGFWDPN EGSVSIGGQD
IRSCTPSQLA DLVAYVDQDS YLFDETIMEN IRMGKKDATD DEVIACAKAS GCHNFIQSLP
HGYQTVIGGS GGHLSGGERQ RVAIARAMLK DAPIIMLDEA TAYADPESEA EVESAVAKLV
AGKTLVVIAH RLSTVQNADK IVVVNDGSIE AEGTHEQLME TCPLYATMYR AHIGALDEA