Gene Apar_0143 details


Gene Information

Locus tag: Apar_0143
Symbol:
ID: 8412989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 163045
End bp: 164574
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 48%
IMG OID: 645021713
Product: ABC transporter related
Protein accession: YP_003179170
Protein GI: 257783953
COG category: [R] General function prediction only
COG ID: [COG3845] ABC-type uncharacterized transport systems, ATPase components
TIGRFAM ID:
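
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 163045
end_bp = 164574
reported_gene_length_bp = 1530
reported_protein_length_aa = 509

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
total_codons, leftover_bases = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length_aa = total_codons - 1

print(gene_length_bp)     # 1530
print(leftover_bases)     # 0
print(protein_length_aa)  # 509
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.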


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAAGTCA GTTCGGATTA CGCTGTTCAG ATGCATGGCA TTACCAAAGT TTTTGGATCG 
TTTAAAGCTC TTGACGCCGT AGACCTTAAC GTGCGCAAGC AAACTGTCCA CGCCATTTTA
GGAGAGAACG GCGCAGGTAA AAGTACGCTC ATGAACGTAC TGTATGGCCT GTATTCTGCT
GATGAGGGCG AGGTTTACCT CAACGGAGAG CGTGTATCCA TATCTGATCC AAACGATGCT
ATCGCTCACG GTATTGGTAT GGTTCACCAG CACTTTATGT TGGTCGAGAA TTTTACTGTT
ACAGAAAATA TTGTTTTGGG TAATGAGGTC ACCAAAACCG GTGGCATTCT TGATCCAAAA
CGAGCTCGCG AGAAGGTCCT TGAAATTGTT GAGGAATACG GCTTTGACGT AGATCCTGAC
GCTAAGATTG AAGACATTTC TGTTGGTATG CAGCAGCGTG TTGAGATCTT AAAGGCCCTA
TATCGCGGTG CTGATACGCT GATTCTTGAT GAGCCTACGG CAGTGCTTAC GCCACAGGAG
ATTGAGAAGC TTATCCAGAT CATGCATGAC CTGGTAAGCA AAGGTAAAAC CATCATTGTT
ATTACTCACA AGCTTAAAGA GATTATGTCA TCTGCAGATG AATGCACTAT TATTCGCCGC
GGTAAGTACA TGAGCACTGT TGATGTCTCC AAGACATCAG AGACTGAGCT TGCAACACTT
ATGGTGGGTA GAAACGTTAA CCTGCATGTT GAAAAGAAGC CAGCAACTCC TGGTGAGGTT
GTGCTTTCTA TTAAGGATCT CCACGTCAAG GATGAGCGTG GTATTGAGCA GGTAAACGGC
TTTAATTTGG ATATTCGTGC CGGCGAGATT GTTGGTCTTG CGGGTATCGA CGGCAACGGT
CAGAAAGAAC TCGCCGATGC CATAAACGCA ATGGTCAAGC CCGAGTCGGG CACCATCACC
GTCAAAAATG AAGAGATTCA AGGTACAACT CCTAAGACGG TCATTGATCA TGCGGTTGCA
ACCATTCCTT CAGACCGTCA TCGTTGGGGC TTGGTCCTGC CATTTACGGT TGCCGAGAAC
ATGATTCTTG AGCGCCACAA TGAGGAGATT TTTGGCAAGG GCATTGCGCT TGATTTGGCA
AAGATGAAGG AATTCTCTCA GAAGTTGATT GACGAGTTTG ATATTCGCCC TGCAGAGTGC
TCCGATCATC AAGCAGCAGG ACTTTCTGGT GGTAACCAGC AGAAGGTTAT TATCGCCCGA
GAGGTCTCTT CCAACCCAGA CGTTCTTATT GCCATCCAGC CAACTCGCGG CCTTGACGTT
GGTGCAATTG AGTTTGTTCA CAAAGCGCTG ATTCGCGAGA GGGACCGTGG AGCAGCAATT
TTGCTGATTT CCTTTGAGCT GGATGAGATT ATGGACGTTG CCGATAAGAT GGCAATTATT
TACGCCGGCA AGAATGTTGG CGAGTTTGAC CAAGGTACTA TCACTGAAGA GCAGGCTGGC
CTGCTGATGG CAGGAGGTGA CGCCGAGTGA
 
Protein sequence
MEVSSDYAVQ MHGITKVFGS FKALDAVDLN VRKQTVHAIL GENGAGKSTL MNVLYGLYSA 
DEGEVYLNGE RVSISDPNDA IAHGIGMVHQ HFMLVENFTV TENIVLGNEV TKTGGILDPK
RAREKVLEIV EEYGFDVDPD AKIEDISVGM QQRVEILKAL YRGADTLILD EPTAVLTPQE
IEKLIQIMHD LVSKGKTIIV ITHKLKEIMS SADECTIIRR GKYMSTVDVS KTSETELATL
MVGRNVNLHV EKKPATPGEV VLSIKDLHVK DERGIEQVNG FNLDIRAGEI VGLAGIDGNG
QKELADAINA MVKPESGTIT VKNEEIQGTT PKTVIDHAVA TIPSDRHRWG LVLPFTVAEN
MILERHNEEI FGKGIALDLA KMKEFSQKLI DEFDIRPAEC SDHQAAGLSG GNQQKVIIAR
EVSSNPDVLI AIQPTRGLDV GAIEFVHKAL IRERDRGAAI LLISFELDEI MDVADKMAII
YAGKNVGEFD QGTITEEQAG LLMAGGDAE
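
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with the record's translation table 11, where GTG can act as an initiator. The following is a minimal sketch of how the two sequences relate, not the tool that produced this record. The function `translate_cds` and its `first_is_start` parameter are hypothetical names; the codon table is written in the usual TCAG ordering, and the start codon set is what table 11 is understood to allow.

```python
from itertools import product

# Amino acid assignments in TCAG codon order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: initiator codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna, first_is_start=True):
    """Translate a DNA string codon by codon (hypothetical helper).

    Whitespace is ignored so the 10-base blocks from the record can be
    pasted directly. A trailing stop codon is dropped from the result.
    """
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # An initiator codon is read as methionine regardless of its usual meaning.
    if first_is_start and codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Remove the terminating stop, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases and last 30 bases of the gene sequence above.
head = "GTGGAAGTCA GTTCGGATTA CGCTGTTCAG"
tail = "CTGCTGATGG CAGGAGGTGA CGCCGAGTGA"

print(translate_cds(head))                        # MEVSSDYAVQ
print(translate_cds(tail, first_is_start=False))  # LLMAGGDAE
```

The two outputs match the first ten and last nine residues of the protein sequence above. The tail fragment is translated with `first_is_start=False` because its first codon (CTG) is internal to the gene and should be read as leucine.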