Gene Apar_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1041 
Symbol 
ID8413914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1176037 
End bp1177770 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content42% 
IMG OID645022630 
ProductABC transporter related 
Protein accessionYP_003180060 
Protein GI257784843 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTATCTG TCTTAAAAAT GTACTTTCAG TTTGGAAGCT CTTATCGCCA GAAACTCTAC 
AAAGGTCTTT TCTTTACCAT TCTTGGTTGC CTTTTTGAAG GCGTACAAAT AACGGCGCTT
TGGATCGTTT TTACCGCTCT TACTACTAAC ACACTCTCAA CCCAGACGAT ATTTTCTGCA
CTGGGAGTAA TGCTTTTAAG CATTATGGGG ACGTTCGTTT GCGCTCACTT TAAGAGTGAG
AATTTCTGTG ATGCTAATTT CAGTATGGCT GGTGCAAAGC GTGCAGAAAT TGGAGATACA
TTACGTCGCC TTCCTATGGG TTATTTTAAC GAGAATAGCC TTGGAGAAGT AACAGCTGTT
ATGACTAATC AGCTTGACGT AATGCAAAAT CTTGGTGGTC TGCTTTATAT GATGGTTGTT
GGAGGTTTGG CTTTAACAGC AATTATTGTT GCATTCTTGT TCGTGTTCTG TTGGCAATTA
GGTCTTATAA CTGCAGCTAC TTTTGTTTTC TTCTGTATAA CAATGGAGCT GCTTCAGGCT
TACGTACGTA ATACATCAGA TGATTATGTG GCTGCCAACA CTACCTTGAT CAGCTCCGTA
CTTGAGTATG TCCGTGGCAT TAATGTGGTA AGGTCATTTT CTCTTATTGA CGATGCTGAG
GGAAAATATG CCAAAGCTGT TGATGATTGT CGTGTTCAAG CACTCAAACT TGAATTTAAG
GCACTTCGTT TCTCAGTTCT GCAAATGGTT GTTTCTAAGG CTACCAGCGT TATTATGTGC
TTGGTATCAG TTGAATTATG GCTATCGGGA ACACTTGATA CAGCTTCTTG TTTGACGGTA
GTAGTTATGT CATTTATGTT GTTTAGTCGC TTAGAATCAG CTGGTCGTTT CTCAACTATT
TTACGCAACC TTGAAATTGC AATGGAGCAA ACAAATGCTA TTCTCGCTAC TCCTGCAATG
GAGGAGGGAG AAGGTCTTGA GGAAGCGGCA TCCTGTGACA TAGAGCTTTC TCATGTGTCG
TTTGGCTACG ACGATAGGCA AATTCTCGAA GATGTAAGCC TTTCTATTCC TGCGGGTACT
TCTTGCGCAA TTGTAGGACC AAGTGGATCT GGCAAAACAA CGCTTGTCCG TCTTATTGAG
CGTTTTTGGG ATGTGAATAC AGGACAAGTG TCACTTGGTG GACACGATGT ACGCGACTAT
AAGGTTGATG CTCTTCTTCA AAATTTCTCT ACCGTTTTTC AGGGAGTATT TCTTTTTGAT
GACACTATTG AGAACAACAT TAAGTTTGGT AATCCAAGTG CAACTCATGA GCAAGTAGTC
GATGCAGCTA GGCGCGCCTG TTGTGAGGAG TTTATCCAGG CATTACCTAA TGGATATGAA
ACACGCTTAG GTGAAGGTGG TTCAATGCTT TCTGGTGGCG AGCGCCAGCG TCTTTCTATT
GCGCGTGCCA TCTTAAAAGA TGCACCGATT GTTGTACTTG ATGAGGCTAC GGCTAATGTT
GATCCGGAAA ATGAACTTGA GTTGCAGCAT GCAATTGCAG AGCTCACAAA GTCAAAGACT
GTCATTATGA TTGCTCATCG CTTAAAGACA GTTCGCAATG CAGATCAGAT ACTTGTGTTA
GATAAAGGTC GTATTGTGCA AAGAGGCACT CACGAATCTC TCATGGCAGA GGGCGGTATT
TACGCTGATT TTGTTAACAT GCGCGAGAAA ACCGTTGGCT GGAAAATTGC ATAG
 
Protein sequence
MLSVLKMYFQ FGSSYRQKLY KGLFFTILGC LFEGVQITAL WIVFTALTTN TLSTQTIFSA 
LGVMLLSIMG TFVCAHFKSE NFCDANFSMA GAKRAEIGDT LRRLPMGYFN ENSLGEVTAV
MTNQLDVMQN LGGLLYMMVV GGLALTAIIV AFLFVFCWQL GLITAATFVF FCITMELLQA
YVRNTSDDYV AANTTLISSV LEYVRGINVV RSFSLIDDAE GKYAKAVDDC RVQALKLEFK
ALRFSVLQMV VSKATSVIMC LVSVELWLSG TLDTASCLTV VVMSFMLFSR LESAGRFSTI
LRNLEIAMEQ TNAILATPAM EEGEGLEEAA SCDIELSHVS FGYDDRQILE DVSLSIPAGT
SCAIVGPSGS GKTTLVRLIE RFWDVNTGQV SLGGHDVRDY KVDALLQNFS TVFQGVFLFD
DTIENNIKFG NPSATHEQVV DAARRACCEE FIQALPNGYE TRLGEGGSML SGGERQRLSI
ARAILKDAPI VVLDEATANV DPENELELQH AIAELTKSKT VIMIAHRLKT VRNADQILVL
DKGRIVQRGT HESLMAEGGI YADFVNMREK TVGWKIA