Gene Apar_1113 details

Gene Information

Locus tag: Apar_1113
Symbol:
ID: 8413986
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1261027
End bp: 1264359
Gene Length: 3333 bp
Protein Length: 1110 aa
Translation table: 11
GC content: 45%
IMG OID: 645022702
Product: protein of unknown function DUF214
Protein accession: YP_003180132
Protein GI: 257784915
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4591] ABC-type transport system, involved in lipoprotein release, permease component
TIGRFAM ID:
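The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1261027
end_bp = 1264359
gene_length_bp = 3333
protein_length_aa = 1110


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Both checks compare the computed value against the field in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 1264359 - 1261027 + 1 = 3333 bp, and 3333 / 3 - 1 = 1110 aa, which agrees with the listed lengths.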


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACGCG CTCTTTTTAC CGAGATTATT CGGACTATCA AAGGGTCTCT TGCTAGATTT 
CTTGCAATTG TAGGAATCGT TGCTTTGGGC TGTGGTTTTT TTGCGGGACT TAAAATGGCT
AGCCCAGATA TGCAAGAAGC TGCACATACG TTTTACAAAA ATCAGCACCT CTACGATCTT
CGCATTATCT CAACACTTGG GTTAAGCGAG AAAGACGTCC ATGCCCTTGC TTCAGTTGAG
GGTGTTGAAG CGGTTATGCC TTCTCGCACG GTAGACGTCA TGGCTACGTT GACCTCATCG
CAATCTAGCG CTCGTGTTAG TTCGTTCAGA CCTGGTGAGC TTAACCAGCC GGTAGTTGTT
GAAGGAAGGC TTCCTCAGGG GCCTTATGAG TGCGTGATGA GTGCTGATTC CAAGAAACGT
ACAGACATTA GCCTTGGCCA TCAGATTGAG CTTCCTGAGA CTTCAAATGG CGTTCATTTA
AAGGGCGGAT CGTACACGGT TGTTGGTTTT GTAAATGCGC CAACGTATCC GTATGTAAGC
AATTTTGGTA CCACGTCATT AGGCAATGGT ATTGTTCAGC AATTTGTATA TGTCACTGAA
GATGCTTTTG CTAATGATGA TCCGTACACC GAAGTGTATC TGACGGTCCA GGGCGCCACC
AGTTATAAAA GCGGCTCATC AATGTATCAG AGTGCTATTG ATAGTGTTGC AGAGCGCATT
AAACAAATGA ATCCATCGCT TGCTTCTCTT CGTTTGCAAG AGCTAAAAGA TGATGCTCAA
GCTCAGGTTG ATGAGGCTCG TCAAAAGCTG GAGCAATCAA GGCAGGAGGC TGCAGATAAA
TTAGGTGATG CACAGAAAAA GCTTGATGAA GCGGAAGCAC AGATTTCTGC TCAGCAACAA
AGACTGACTG ATGGTCAGAA GCAATATGAC GCAGGACGTC AGCAGCTTAC AACTTTACGT
TCCAGTGCTG AGCAGAAATT TGCACAAGCG GAAGCTCAGA TTAAAGCTTC AGAGGCTCAG
ATAGCCCAGG GAACAGATGA GCTTAGCGCA GGAGAGGCTC AGTATCAGGC TGGTCTTGCT
GCTTTCAATG CAGGTCAAAC GACGTTTACG CAACAAAAGA GTGAATTTGA AGCGGGTCGC
GACGCGTATC TTACAGGGCT TGCTGCACAA GGGATTACGG CTTCAACACT TGAAGAAGCG
CAGCAGCAGC TGAGGGCTTT AGGTTTACCA ACCACGCAAG TAGATGCTCT TCTCGCTACA
CAGGCACAGA TTGTGGCTGC AGAAGCAGAA TTAGCTACTC AGCAGCAGGC TTTAGCGGCA
GCGCGAGGGG AGCTTGATCA ACGTACGGCT CAGCTGCGTG AGGCAGAGTC TCAGGTAGCT
CAGGCGCGCC AAAATCTGAC AGAAGCAAGA AATGCTACTG CAGATCAGTT ATATGCAGCT
CAAGAAAAAC TTGATGCATC GCTCAGCCAA TTGACTGCTG GACAAGCTTT GCTTCAAAAT
GCAGAATCTC AGACCTATGA AGGAAGACAG AGTCTTGAGG ACCAGCGCGT TGAGATTGAG
AAACAACTTG CTGACGGTCA GACAAAAATT GACGAAGCTC AAAAGAAAAT TGATGAGCTT
AAAGAGCCTG ATGTTTATGT ACTTGATCGT ACAAAAGAGA TTGGCGTTGC AGCTTATCAA
GCAGACTCTG AGCGTATTAA CAATATTGCT AATGTGTTCC CCTTGATGTT TTTCTTGGTT
GCAGCTTTGG TTTCACTTAC CAGTATGACG CGCATGGTAT CTGAAGAGCG TACGCTCATC
GGCGTTCATA AGGCGCTTGG TTATTCGACT TTACAAATTG CTGCAAAGTA CCTGTTATAT
GCTTTGCTTG CTTCACTTTT AGGAGCTGTT ATTGGCATAG CGCTTTTAAG TCAGGTGTTG
CCAGGGGTTA TTATCTCTGC ATATGGATCA ATTTATACTA TTCCAAATGG GGGAGCTCCT
TATCCTATTC AGCTTGACAG CGCGCTTCTT TCAGGACTTT TTGGTATAGG TGTTACGCTT
CTTTCAACGA TGGCTGCTGT TTTGTCTTCT TTGAAAGAAG AACCTTCTAG CCTTATGTTG
CCAAAGGCTC CAAAAGCGGG CTCAAAGATT TTGCTTGAGC ATATTAATCC TCTGTGGTCC
AGCTTGTCTT TTTCGTGGAA AGTTACTATT AGAAACCTTG TTCGCTATAA GCGTCGCTTG
ATTATGACAT TGGTAGGAAT TGCTGGATGT ACCGCACTTT TGTTGGTGGC CTTTGGTCTT
AGAGATTCTA TTAATGACGT CATTGACAGC CAATGGCCAA CTCTTTTTCA TTATGACTAC
ATAGTAGGAA TGACCTCTGA TGTTTCTGGT GCAGAAGCAG ATCAGATTGC TACAGAGCTT
AACCAGGTCG GTGCTACAAA TATCCATCGC ATAACCAGCG AAAATGTCCT TCTTGAGTCT
CCTGCGCAAA ATGCAAGTTC GTTGACAAGA ACAACTATTA TGACCAGTAA TTCTCTTCAA
GATCTTACAG GTGTAGTTAC CTTGCGGGAT CGTCTTTCTG GTCAAACAAT AGAGCTTAAG
GAAGATTCAG TTGTAATTAC AGAGAAGCTC GCAAAACGGC TTGGCGTAGG TGTGGGAGAT
GCCGTTCGGG TCTATGCTCA AGACAGAATT GGAAATGCTT CAGGTGAACC AAGTACCCTT
ACTGTTACGG GAGTTACCGA GAATTACGTT GGATCGTATC TTTATGTAGG GCCTTCTGCA
TGGCATTCTT TGAGCATCCA AGATCAAGCA ACGGATGGTT GGTATGCAAC ACTTCCTAAA
GATCAAGCAA CAAGAGATGC GTTTGGAGAA AAACTCATTA ACCGGGCAGG GGTTGCAACT
GTCGATGACA TCAATGAGGC TATTCGCACA TATAAAAAAT CACTTGAAGT GGTTAATCGT
GTTGTAGCTA TTCTTATTTT GGCTGCAGCC CTGTTGGCAT TTATCGTCTT GTATAACCTC
ACAAATATCA ATATTGAGGA GCGTATTCGA GAGATTGCCA GCCTTAAAGT GCTTGGTTTT
ACGCGACACG AAGTTGATGC GTACGTGTTT AGAGAGATTG CTTTACTGGC CGTCTTTGGA
GCACTCTTTG GGCTTGTTCT AGGAACTTAC CTTGAGGGAT TTGTGGTTCA AACAGCTGAG
ATTGATCTTG TTATGTTTGG CCGCTCTATT CATATGTCAA GTTATTGGTT TGCTTTTGGA
CTTACCCTTG TCTTCTCACT GTTAGTGTAT GTTGCTATGC GATCAAAGCT AAAAAATATT
GATATGGTGG AGAGTCTTAA GAGTGTTGAG TAG
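
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch of how the listed GC content could be recomputed from such text. The helper names are hypothetical, and the example string is only the first line of the sequence, so its GC percentage need not match the 45% reported for the full 3333 bp gene.

```python
def clean_sequence(text):
    """Remove all whitespace (block separators, newlines) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a whitespace-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from this record.
first_line = (
    "ATGCCACGCG CTCTTTTTAC CGAGATTATT CGGACTATCA AAGGGTCTCT TGCTAGATTT"
)

print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the complete gene sequence text to `gc_percent` instead of `first_line` would give the value to compare against the GC content field.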
 
Protein sequence
MPRALFTEII RTIKGSLARF LAIVGIVALG CGFFAGLKMA SPDMQEAAHT FYKNQHLYDL 
RIISTLGLSE KDVHALASVE GVEAVMPSRT VDVMATLTSS QSSARVSSFR PGELNQPVVV
EGRLPQGPYE CVMSADSKKR TDISLGHQIE LPETSNGVHL KGGSYTVVGF VNAPTYPYVS
NFGTTSLGNG IVQQFVYVTE DAFANDDPYT EVYLTVQGAT SYKSGSSMYQ SAIDSVAERI
KQMNPSLASL RLQELKDDAQ AQVDEARQKL EQSRQEAADK LGDAQKKLDE AEAQISAQQQ
RLTDGQKQYD AGRQQLTTLR SSAEQKFAQA EAQIKASEAQ IAQGTDELSA GEAQYQAGLA
AFNAGQTTFT QQKSEFEAGR DAYLTGLAAQ GITASTLEEA QQQLRALGLP TTQVDALLAT
QAQIVAAEAE LATQQQALAA ARGELDQRTA QLREAESQVA QARQNLTEAR NATADQLYAA
QEKLDASLSQ LTAGQALLQN AESQTYEGRQ SLEDQRVEIE KQLADGQTKI DEAQKKIDEL
KEPDVYVLDR TKEIGVAAYQ ADSERINNIA NVFPLMFFLV AALVSLTSMT RMVSEERTLI
GVHKALGYST LQIAAKYLLY ALLASLLGAV IGIALLSQVL PGVIISAYGS IYTIPNGGAP
YPIQLDSALL SGLFGIGVTL LSTMAAVLSS LKEEPSSLML PKAPKAGSKI LLEHINPLWS
SLSFSWKVTI RNLVRYKRRL IMTLVGIAGC TALLLVAFGL RDSINDVIDS QWPTLFHYDY
IVGMTSDVSG AEADQIATEL NQVGATNIHR ITSENVLLES PAQNASSLTR TTIMTSNSLQ
DLTGVVTLRD RLSGQTIELK EDSVVITEKL AKRLGVGVGD AVRVYAQDRI GNASGEPSTL
TVTGVTENYV GSYLYVGPSA WHSLSIQDQA TDGWYATLPK DQATRDAFGE KLINRAGVAT
VDDINEAIRT YKKSLEVVNR VVAILILAAA LLAFIVLYNL TNINIEERIR EIASLKVLGF
TRHEVDAYVF REIALLAVFG ALFGLVLGTY LEGFVVQTAE IDLVMFGRSI HMSSYWFAFG
LTLVFSLLVY VAMRSKLKNI DMVESLKSVE