Gene Apar_1194 details


Gene Information

Locus tag: Apar_1194
Symbol:
ID: 8414072
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1338122
End bp: 1341037
Gene Length: 2916 bp
Protein Length: 971 aa
Translation table: 11
GC content: 51%
IMG OID: 645022788
Product: magnesium-translocating P-type ATPase
Protein accession: YP_003180213
Protein GI: 257784996
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0474] Cation transport ATPase
TIGRFAM ID: [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
            [TIGR01524] magnesium-translocating P-type ATPase
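
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch makes explicit. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1338122, 1341037)
print(length)                  # 2916
print(protein_length(length))  # 971
```

Both results match the Gene Length (2916 bp) and Protein Length (971 aa) fields of this record.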


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.30528
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATCA TGAAGCGCAC AACCCAAACA AAAACAACTC AGACCACCTC AGGAGAGCTT 
GGACAGTCCC TGACTTTTGC CGCAACTCAC TCGGTAAAAA GTGTCTGTAG AAAATTCCAT
ACCACTCCCG ACGGCCTTGC CATTGACATT GCGCAAGCAA AATACGACCT CGATGGCCCA
AATGTTATCA CCGGTGCCGC TGAAGAGCCC TTCATCATCC GCTTGCTCAA GAGCTTTGCA
AGTCCCTTCA CCTTCATTTT GATTGCTCTC GCAGGCATTT CTTACATCAC TAACGTGGTC
CTTGCCACAG ATGGCGAGAA GGACCCTTCA ACCGTCATTA TCATTACGTC AATGGTGTTT
ATTTCCGGCA TTATCGACTT TGTCCAGAGC TCCAAGGGAG CCTCGGCAGC GGCGGCGCTC
TCCAAAATGG TGACCTCCAC TACAAGGGTT ATCCGTCGTC CCATCGACTT TGATGACCTG
GCCACTGACC AGGATTCTAG CCAAAATCTC AACCAGAACT CAGACGAAAG TACAGATGAA
CTCAAGGACT TTGAAGATGA ATACGCTACA GATGAAACCG CGGGCGAATC CGAGGACGCC
GCCGACGAAG ACGAAGCTTC AGAGCCTAAA CTTGCTTCCG CGCTGGGAGA AGAAATACCC
TTTGAACAGG TAGTTATTGG TGACATCATC CGCCTTGCTT CGGGCGACAT GATTCCTGCA
GATTGTCGCG TTCTGGACGC CAAGGACCTC TTTGTCAACG AAACAGCGCT CACCGGAGAA
TCAGAGCCTG TCGAGAAAAC CGCTGGTGTA GTCCACGCTC GCCGTCGCGC AGATGGAACA
CGCTATCCTC TCTCGCTCTC GGAGTGCACA AACCTGCTGT TCGCCGGCAC TACCGTTCAG
TCAGGAAGTG CAACGGTGGT TGTTGTTGCA ACTGGCAACA AGACTTATGT GGGCACAATG
TCCGAGATGC TCCAGCAGCC TTCTGGCGAG ACCAGCTTTG ATGAGGGTCT CAAGTCCGTT
TCCAAGGTAC TTGTTTCCTT CATGTTGATC ATGTGCCCCA TCGTGTTCTT TGCAAACGGA
TTCCTTAAGG GCGACTGGTT TGATGCACTG CTCTTCAGCG TTTCCGTTGC CGTTGGTATT
ACTCCTCAGA TGCTGCCCGT TATTGTAACC ACTTGCCTTT CTCGCGGTGG AACACAAATG
GCCAAGCAGG ACGTTATTGT TAAGAATCCA GCCGCAATCC AGAACCTGGG CGCCATGGAC
ATCCTGTGTA CCGACAAAAC TGGCACCATC ACCGCAGACG AGGTTGTTCT CGAACGTCAC
CTCAACATTC TGGGTGAAGA GGACGCTCGC GTGCTCCGCC ATGCGTATCT GAACAGCTAC
TTCCAAACCG GCCTCAGAAA CCTTATTGAC AAGGCAATCA TCAAGACTTC AAACGATGAA
CTGCCCACCA ACTTGCTGAG CATTGAATAC GAGAAGATCG ACGAAGTCCC GTTTGACTTT
GAGCGTCGCC GCATGAGCGT TGTGGTAAGA AACACAAAAA CTAACAAGAC GCAGATGATC
ACTAAGGGCG CCGTTGAAGA AGTCCTCAAC GCATGCTCCT TTGTTGACCT TGACTCTGAG
ATTAAGCCAC TCACTCCCGC CCAGCGCAAG AGCGTTATGG ACCGCGTCTA CCAGCTCAAT
CAGGAGGGTA TGCGCGTGGT TGGCGTTGCT CAGAAGAGCG ACCCTCGCGG CGTCGGCGAG
TTTGGCGTAG ACGATGAGCG CGACATGGTG CTCATTGGCT ACCTGGCCTT CTTGGATCCG
CCAAAAGAGA GCGCTCGCGA GGCAATTGCC AAGCTTAACC AGAGAGGCGT GCAGGTCAAG
GTGCTCACCG GCGACAACGA GGGCGTTGCT GCCGCCGTCT GCAAAAAAGT GGGCATCCAC
GTTGACGAAC TCCTGCTTGG TAGCGACGTA GAGAACCTCA ACGATGAGCA GCTCAAAGAG
CGTGTTGAAA AGACCCAGCT CTTTGCAAAG CTTTCCCCCA TGCAAAAGGC CCGTGTTGTC
TCTGCGCTCA GATCCAACAA CCATGTTGTC GGCTTTATGG GCGACGGCAT TAACGATGCT
GCTGCTATGC GCTCTTCCGA TGTAGGAATT TCCGTAGATA CCGCTGTTGA CGTGGCAAAA
GAATCTGCTG ACATTATCTT ACTGCAAAAG GATCTGCTGG TACTTGAGCA CGGCGTTGAA
GAAGGTCGCA GAACCTACGG CAATACCATC AAGTACATCA AGGCAACCGC AAGCTCCAAC
TTTGGTAACG TACTGTCCGT TTTGGTGGCC AGTTTCTTCC TTCCGTTCTT GCCTATGAGC
GCCCTTCAGC TGCTCCTTTT AGGACTCGTA TACACCGTTA CCTGCATTGC CATTCCTTGG
GACAACGTGG ATGATTCATT CCTTTCAAGT CCGCGCTCTT GGGACGCCCA TTCAATTACA
AACTTTATGC TTTGGATTGG ACCTATCAGT TCAATTTTTG ATGTACTTAC CTTTGCCCTC
ATGTTCTTTA TGGTCTCTCC CACTCTTGCC GGAGGAACCT GGGCAGAACT TACCGCAGCT
GGAAACACCG CAGCTCAGAC GCTCTTTATC CTGTCTTTCC AAACAGGCTG GTTCATCGAA
TCTATGTGGA CCCAAACCTT TGTCCTCCAC GCACTTAGAA CCAATAAAAT CCCGTTTGTT
CAGAGCATGC CATCAGCTTC CTTACTTACT CTAACCACCG CGGGAATCGT TGTAGTCAGC
GCTCTTCCGT ATCTGCCGGT GTTCGCCCAG CCCTTGAGCC TTGTTGCTCT ACCTTTGTCG
TTCTTTGGAT GGCTAATTGC ACTCATGAGC GGGTATATGG TTCTTATCAC CATAATCAAG
AGCCTCTACG TTAAGCGATT TGGCTCTCTG CTCTAA
 
Protein sequence
MTIMKRTTQT KTTQTTSGEL GQSLTFAATH SVKSVCRKFH TTPDGLAIDI AQAKYDLDGP 
NVITGAAEEP FIIRLLKSFA SPFTFILIAL AGISYITNVV LATDGEKDPS TVIIITSMVF
ISGIIDFVQS SKGASAAAAL SKMVTSTTRV IRRPIDFDDL ATDQDSSQNL NQNSDESTDE
LKDFEDEYAT DETAGESEDA ADEDEASEPK LASALGEEIP FEQVVIGDII RLASGDMIPA
DCRVLDAKDL FVNETALTGE SEPVEKTAGV VHARRRADGT RYPLSLSECT NLLFAGTTVQ
SGSATVVVVA TGNKTYVGTM SEMLQQPSGE TSFDEGLKSV SKVLVSFMLI MCPIVFFANG
FLKGDWFDAL LFSVSVAVGI TPQMLPVIVT TCLSRGGTQM AKQDVIVKNP AAIQNLGAMD
ILCTDKTGTI TADEVVLERH LNILGEEDAR VLRHAYLNSY FQTGLRNLID KAIIKTSNDE
LPTNLLSIEY EKIDEVPFDF ERRRMSVVVR NTKTNKTQMI TKGAVEEVLN ACSFVDLDSE
IKPLTPAQRK SVMDRVYQLN QEGMRVVGVA QKSDPRGVGE FGVDDERDMV LIGYLAFLDP
PKESAREAIA KLNQRGVQVK VLTGDNEGVA AAVCKKVGIH VDELLLGSDV ENLNDEQLKE
RVEKTQLFAK LSPMQKARVV SALRSNNHVV GFMGDGINDA AAMRSSDVGI SVDTAVDVAK
ESADIILLQK DLLVLEHGVE EGRRTYGNTI KYIKATASSN FGNVLSVLVA SFFLPFLPMS
ALQLLLLGLV YTVTCIAIPW DNVDDSFLSS PRSWDAHSIT NFMLWIGPIS SIFDVLTFAL
MFFMVSPTLA GGTWAELTAA GNTAAQTLFI LSFQTGWFIE SMWTQTFVLH ALRTNKIPFV
QSMPSASLLT LTTAGIVVVS ALPYLPVFAQ PLSLVALPLS FFGWLIALMS GYMVLITIIK
SLYVKRFGSL L
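
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, together with a GC-fraction calculation and a basic sanity check for a coding sequence. The helper names are hypothetical, and the set of start codons checked (ATG, GTG, TTG) is a simplifying assumption covering only the most common bacterial initiators, not the full set allowed by translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, common start codon, stop codon at end."""
    starts = {"ATG", "GTG", "TTG"}  # assumption: most common bacterial starts only
    stops = {"TAA", "TAG", "TGA"}
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# First row of the gene sequence from this record, as printed on the page.
first_row = "ATGACAATCA TGAAGCGCAC AACCCAAACA AAAACAACTC AGACCACCTC AGGAGAGCTT"
row = clean_sequence(first_row)
print(len(row))          # 60
print(gc_fraction(row))  # GC fraction of this row only, not of the whole gene
```

Applying `clean_sequence` to the complete gene sequence and passing the result to `gc_fraction` and `looks_like_cds` would allow a comparison with the GC content and length fields reported in the record.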