Gene Apar_1300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1300 
Symbol 
ID8414184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1467467 
End bp1469140 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content50% 
IMG OID645022896 
ProductATP synthase F1, alpha subunit 
Protein accessionYP_003180315 
Protein GI257785098 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.515384 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGATA AGACCACCAT ACAGGAAGGA ACGCTAAGCT CAGATGCTGT AATGGCACAG 
CTTAGAGAGC GCCTTTCTAA GGTCAATTCA ACCGTATCCC AGCAAGAGGT ATCTGCTGTT
ACTGAGGTAG CAGACGGTAT TGCTCGTGTT GCTGGTCTAC GCAGCGCTAT GGCAGGCGAG
TTGCTGGAGT TTACCAGCTC CGTTACTGGC CACAGCGTAT TCGGTCTTGC TCAGAACCTT
GATGAGACTT CTGTCGGTGC TGTTTTGTTC GGCGAAGTTT CTGAAATCAA GGAAGGCGAC
GAGTGTCGCA CTACTGGCCG TGTTATGGAC ATCCCTGCTG GATACGACAT GCTTGGTCGT
GTGGTTAATC CATTGGGTCA GCCTATTGAC GGTCTTGGTG CAATCCACGC TACACATCGT
CGTCCTATTG AGTTCAAGGC TCCTGGCATT ATGCAGCGTC AGCCTGTCTG CGAGCCAGTT
CAGACTGGTC TTCTTGCAAT TGACGCTATG GTTCCTGTCG GCCGTGGTCA GCGTGAGTTG
ATCATTGGTG ACCGTAAGAC TGGTAAGACC GCTATCGCAA TTGACGCAAT CGTAAACCAG
AAGAACACCG ACATGATTTG TATCTATGTC GCCATCGGTC AGAAGGCGTC TACAGTTGCA
AACATCCGCG AGTCTCTTTC CCGCCATGGT GTTCTTGATA AGACCGTCAT TGTTGCAGCT
ACCGCAGCTG ATTCTGCTCC AATGCAGTAC ATTGCCCCTA TGGCAGGCGC TGCTATTGGT
GAGTTCTTTA TGTATAACGA TAAGGACGGC AAGCCTGCTG ACAAAGATCA TCCAGGCGGA
CACGTTCTTG TTGTTTACGA TGACCTCTCT AAGCAGGCAG TTGCATACCG TCAGATGTCA
CTGACGCTTC ACCGTCCACC AGGACGTGAG GCATATCCTG GTGATATTTT CTACCTGCAC
TCACGCTTGC TGGAGCGTGC ATGTAAGCTC TCCGATGAAA ACGGAGCAGG TTCTCTTACA
GCACTTCCAA TTATTGAGAC GCAGGAAGGC GATGTCTCTG CATACATTCC TACAAATGTC
ATTTCCATTA CAGACGGCCA GATTTATCTG CAGTCTAACC TATTCTTCCA GGGTCAGCGC
CCAGCAGTCG ACGTAGGTAT TTCCGTATCC CGCGTCGGTG GCGACGCTCA GGTTAAGGCT
ATGAAGCAGG TTGCAGGTAC ACTTCGTCTG GACCTTGCAA GTTACCGTGA GAAGCAGGCA
TTCTCGCAGT TTGGATCCGA CTTGGATGCA ACTACGCAGT ACCAGCTTAA CCATGGTGCT
CATATGATGG AGCTTCTCAA GCAGCCACGT TACTCAGCTC TGGACGTAGT TGACCAGATT
TGCGCAATTT ACGCCGCAAA AGAGAATTTC ATTGATGACG TTGATCTTGA GAACGTTGCT
CTCTTCCGTG ACGGCCTTGC TCAGTACATG AGTGAGTATC ACCCACAGCT CCGCAACTCT
TTGCGCACAG GAAAGATTTC TGATGAGCAG GCAGAGCGCT TGAGTGATCA CATTAAGCAC
TTCAAGGCTA AGTTTATAGA AGAGCATCCA AACAAGGTTG TGGATGAAGC AGAAGCTAAT
GCAGATACAC CAGGTTCTTT TGACGCAGTA ACCCAGGGTT CATTGCAGGA GTAA
 
Protein sequence
MTDKTTIQEG TLSSDAVMAQ LRERLSKVNS TVSQQEVSAV TEVADGIARV AGLRSAMAGE 
LLEFTSSVTG HSVFGLAQNL DETSVGAVLF GEVSEIKEGD ECRTTGRVMD IPAGYDMLGR
VVNPLGQPID GLGAIHATHR RPIEFKAPGI MQRQPVCEPV QTGLLAIDAM VPVGRGQREL
IIGDRKTGKT AIAIDAIVNQ KNTDMICIYV AIGQKASTVA NIRESLSRHG VLDKTVIVAA
TAADSAPMQY IAPMAGAAIG EFFMYNDKDG KPADKDHPGG HVLVVYDDLS KQAVAYRQMS
LTLHRPPGRE AYPGDIFYLH SRLLERACKL SDENGAGSLT ALPIIETQEG DVSAYIPTNV
ISITDGQIYL QSNLFFQGQR PAVDVGISVS RVGGDAQVKA MKQVAGTLRL DLASYREKQA
FSQFGSDLDA TTQYQLNHGA HMMELLKQPR YSALDVVDQI CAIYAAKENF IDDVDLENVA
LFRDGLAQYM SEYHPQLRNS LRTGKISDEQ AERLSDHIKH FKAKFIEEHP NKVVDEAEAN
ADTPGSFDAV TQGSLQE