Gene Anae109_4497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4497 
Symbol 
ID5375587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp5267661 
End bp5269199 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content69% 
IMG OID640846025 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_001381659 
Protein GI153007334 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.694909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0411828 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATCC GCGCCGACGA GATCAGCCGC ATCATCCGCG AGCAGATCAA GGACTACGGG 
AAGAAGGTCG ACGTCGCCGA GACCGGCACC GTCCTCAGCC AGGCGGACGG CGTCGCCCGC
ATCTACGGCC TCGCGGGCGC GGCCGCCGGC GAGCTGCTCG AGTTCCCGCA CGGCACCCGC
GGCCTCGTGC TGAACCTCGA GGAGGACAAC GTCGGCGCCG CCATCATGGG CGCCTTCGAG
CACATCCGCG AGGGCGACCC GGTGAAGCGC ACCGGGAAGA TCGCCGAGGT GGCGGTGGGC
GAGGAGCTCC TCGGCCGCGT GGTCGACGGC CTCGGCAGCC CCATCGACGG CCGCGGCCCG
GTGAACGCGA AGCACACCCG CAAGATCGAG ATCAAGGCCC CCGGGATCGT GCAGCGCAAG
TCGGTGCACG AGCCGATGCA GACCGGCCTC AAGGCGATCG ACGCCCTCGT CCCGATCGGC
CGCGGCCAGC GCGAGCTCAT CCTGGGCGAC CGCCAGACCG GCAAGACCGC CGTGGCGATC
GACACCATCC TCAACAACAA GGGGAACAAC CTCTACTGCT TCTACGTCGC GGTCGGGCAG
AAGCAGTCGA CGGTGGCCCG CGTCGTCGAG ATCCTCAAGC AGCACGGCGC GATGGAGTAC
ACCACCGTCA TCGCGGCGAA CGCCTCCGAC CCCGCCCCGA TGCAGTACCT CGCGCCGTAC
ACCGGCGTGA CCATGGCGGA GTACTTCCGG GACACCGGCC GCCACGCGCT CATCATCTAC
GACGACCTCT CCAAGCAGGC CGTGGCGTAC CGCCAGCTCT CGCTGCTCCT CCGCCGCCCG
CCGGGCCGCG AGGCGTACCC GGGCGACGTG TTCTACCTCC ACAGCCGCCT GCTCGAGCGC
GCCGCGAAGC TCTCGGAGAA GGAGGGCGGC GGGTCGCTCA CCGCGCTGCC CATCATCGAG
ACGCAGGCCG GCGACGTGTC GGCGTACATC CCGACGAACG TCATCTCCAT CACCGACGGT
CAGATCTTCC TGGAGTCGAA CCTCTTCTAT CAGGGGGTCC GCCCGGCCAT CAACGTCGGC
ATCTCCGTCT CCCGCGTCGG CGGCTCCGCC CAGATCAAGG CGATGAAGCA GGTGGCCGGC
TCGCTCAAGC TCGAGCTCGC GCAGTACCGC GAGCTCGCCG CCTTCGCGCA GTTCGGCTCC
GACCTCGACA AGGCGACCCA GGAGACCCTC GCCCGCGGCG AGCGCCTCGT GGAGCTCCTG
AAGCAGGGCC AGTACTCGCC GATGCCGGTG GAGAAGCAGG TCATCCAGAT CTACGCCGCC
ACGCAGAAGG ACGAGAACGG CCAGGGCTGG ATCCGGCAGG TGCCGGTGGA GGAGGTCGGC
CGCTACATGC GCGAGCTCGT GGAGTTCCTC GACGCGCGCC ACCCGGGCGT CGCCAAGACC
ATCGCCGAGA AGAAGGCGCT CGACGACGGC ATCCGCTCCC AGCTCGACGC GGCGCTGCGC
GAGTTCCGCG GCGTGTTCCA GGTCGAGGGC CAGGCGTAA
 
Protein sequence
MEIRADEISR IIREQIKDYG KKVDVAETGT VLSQADGVAR IYGLAGAAAG ELLEFPHGTR 
GLVLNLEEDN VGAAIMGAFE HIREGDPVKR TGKIAEVAVG EELLGRVVDG LGSPIDGRGP
VNAKHTRKIE IKAPGIVQRK SVHEPMQTGL KAIDALVPIG RGQRELILGD RQTGKTAVAI
DTILNNKGNN LYCFYVAVGQ KQSTVARVVE ILKQHGAMEY TTVIAANASD PAPMQYLAPY
TGVTMAEYFR DTGRHALIIY DDLSKQAVAY RQLSLLLRRP PGREAYPGDV FYLHSRLLER
AAKLSEKEGG GSLTALPIIE TQAGDVSAYI PTNVISITDG QIFLESNLFY QGVRPAINVG
ISVSRVGGSA QIKAMKQVAG SLKLELAQYR ELAAFAQFGS DLDKATQETL ARGERLVELL
KQGQYSPMPV EKQVIQIYAA TQKDENGQGW IRQVPVEEVG RYMRELVEFL DARHPGVAKT
IAEKKALDDG IRSQLDAALR EFRGVFQVEG QA