Gene Cagg_0987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0987 
Symbol 
ID7268359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1220699 
End bp1222267 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content57% 
IMG OID643565836 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_002462341 
Protein GI219847908 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.746829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGA TTACTGAGGA ATTGATCGCC CGGCTGAAGC AGGGGATCAT CAGCGGCGTC 
GACCTTCAGC CGCGTCAGGT CAATGTCGGT ACGGTGATTG CCGTCGGTGA CGGCGTCGCG
CGGCTGTCCG GCCTTGATCA GGTCGTTGCC TCTGAGATCG TTGAGTTCCC GCCTAAGGCC
GGCCGGAATG AGTCGATTTA TGGTATTGCC CTGAACCTCG AGCAGGATAG CGTCGCTGCG
ATCATTCTCG GTGACGATGA AAGTATTGAG GAAGGGGATC TGGTAACTTC GACTGGCCGC
GTGATTTCAG TGCCCGTTGG TCAGGGTCTG CTGGGCCGGG TGGTTAATCC GCTCGGTCAG
CCGATCGATG GCAAGGGGCC GATCCAGTAT GAGAAGATGC GCCCAATTGA GCGGATTGCC
CCCGGTGTGA TTACCCGTAA GTCGGTCGAT ACACCGGTGC AAACCGGGAT TATCGCGATT
GATGCCCTGA TCCCGATCGG TCGTGGTCAG CGTGAGTTGA TCATCGGTGA CCGCCAAACT
GGTAAGACGG CAGTGGCGAT TGATACCATC ATTAACCAGA AGGGTCAAGG GATGGTGTGT
ATCTACGTCG CCATCGGTCA GCGCCGGGCG CAGGTGGCGC AGGTGGTCGG TACGCTCGAG
AAGTATGGGG CGATGGAGTA TACCATCGTG GTTTCGGCCA CCGCTTCGGA AAGCGCTGCA
CTCCAGTATA TCGCGCCTTA CGCCGGTTGT GCGATGGGCG AAGAGATCAT GGAGAACGGC
GTGATGCTCA ACGGGCAATT GGTCAAGGAC GCTTTGATCG TCTACGATGA CTTGAGCAAG
CACGCAGTCG CTTATCGCCA AGTGTCACTG TTGTTGCGCC GTCCGCCCGG TCGTGAGGCG
TACCCCGGTG ATGTCTTCTA TCTGCATTCA CGGTTGCTCG AACGTGCTGC CCGTCTGAAC
GAGGAATACG GTGGTGGTTC GCTGACAGCC TTGCCCGTGA TTGAGACGCA AGCTAACGAT
GTTTCGGCCT ATATTCCGAC AAACGTTATT TCGATTACCG ATGGTCAGAT CTATCTGGAG
GCTGACTTGT TTAACGCCGG TCAGCGTCCG GCGCTCAACG TCGGTATTTC AGTGTCGCGT
GTCGGTAGTG CAGCGCAAAC GCGAGCGATG CGTGCCGTCG CCGGTAAGCT GAAGGGTGAG
CTGGCTCAGT TCCGCGACCT TGCCGCTTTT GCCCAGTTTG CCAGTGATCT CGATGCCACA
ACGAAGGCTC AGATCGAGCG TGGTCAGCGG CTGCAAGAGC TGCTCAAACA GCCGCAGTTC
CAGCCGTTGG CCGTCGAAGA TCAAGTGGCC GTGTTGTACG CAGCCACCAA CAACTACCTC
GACGATGTGC CGGTGGCAAT GATCACGAAG TGGCGCGATG ATTTTCTGGC CTTCCTGCGC
ACCGCCCATC CAGAAGTGCG GAAGCTGATC TACGACAATC GCCTTGATCG CAAGTTCCCC
ACGCCTGAGG TTAAGGAAGC GTTGGAGGCG GCGATTAAGG AGTTTAAGGC GACCAGTAAC
TATAGCTAG
 
Protein sequence
MTTITEELIA RLKQGIISGV DLQPRQVNVG TVIAVGDGVA RLSGLDQVVA SEIVEFPPKA 
GRNESIYGIA LNLEQDSVAA IILGDDESIE EGDLVTSTGR VISVPVGQGL LGRVVNPLGQ
PIDGKGPIQY EKMRPIERIA PGVITRKSVD TPVQTGIIAI DALIPIGRGQ RELIIGDRQT
GKTAVAIDTI INQKGQGMVC IYVAIGQRRA QVAQVVGTLE KYGAMEYTIV VSATASESAA
LQYIAPYAGC AMGEEIMENG VMLNGQLVKD ALIVYDDLSK HAVAYRQVSL LLRRPPGREA
YPGDVFYLHS RLLERAARLN EEYGGGSLTA LPVIETQAND VSAYIPTNVI SITDGQIYLE
ADLFNAGQRP ALNVGISVSR VGSAAQTRAM RAVAGKLKGE LAQFRDLAAF AQFASDLDAT
TKAQIERGQR LQELLKQPQF QPLAVEDQVA VLYAATNNYL DDVPVAMITK WRDDFLAFLR
TAHPEVRKLI YDNRLDRKFP TPEVKEALEA AIKEFKATSN YS