Gene Athe_1426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1426 
Symbol 
ID7409169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1507808 
End bp1509331 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content40% 
IMG OID643715789 
ProductATP synthase F1, alpha subunit 
Protein accessionYP_002573297 
Protein GI222529415 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGATG TAACAATTAG ACCAGATGAA ATTGCTTCAA TTATAAAAGA GCAAATTAAA 
AATTATGAAA AAAAGATTGA CACAAGTGAT GTCGGTGTTG TCATAATGTC AGGTGACGGT
ATTGCAAGAA TACATGGTCT TGACAACTGC ATGGCTGGTG AACTTTTAGA ATTTCCTAAT
GGAGTTTATG GAATGGCTCT CAATTTAGAA GAGGACAATG TTGGATGTGT CATACTTGGA
AATGATAAAG AAATAAAGGA AGGTACCATT GTAAAAAGGA CAGGTAGAGT TGTTGAGGTG
CCTGTAGGAG AAGAACTTTT AGGAAGAGTT GTAAACGCTC TGGGACAACC TATAGATGGT
CTTGGGCCCA TCAATGCCAA AAAGTTTAGA CCAGTAGAAA GAATAGCACC TGGCGTAATA
GAAAGAGAAC CTGTTAACAC GCCTCTTCAG ACAGGTATAA TGGCTATTGA CGCTATGATT
CCTATAGGAA GAGGGCAAAG AGAGCTTATA ATTGGTGATA GGCAAACGGG TAAAACTGCA
ATTGCAATAG ATACGATTAT AAACCAGAAA GATCAGGGTG TTTATTGCAT CTATGTGGCA
ATTGGGCAAA AGGCCTCTAC AGTTGCTCAG ATAGTCAATA CCTTAAAAGA ATACGGTGCG
ATGGATTATA CCATTGTTGT TAGTGCAACT GCAAGTGATT CTGCTCCTCT TCAATTCTTA
GCTCCATACG CAGGTTGCGC GATGGGAGAA GAGTTTATGG AGTCGGGCAA AGACGCACTC
ATTATATACG ATGACCTTTC TAAACACGCT GTTGCATACA GGGCAATGTC TCTTTTACTA
AGACGTCCAC CTGGAAGAGA AGCTTATCCT GGTGATGTGT TTTACTTACA TTCAAGACTT
TTGGAAAGAG CGGCAAAACT GAATGCTCAG CGTGGAGGCG GATCTCTTAC CGCACTGCCA
ATAATAGAAA CTCAAGCAGG TGACGTTTCA GCATATATTC CTACAAATGT CATTTCAATT
ACAGATGGGC AGATATACCT TGAAAGTGAA TTATTCTACG CAGGGGTAAG ACCTGCGATA
AATGCTGGAA TATCAGTGTC AAGAGTTGGT GGGAAAGCTC AGACAAAAGC TATGAAAAAG
GTTGCAGGAA GGTTAAGGCT TGATCTTGCT CAGTACCGTG AGCTTGAGGC TTTTGCTCAG
TTTGGTTCAG AACTTGATAA GTCAACGCGA GAAAGGCTTG CTCAGGGACA AAGAATTGTA
GAGACGTTAA AACAGCCACA GTACAAGCCG CTTCCTGTAT GGCATCAGGT GGTGATTTTG
TACAGTGCAA TAAATGGTTA TCTGATGGAT ATAGAAGTTT CAAAGGTCAG GGAATTTAAT
GAGAAGCTTG TACAGTATAT ATCAGCAAAC TATCCCCAGA TATTTGATTC TATAAAAGAG
ACAAAGGATT TGACACCTGA AACAGAAGAG CTTTTGAAGA AGGTCATAGT AGAGATAAAA
GAGAGATTTA AGAGTAACAA GTAG
 
Protein sequence
MIDVTIRPDE IASIIKEQIK NYEKKIDTSD VGVVIMSGDG IARIHGLDNC MAGELLEFPN 
GVYGMALNLE EDNVGCVILG NDKEIKEGTI VKRTGRVVEV PVGEELLGRV VNALGQPIDG
LGPINAKKFR PVERIAPGVI EREPVNTPLQ TGIMAIDAMI PIGRGQRELI IGDRQTGKTA
IAIDTIINQK DQGVYCIYVA IGQKASTVAQ IVNTLKEYGA MDYTIVVSAT ASDSAPLQFL
APYAGCAMGE EFMESGKDAL IIYDDLSKHA VAYRAMSLLL RRPPGREAYP GDVFYLHSRL
LERAAKLNAQ RGGGSLTALP IIETQAGDVS AYIPTNVISI TDGQIYLESE LFYAGVRPAI
NAGISVSRVG GKAQTKAMKK VAGRLRLDLA QYRELEAFAQ FGSELDKSTR ERLAQGQRIV
ETLKQPQYKP LPVWHQVVIL YSAINGYLMD IEVSKVREFN EKLVQYISAN YPQIFDSIKE
TKDLTPETEE LLKKVIVEIK ERFKSNK