Gene Acid345_4334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4334 
Symbol 
ID4071752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5142394 
End bp5143932 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content60% 
IMG OID637986367 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_593408 
Protein GI94971360 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00117083 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.611425 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGA TTAAAGCAGA CGAAATTACA AAACTGATCC GGTCGCAGAT CGAGAACTAC 
GAGACCAAGA TCGCGGTGGA CGAAGTCGGC ACCGTGATGT CGATTGGCGA CGGCATTGCC
CGCGTTTACG GCATTGATAA GGTCATGGCC GGCGAGCTAT TGGCCTTCCC GCACGGCGTG
GCCGGAATCG CGATGAACCT GGATGAAGAC CAGGTGGGCG CGGTGCTGCT CGGCGAGTAC
ACCGCGATCA AAGAGGGCGA CGAGGTTAAG CGCACGAAGC GGATTATGAG CGTGCCTGTC
GGTGAGGCGA TGATTGGGCG GGTAGTGAAC GCGCTTGGTC AGCCGATTGA CGACAAGGGG
CCGATCGTCA CCGACAAGTT CAATCCGGTG GAGCGCATTG CGCCGGGTGT GATTGATCGC
CAGCCGGTGC GTGAACCGAT GGCGACAGGT TTGAAGGCGA TTGACGCGAT GATCCCGGTG
GGCCGCGGTC AGCGCGAGCT GATCATTGGC GACCGCCAGA CCGGCAAGAC CGCGGTGGCG
CTGGACACGA TCATCAACAG TAAGGGCAAG AACCTGATCT GCGTGTACGT TGCGATCGGC
CAGAAGCGGT CGAGCGTGGC GCAGGTGGTG AAGATGCTGG AAGACAACGG CGCGATGGAG
TACTCGATCG TGGTCGTCGC TTCGGCCAGC GACCCGGCGC CAATGCAGTA CATCGCTCCT
TACTCCGGAA CGGCGATTGC CGAGTACTTC CGTGACAGCG GGCGTCACGC GCTGTGCATT
TACGACGATC TGTCGAAGCA GGCTGCGGCG TACCGCGAAA TTTCGCTGCT GCTGCGGCGT
CCACCGGGAC GCGAGGCGTA TCCGGGCGAC GTGTTTTATC TGCACAGCCG TTTGCTCGAG
CGTTCGTCGA AACTGAGCGA TAAGTTGGGT GGCGGTTCGA TTACGGCACT GCCGATTATC
GAAACGCAAG CGGGCGACGT TTCGGCGTAC ATTCCGACCA ACGTGATTTC GATTACCGAC
GGCCAGATTT ACCTTGAAAC CGACTTGTTC AACTCGGGCG TGCGTCCGGC GGTGAACGTC
GGTCTGTCGG TGAGCCGTGT GGGATTCTCG GCGGCGATCA AGGCGATGAA GCAGGTCGGC
GCCAGTCTGA AGCTGGAACT TGCGCAGTAC CGCGAGTTGG CGGCGTTCTC GCAGTTCGGC
AGCGACCTGG ACAAGGCGAC GCAAGCACAG TTGAATCGTG GCCAGCGCCT GGTGGAGATC
CTGAAGCAGG ACCAGTTCCA GCCGCTTCCG TTCTCGAAGC AGATCACGAT CATCTTCGCC
GGAACCAACG GGCTTCTCGA TGATCTCGAA GTGAAGGACG TTCGTCCGTT CGAGAAAGCG
CTCTATGAAT ACGTGGAGAG CGCGAACCCG CAGTTGTTCC GCACGATCGA AGAGAAGAAA
GCGCTCGACG ATGCGATTAA GGCGGACATG ACGAAGACGA TCAAGGAAGC CAAAGAGCGT
TTCTTGTCGG ATCGCAAGGC GGCGAAGGCC GGGGCGTAA
 
Protein sequence
MAQIKADEIT KLIRSQIENY ETKIAVDEVG TVMSIGDGIA RVYGIDKVMA GELLAFPHGV 
AGIAMNLDED QVGAVLLGEY TAIKEGDEVK RTKRIMSVPV GEAMIGRVVN ALGQPIDDKG
PIVTDKFNPV ERIAPGVIDR QPVREPMATG LKAIDAMIPV GRGQRELIIG DRQTGKTAVA
LDTIINSKGK NLICVYVAIG QKRSSVAQVV KMLEDNGAME YSIVVVASAS DPAPMQYIAP
YSGTAIAEYF RDSGRHALCI YDDLSKQAAA YREISLLLRR PPGREAYPGD VFYLHSRLLE
RSSKLSDKLG GGSITALPII ETQAGDVSAY IPTNVISITD GQIYLETDLF NSGVRPAVNV
GLSVSRVGFS AAIKAMKQVG ASLKLELAQY RELAAFSQFG SDLDKATQAQ LNRGQRLVEI
LKQDQFQPLP FSKQITIIFA GTNGLLDDLE VKDVRPFEKA LYEYVESANP QLFRTIEEKK
ALDDAIKADM TKTIKEAKER FLSDRKAAKA GA