Gene Anae109_4487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4487 
Symbol 
ID5375960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp5256622 
End bp5257842 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content70% 
IMG OID640846015 
ProductF0F1 ATP synthase subunit A 
Protein accessionYP_001381649 
Protein GI153007324 
COG category[C] Energy production and conversion 
COG ID[COG0356] F0F1-type ATP synthase, subunit a 
TIGRFAM ID[TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.553444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0963007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCG CCAGCCTCGT CACCCTCGCC CTCTCGCTGT CCCTCGCGCA GGCCGCTGGT 
CACGCCGGCG AGCACGGCGC CCCCGCGCCC GAGGTCGCCA CCCCGGCCGA GGGGCACGGC
GCGCGCGACG CGGCCGGGGC CGCCACCGAC CCGCACGGCG CGGCCGCCGA GCACGGCGCC
GCCGCGCACG AGGACCCGGC GCAGCACGGC GCGGCGGGCG CGGAGGCCGG CCACGACGAG
AGCCTCGGCG CGGTGATGAT GCACCACGTG GCCGACGGCT ACGTGCTGGA GCTCCCCGGC
TTCTGCGGCG GCCTCTCGTG GGCCTGCCAC GTCGACCTGC GCGACGTCTT CGGCACCGAG
CACGTCAGCG AGATCGACGC CCACGGCCAC GCGGTGGAGC GGAACGTGAG CGGCCCGCTC
GTCTTCGGCA AGGTCGACAT GACCCCCACG AAGCACGTGG TGATGATGTG GATCGCCTCG
GCGATCCTCC TCCTCGTCGT GTTCGCGGCG GTCCGCAAGA AGAGCCTCGT CCCGCGCGGC
CTCTACAACT TCATCGAGAT GCTCGTGCAG TTCGTGCGCA ACGAGATCGC GGTGAAGAAC
ATCGGCGAGA AGGACGCGGA CCGCTTCGTG CCGTACCTCG TCTCCGCCTT CTTCTTCATC
CTCTTCCTGA ACCTCTTCGG CCTCGTGCCC TTCGCCGCCA CCGCCACCGC GAACATCTCG
GTGACGGTGA TGATGGCGGT GTTCACCTTC CTCATCACCC AGTACGCGCA GATCAGGGCG
GTGGGGGTGG GCGGGTACTT CGCGCACATG ACGGGCGGCG TGCCGAAGTC GCTCTGGCCG
CTGTGGTTCA TCATGATCCC GGTCGAGTTC CTCGGCCTGT TCACGAAGCC CTTCGCCCTC
ACCGTCCGTC TCTTCGCCAA CATGGTGGCG GGCCACTTCG TCATCCTGGC CCTGCTCGGC
CTCATCTTCG CGCTGAACTC GCAGTGGATC GCGATCGCCT CCGTCCCGAT GGCGCTCTCC
ATCTACATGC TCGAGCTCTT CGTGGCCTTC GTGCAGGCCT ACATCTTCAC CATGCTCTCC
TCGCTGTTCA TCGGCTCCGT CGTGGCGCAC CACGGCCACG AGGACGAGCA CGAGGAGCAC
GGGCACGGCG CGGCCGCCAC GGGCGGGGCG CACGGCTCTC ACGGTTCTCA CGTGGCGGGG
GCGTCCCCCG GCCATGGGTA G
 
Protein sequence
MTAASLVTLA LSLSLAQAAG HAGEHGAPAP EVATPAEGHG ARDAAGAATD PHGAAAEHGA 
AAHEDPAQHG AAGAEAGHDE SLGAVMMHHV ADGYVLELPG FCGGLSWACH VDLRDVFGTE
HVSEIDAHGH AVERNVSGPL VFGKVDMTPT KHVVMMWIAS AILLLVVFAA VRKKSLVPRG
LYNFIEMLVQ FVRNEIAVKN IGEKDADRFV PYLVSAFFFI LFLNLFGLVP FAATATANIS
VTVMMAVFTF LITQYAQIRA VGVGGYFAHM TGGVPKSLWP LWFIMIPVEF LGLFTKPFAL
TVRLFANMVA GHFVILALLG LIFALNSQWI AIASVPMALS IYMLELFVAF VQAYIFTMLS
SLFIGSVVAH HGHEDEHEEH GHGAAATGGA HGSHGSHVAG ASPGHG