Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4070 |
Symbol | |
ID | 5735928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5195478 |
End bp | 5197031 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281221 |
Product | F0F1 ATP synthase subunit alpha |
Protein accession | YP_001546830 |
Protein GI | 159900583 |
COG category | [C] Energy production and conversion |
COG ID | [COG0056] F0F1-type ATP synthase, alpha subunit |
TIGRFAM ID | [TIGR00962] proton translocating ATP synthase, F1 alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0283869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGTAA CAGCAGAAGA TATTCTTAGT CGTCTTAAAG CCAGCATCCA GCAACCTGTC GGCGGAGACC CAACTGCGGT CAACGTCGGG ACGGTTGCCA GTGTTGGCGA CGGCGTGGCG CGGATCTCAG GTCTGCGCGA TGTTATGGCT TCGGAGTTGC TTGAATTCAA GCAAACGAAG TCAGGCGAAA CCGTCATGGG CATCGCGCTG AACTTGGAAA AAGATAACGT CGCTGCCGTT ATTTTGGGCG ATTATCTCGA AATTGAAGAA GGCGACTTGG TACGCTCAAC TGGCAAGATT ATTTCAGTGC CAGTTGGTCA AGAGTTGCTT GGCCGTGTCG TTGACCCACT GGGCCGCCCA CTCGATGGCA AAGGCCCAAT CAGCGCCAGC AAAACCCGCG AAGTCGAACG GATCGCCCCG GGTGTGATCG AGCGTAAATC GGTGAGCCAA CCAGTGCAAA CTGGGATCTT GGCGATCGAC GCACTGATTC CAATCGGGCG TGGTCAACGT GAGTTGATCA TCGGCGACCG CCAAACTGGT AAGACCGCGA TCGCGATCGA TACGATCATC AACCAACGCG GTCAAGGTAT GGTTTGTGTG TATGTCGCAA TCGGCCAAAA GCGCTCGAAA GTGGCTCAAA CCATCACTAC GCTTGAGCAA AATGGCGCGA TGGATTACAC CATCGTGGTC AATGCTTCAG CTTCAGAATC AGCAGCATTG CAATATATCG CACCGTACTC AGGCTGTGCC ATCGCTGAAG AAGTGATGGA AGTCGGCGTA ACGGTCGATG GCAACTTGGT CAAAGATGCC TTGATCATCT ATGACGATTT GTCGAAGCAC GCTGTGGCTT ATCGCCAAGT GTCGTTGTTG CTGCGCCGCC CTCCAGGCCG CGAAGCCTAC CCTGGCGACG TGTTCTACTT GCACTCACGC TTGTTGGAAC GCGCTGCCCG TTTGAGCGAA GTCAATGGCG GTGGTTCGAT TACGGCCTTG CCAATCATCG AAACCCAAGC TAACGACGTT TCAGCCTACA TTCCGACCAA CGTGATTTCG ATCACCGACG GTCAAATCTT CTTGGAATCG GACTTGTTCT ATGCTGGCCA ACGCCCAGCT TTGAACGTTG GTATCTCGGT GAGCCGCGTA GGTTCATCGG CCCAAACCAA AGCCATGAAG ACCGTTGCTG GTAAGATGAA GCTCGAATTG GCGCAGTTCC GCGAATTGGC AGCCTTTGCG ATGTTTGCTT CTGATTTGGA TGCAGGCACC AAGGCCCAAA TCGAACGCGG TCAACGCCTT TCAGAATTGC TCAAGCAACC ACAATACCAA CCAATCGCTT TGGAAGACCA AGTGATTATT CTGTGGGTTG CTGGCAACGG CTTCTTGGAT GATGTTCCAG TTGCTCGGAT CAACGACTTC AAGCGCGATT TCTGGCAGTT TATCCACAGC AGCTATGCTG AGGTTGGCCG CGCGATCGCT AGCGAAAAAG TCTTGAGTGA AGCAACCATC GCCAGCTTGC GCAAGGCCGT GACCGAATTC AAGCAAACGG CGAGTTATAA ATAG
|
Protein sequence | MAVTAEDILS RLKASIQQPV GGDPTAVNVG TVASVGDGVA RISGLRDVMA SELLEFKQTK SGETVMGIAL NLEKDNVAAV ILGDYLEIEE GDLVRSTGKI ISVPVGQELL GRVVDPLGRP LDGKGPISAS KTREVERIAP GVIERKSVSQ PVQTGILAID ALIPIGRGQR ELIIGDRQTG KTAIAIDTII NQRGQGMVCV YVAIGQKRSK VAQTITTLEQ NGAMDYTIVV NASASESAAL QYIAPYSGCA IAEEVMEVGV TVDGNLVKDA LIIYDDLSKH AVAYRQVSLL LRRPPGREAY PGDVFYLHSR LLERAARLSE VNGGGSITAL PIIETQANDV SAYIPTNVIS ITDGQIFLES DLFYAGQRPA LNVGISVSRV GSSAQTKAMK TVAGKMKLEL AQFRELAAFA MFASDLDAGT KAQIERGQRL SELLKQPQYQ PIALEDQVII LWVAGNGFLD DVPVARINDF KRDFWQFIHS SYAEVGRAIA SEKVLSEATI ASLRKAVTEF KQTASYK
|
| |