Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4072 |
Symbol | |
ID | 5735930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5198044 |
End bp | 5199459 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281223 |
Product | F0F1 ATP synthase subunit beta |
Protein accession | YP_001546832 |
Protein GI | 159900585 |
COG category | [C] Energy production and conversion |
COG ID | [COG0055] F0F1-type ATP synthase, beta subunit |
TIGRFAM ID | [TIGR01039] ATP synthase, F1 beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00152005 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACTG GAAAAATTTT ACAAATTACT GGCGTGGTTA TCGACGCAGA ATTTCCTGCC GATGGCCTGC CACAAATTTA TAACGCGTTG GAAATTCCCT TGGGCGAAGG CCGCTCATCG CTGATCTGCG AAGTCCAACA GCAGTTGGGT GATAGCGTGG TTCGCGCGGT CGCTATGTCC ACCACCGACG GCCTTGTCCG TGGGATGGAC GTAATCGACA CTGGCGCACC AATCAGCGTG CCAGTCGGCC CCGAAACCTT GGGTCGGGTG TTCGACGTTC AAGGTCGGCC AATCGACGGT GAAGGCGCGG TTGGTACTAC CAAAACCATG CCAATTCACC GCCCAGCCCC AACCTTTGAA GAACAGTCAA ACCGCGCCGA GTTGTTCGAA ACCGGCATCA AGGTTATCGA CTTGATCGCG CCCTTCACCA AGGGTGGTAA AACTGGGGTG TTCGGTGGCG CAGGTGTGGG CAAGACCGTT ATTATCCAAG AGTTGATCTC GAATATCGCT AAAGAACAAT CCGGTTACTC GGTGTTCGCA GGCGTAGGCG AGCGCTCACG CGAAGGTAAC GACTTGATCC ACGAAATGAA GGATTCAAAG ATTCCTGGCA CCGACCAAAC CGTGTTCGAT AAAACGGTGA TGGTGTTCGG TCAAATGAAC GAACCACCAG GAGCACGCTT GCGGGTGGCG CTTTCAGCCT TGACCATGGC TGAATACTTC CGCGAAGAAG GCCGCGACGT ACTCTTGTTC GTCGATAACA TCTTCCGCTT TACCCAAGCA GGTTCGGAAG TATCGGCGCT CTTGGGCCGG ATGCCTTCAC AGGTGGGTTA CCAACCAACC TTGGGTACCG AGATGGGTGA ATTGCAAGAA CGCATCACTT CGACCAAGAC TGGCTCGATT ACCTCGTTGC AAGCCGTCTA CGTGCCTGCT GACGACTACA CCGACCCAGC TCCAGCGACA ACCTTTGCTC ACTTGGATGC AACGATTTCG CTGGAACGCT CGATCTCAGA AAAAGGTATC TATCCTGCGG TGGACCCACT GGCTTCAACC AGCCGGATTC TCGACCCCAA CATTGTGGGC GAAGAACACT ACCGCGTCGC GACCGAAGTT CAACGGATGT TGCAACGCTA CAAAGACTTG CAAGATATTA TTGCAATTTT GGGTGTCGAA GAATTGTCAG ACGACGACAA ATTGACAGTT TCACGCGCCC GCAAGCTCGA ACGCTTCTTC TCACAACCAT TCGGCGTGGC TGAAGTGTTT ACCAACATTC CAGGCAAGTA TGTCGCGGTT GGCGATACCG TCAAGAGCTT TGCCCGCGTC TTAGCAGGCG AGTTCGACCA CATTCCCGAA AGCTTCTTCT TTATGAAGGG TGGCATCGAC GATGTGGTAG CCGCTTACGA TGCGTCAAAG CAATAA
|
Protein sequence | MATGKILQIT GVVIDAEFPA DGLPQIYNAL EIPLGEGRSS LICEVQQQLG DSVVRAVAMS TTDGLVRGMD VIDTGAPISV PVGPETLGRV FDVQGRPIDG EGAVGTTKTM PIHRPAPTFE EQSNRAELFE TGIKVIDLIA PFTKGGKTGV FGGAGVGKTV IIQELISNIA KEQSGYSVFA GVGERSREGN DLIHEMKDSK IPGTDQTVFD KTVMVFGQMN EPPGARLRVA LSALTMAEYF REEGRDVLLF VDNIFRFTQA GSEVSALLGR MPSQVGYQPT LGTEMGELQE RITSTKTGSI TSLQAVYVPA DDYTDPAPAT TFAHLDATIS LERSISEKGI YPAVDPLAST SRILDPNIVG EEHYRVATEV QRMLQRYKDL QDIIAILGVE ELSDDDKLTV SRARKLERFF SQPFGVAEVF TNIPGKYVAV GDTVKSFARV LAGEFDHIPE SFFFMKGGID DVVAAYDASK Q
|
| |