Gene Information |
Locus tag | Haur_0056 |
Symbol | |
ID | 5731928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 71377 |
End bp | 72540 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277177 |
Product | PGAP1 family protein |
Protein accession | YP_001542836 |
Protein GI | 159896589 |
COG category | [R] General function prediction only |
COG ID | [COG1075] Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold |
TIGRFAM ID | |
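The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the function name `expected_lengths` is hypothetical and not part of any database API.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene span that ends
    with a stop codon (assumptions, not stated by the record itself).
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon, so it adds no amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_len, prot_len = expected_lengths(71377, 72540)
print(gene_len, prot_len)  # prints "1164 387"
```

Under these assumptions the result agrees with the Gene Length (1164 bp) and Protein Length (387 aa) fields listed in the record.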
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCCCAAAG CCCATTTGTA TGGTCGTGAT CTTCAAGGCA TCAGCCGCTT GGCGATCGAG GCGACAACCC AAGTGACCAA CATCAGCGAA GGCTTGTTTG CCACGATTAT GCGCCGCCCC AATCGCCGCA TTGGTGGCAT TCCAGGCTTG GTCTATCGCC AAATTCATGG CATTACGCGC TTGGTTGGCT GGAGCCTTGA TCAATTGTTT GGGGCACTCG CTCCGGCCAA CTCGCAGCGA GTCTCATCAC GCCAACGCAA CGATCTGATT GCCGCCTTGA ATGGAGTTAT GGGCGATTAT TTGGTTGCCA GCAATAATCC ATTGCAAACC AGCATGAGCT TGCGGGTAGC CGGAGCCGAA CTTGAGCCAG CGGCCATTGC CGCACCTCAA TCACGCATAT TGCTCTACAT TCATGGGGTT TGTATGCACG AGCAACAACT CCAGCAGCAG GGCCATGATC ATGGTTTAGC TGCCGCCGCA ACGCTTGGTT ATAGCCCAAT TCATGTGCGC TACAACACTG GCTTGCATAT CGCCGAGAAT GGTCAGCAAC TCAGCCAATT GCTCGAACAA CTGCTCGCCG ATTGGCCTGT ACCCATCGAA GAATTGGTGA TTCTCGGCTA TAGTATGGGC GGGCTTGTCG CACGCAGCGC TTGTTATTAT GCCGAGCAGT TGGGCCATAG CTGGAGCCAA CGGTTAACTA AATTGGTTTT TTTGGCCACG CCGCACCATG GCTCAGCCTT TGAACGTTAT GGTGCTTGGT TGCATTATTT GCTGGAGCGC AATCGCTATA GCGCCTTATT TGGCCAAATT GCCCGTTTGC GCAGCGCTGG CATCACCGAT TTACGCTATG GCACGATTTT GCCGCACGCG CAGCCGACGT TTGATCGCTT TGAACAGCCT GCTTATCGCC CCGATTTAAT GCCACTACCC GCCCATGTGG CCTGTTATGC CTTAGCGGCA ACCCGTTCAG AACCGCAAAG CCTCAGCAAT CTGCGGGGCG ATGGTTTGGT GGCAATTGCC AGCGCGTTGG GCCACGATTC AGAGCAGCCA AGCCATAATC TTCACATTCC TACGACCAAC CAAGCGATTG TGTATGCAAC GAATCATTTT GGCATGATCG CCAATCAAGC GGCCTATCAA CAGATTTTGG CGTGGTTGCA ATAG
Protein sequence | MPKAHLYGRD LQGISRLAIE ATTQVTNISE GLFATIMRRP NRRIGGIPGL VYRQIHGITR LVGWSLDQLF GALAPANSQR VSSRQRNDLI AALNGVMGDY LVASNNPLQT SMSLRVAGAE LEPAAIAAPQ SRILLYIHGV CMHEQQLQQQ GHDHGLAAAA TLGYSPIHVR YNTGLHIAEN GQQLSQLLEQ LLADWPVPIE ELVILGYSMG GLVARSACYY AEQLGHSWSQ RLTKLVFLAT PHHGSAFERY GAWLHYLLER NRYSALFGQI ARLRSAGITD LRYGTILPHA QPTFDRFEQP AYRPDLMPLP AHVACYALAA TRSEPQSLSN LRGDGLVAIA SALGHDSEQP SHNLHIPTTN QAIVYATNHF GMIANQAAYQ QILAWLQ