Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3056 |
Symbol | |
ID | 5734928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3861139 |
End bp | 3862602 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280200 |
Product | PGAP1 family protein |
Protein accession | YP_001545822 |
Protein GI | 159899575 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAACA ACCAAGTGCG GCTTGATGGC AGCAGCCAAG CCCATGAAGT TGCCCCAAAT TATCGGATTA TTGCCCAAGG TATCACGGGC AATGCCGCCG TACCAAGCCA AGAGCCACGG CGCGGCGAGG CTGGCGCTGG CGTGATCAAA GGTTTGGATT TAGCATTAAT CACTACCGAA AGTAAATTAG CTCAAACCTT CACCCTCGAT CTTGAGAGCA GCCTGCCACC CCCACCCGGT GCAATGCGCG GCAGCAGCGA TTTTGTGCTC CAAACCCCCG ATTTTGGCAC TAATACCGCG CAAGCGGTGC TTTATACCGA TCAAGCGACT GGCTATAGTC AATGGATTTT TCCTGAGCCA AGCCCAAATC CAGCGACGCA ACGCCGAGGC GGAGCCAGCG TTACCTATCG CCTGCCCCGC GAACCAATCG TGCCACTCCC ACCAGCACCA GGCGAAGATG CCCGCCGTGG CGGCGTGATC AAAGCGATTC GCAAGGTTGT ACGGGTGATT GCTTGGAAAA CCGATGAATT GATTGGCAAC ACGCTTGAGG CGATTATTAG CAAGTGGGAA TCGGTCAAAC GGGCTTATGG CTGGCAAAAC TTACCGCTAA ATGATACCAA TCCGGTTGAT TGGGCACGGA TACAAACTGG CCGCAGTTTA TTATTAATTC ATGGCACATT TAGCAGCGCA ACCGGAGCAT TTGCCGCCTT GCCCAACGAA ACCATTGGGC GCTTGCAGCA AATCTATAAT GGGCGCTTAT GCGCCTTCAA TCACCCTTCG CTGAGCGTCT CGCCGCAAGC CAATATCGAC GAATTACTCA AAACCATGCC AAATAACCTC GAAATCGATG TAGTTACCCA CAGTCGCGGC GGCTTGGTTG GGCGCGAATT GCTGAGTCGG GTACAGCAGG GGATTCCGTT GCGGGTGCGC AAAATGATCA TGGTGGCTTG CCCCAACCGT GGCACGCCCT TGGCTGATGG TGAACATTGG CTCACCATGC TCGATCGTTA TACCTCGCTG TTGGCCGATT TGCCCGATAC TAGTGCCAGC GTGATTATGG AAGGCATTTT GACGGTTGTA AAATTAATTG GCCATGCTGG CTTGCGCAAA TTACCAGGCC TCGCCGCGAT GACTCCCAAA AACAGCTATT TACAACAACT CAATGCTGGT CAGCCTGGTC AAACCCAAGT TTATGCAATG GTCGCCGATT ACCAACCCAA CGACCCTAAT TGGATCAAAC GTTTTATGCG CCAACAGGGC AATAAATTGA TCGACCATTT CTTTGGCGAG GCCAATGATG GAGTTGTGCC AACAGCGGGG GGCTATCAAG GCAACAGCGA TGGCAGCGGC TGGTCAATCC CCAGCGAGCG CCGCATCCTA TTTGATACCG ACAAACGGAT CAATCACAGC AGTTTCTTTG GCAATCAAGC GGTTAACCAG CAACTAGTCG AATGGCTTCG ATAA
|
Protein sequence | MPNNQVRLDG SSQAHEVAPN YRIIAQGITG NAAVPSQEPR RGEAGAGVIK GLDLALITTE SKLAQTFTLD LESSLPPPPG AMRGSSDFVL QTPDFGTNTA QAVLYTDQAT GYSQWIFPEP SPNPATQRRG GASVTYRLPR EPIVPLPPAP GEDARRGGVI KAIRKVVRVI AWKTDELIGN TLEAIISKWE SVKRAYGWQN LPLNDTNPVD WARIQTGRSL LLIHGTFSSA TGAFAALPNE TIGRLQQIYN GRLCAFNHPS LSVSPQANID ELLKTMPNNL EIDVVTHSRG GLVGRELLSR VQQGIPLRVR KMIMVACPNR GTPLADGEHW LTMLDRYTSL LADLPDTSAS VIMEGILTVV KLIGHAGLRK LPGLAAMTPK NSYLQQLNAG QPGQTQVYAM VADYQPNDPN WIKRFMRQQG NKLIDHFFGE ANDGVVPTAG GYQGNSDGSG WSIPSERRIL FDTDKRINHS SFFGNQAVNQ QLVEWLR
|
| |