Gene Information |
Locus tag | Haur_4212 |
Symbol | |
ID | 5736924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5366647 |
End bp | 5368293 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281367 |
Product | protein serine/threonine phosphatase |
Protein accession | YP_001546972 |
Protein GI | 159900725 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0631] Serine/threonine protein phosphatase |
TIGRFAM ID | |
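
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5366647, 5368293)
print(length)                  # 1647
print(protein_length(length))  # 548
```

Under these assumptions the computed values agree with the Gene Length (1647 bp) and Protein Length (548 aa) fields.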
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.685502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTGAGC CGACTCAACC GATGACTGGT GGAACCCAAC CTCTCAACCC CGTCAACCAG CCAGAGGAAC ACGAACCCTT TGCGATTGGC ACTGTGTTGA AGGATATTTA TCGGGTAACC GCCTTGCTGA CCGATACTCC AACCTTGCGC GTCTATCGCG TGGCGCTGTT GGAGCCATGG GATCATTGCG CTCGTTGTGG CGCAGCCTTA CAAGCGAGCG ATCAGTTTTG CGAAGAGTGC GGGGCGCAGG TCGAAGAACA AACTGCTCTG TTACAAGAAA CTCCGGCGGC CCAGCCAATT GGCGCAGCTT TGCTTGATGA TTTACCCGAT GACCCCGCCC GCGCTGCCTT GCCCACCGTG CGCGAGGTCT TTGTGGGCGA AGATTCACGG TTTGCGGTGC TGCCCGATGG TACGAGCTTA GTGCGTTTCG ACACGTTGCT GAGCGAACCA AATACCTTTG TTGATCAAAC TGATGCTGTT GATATTGGAA TTCAAGTAGC CCGCGCCTTA GCCTATTTGC ATCGCCACGG GTTGGCGCTA GGCCAATTAA CCTTAGCTGA TTTGGCTTTG ACCAACAAGC GCGAAATTAA ACTGGCTGAT GCTGGCGCGA TTCGCCGTTC GTTGGGCAAA GAAGATCAAC TTGATGATGT TGAGCATTTA GGTTTGGTGC TAGAAAAAAT GGCGGGAATT CAGCGCCAAA CTCGCCGCCT TGATGATTCG AATAATCCTT CGCCGCTCGA TAGCGCTTTT GCCACAATTT TGAGTGATCT CCGCGCCAAG CGCATCACCG ATGCTAGCAT TTTGGCCCAA ACCCTCGAAA CCCTGCTGGC CGAACAAGCT ACGCCGATCA GTTTGCGAGT ACGGACTGGC TATGCTACCG ATGTTGGCAT GATTCGCGAT CATAACGAAG ATAGCGTGCT GACCTGGGAT TTACGCCTGA ACTGGGATGC CAAGCCAGTC AACGTTGGTC TGTATGTAGT GGCTGATGGT ATGGGTGGTC ACGAAGGCGG CGAGGTTGCT AGCGGTTTGG CGATCACGAC TACTGCTCAA ACCCTCGTGC CAACCTTGCT TGATCCGCAG TTACATGCTG GGCCAGTTTC GAGCAAACAC CTCGCCGAAT TGGTCAAGCA AGCAGCATTT CAAGCCAACC AAGCGGTTTA CGAAGAAAGC GTGCGCCGCA AAAACGATAT GGGTACGACC CTGACCATGG CGGTGGTTAT CGGCGATCGG GCGATTGTTG GCAACGTTGG CGATAGTCGG ACTTACCTTT ATCGCGATGG CAAATTGCAG CGCATCAGCA AAGATCACTC GTTAGTCCAG CGCCTAATCG ATATTGGCCA ACTTGATCCT GATGATATTT ACACCCACCC CCAACGCAAC GCCATTCTCA AATCGCTTGG CGATAGCGGC GACCCTGGCA CCGACACGTT CGAGGTGCAA TTACAGCCTA ACGATGCGCT ATTTCTCTGC TCTGACGGCA TGTGGGAAAT GGTGCGCGAC CCCAAAATGG CGGCACTTTT CGCTGAACAT GCCAACCCCG CCGATCTCTG CGATGCCTTG ATTGAGGCTG GTAATGCTGG TGGCGGCGAA GATAATATCA GCGTGGTGGT GGTGCGTTTT GATGCCCTTC CAATAGTTCA ACACTAA
|
Protein sequence | MSEPTQPMTG GTQPLNPVNQ PEEHEPFAIG TVLKDIYRVT ALLTDTPTLR VYRVALLEPW DHCARCGAAL QASDQFCEEC GAQVEEQTAL LQETPAAQPI GAALLDDLPD DPARAALPTV REVFVGEDSR FAVLPDGTSL VRFDTLLSEP NTFVDQTDAV DIGIQVARAL AYLHRHGLAL GQLTLADLAL TNKREIKLAD AGAIRRSLGK EDQLDDVEHL GLVLEKMAGI QRQTRRLDDS NNPSPLDSAF ATILSDLRAK RITDASILAQ TLETLLAEQA TPISLRVRTG YATDVGMIRD HNEDSVLTWD LRLNWDAKPV NVGLYVVADG MGGHEGGEVA SGLAITTTAQ TLVPTLLDPQ LHAGPVSSKH LAELVKQAAF QANQAVYEES VRRKNDMGTT LTMAVVIGDR AIVGNVGDSR TYLYRDGKLQ RISKDHSLVQ RLIDIGQLDP DDIYTHPQRN AILKSLGDSG DPGTDTFEVQ LQPNDALFLC SDGMWEMVRD PKMAALFAEH ANPADLCDAL IEAGNAGGGE DNISVVVVRF DALPIVQH
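
Both sequence fields are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and sanity-checked: it computes the GC percentage and confirms that the length is a multiple of three and ends in a stop codon. The helper names are hypothetical, and the demo string is a shortened toy input, not the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def join_blocks(grouped: str) -> str:
    """Remove the whitespace that separates the ten-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input in the same layout; paste the full "Gene sequence" field to check the record.
demo = join_blocks("GTGTCTGAGC CGACTCAACA CTAA")
print(looks_like_complete_cds(demo))  # True
print(gc_percent(demo))               # 50.0
```

Note that the gene begins with GTG while the protein begins with M: under translation table 11, GTG can serve as an initiation codon and is then read as methionine.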