Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0301 |
Symbol | |
ID | 4711211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 336617 |
End bp | 338245 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639854761 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_001001897 |
Protein GI | 121997110 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2206] HD-GYP domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACGGAC TGAACCCACC GCCGCTGACT GAGCAGCGCC TGGCGCAGCT GCTGGAGGTC GGTGTCGCCC TCTCGGCGGA GCAGGACACC GATCGCCTAC TCGAACGCAT CCTCTCCGGT GCGCGCGAGC TGACGGAGGC GGATGCCGGG ACCATCTACC GGGTCCACGA GGGGCAGCTG CACTTCGACA CCGTCCACAA CGACACTCTC GGTCTCCACC TGGGCGGCAC GTCCGGGGTG CCCATCGACT TCTCGCCCCT CCCCCTCTAC CTGGCCGACG ACACGCCGAA CACCCGCAAC GTGGCCGCCT ACGCGGCGGT CTCCGGCGAA ACCGTGCGCA TCGACGATGC CTACCACGCC GAGGGGTTCG ACTTCTCGGG CACCCGCGCG GTGGACGCGC GCACCGGATA CCGCTCGCAG TCCTTCCTCA CTGTCCCGCT GCGCGACCAC GAACAGACCA TCATCGGTGT CCTGCAACTG ATCAACGCCG TGCGCGAGGG ACAGACGGTC CCCTTCACCG GCCGCGATGC CCGGATTGCC GAGTCATTGG CCTCCCAGGC GGCCATTGCC CTGACCAAAC AAGGCCTGAT CGACACCCAA CGGCAGCTGT TCGAGTCGTT CACCGAGGTC CTCGCCCGGG CCATCGACCG CAAGAACCCG ACCACCGGCC GCCACTGCGA GCGCGTCCCG CAGCTGACGC TGATGATCGC CGACGCCGCC TGCCGCACCG GCGAGGGGGC GCTGGCCGAT TACCGCCTGA GCGCCGAGGA ACACGAAGAG CTGCGCCTGG CCGCCTGGCT CCACGACTGC GGCAAGGTCA CCACCCCCGA GGCGGTGGTG AACAAGGCCA CCAAGCTGGA ACGCCAGACC GATCGCATCG AAGAGGTCGC GACCCGTGCG GCCGTGGTCC GCCAGGAGGC GGAGGCCGAA CGACTGCGCC GCCGGCTGGC CGCCCGCGAG CAGGGCGTGG ACCGGGACGC CGAGCAGGCC GATGCCGACT ACCGCGCCCT CGTCCAGCGC CTCGACGACG ACCTGGCCTT CCTGCGCAGC GCCAACATCG GCGGCGAGGC CATGGACGAC GAGGCCTGCG AACGGGTCGC CGCCATCGCC CGCACCTACA GCTGGACGGA CGCCGCGGGC GAGCGGCGCC CGCTACTCAG CGCCGAAGAG GTGGAGCACC TGCAGATCCG CCGCGGCACC CTGAGCGCCG CGGAGATGGA GCAGATGCGC GACCACGTGC GGGTCAGCCG CGAGATGCTC GAGCAGCTCA CCTACCCGCG CCACCTGCAG CGGGTGCCGG AGATCGCCTC GCAGCACCAT GAGCGCATGG ACGGCGGCGG CTACCCCGAC GGCATCACCG GCGGGCAGAT GTGCCGGCGG GCCCGAATGA TGGCGCTGGC CGACGTCTTC GAGGCCCTGA CCGCCGCCGA CCGCCCCTAC AAGCCAGGCA AGAAGCTCAG CGAAGCGGTG CGCATCATGG GCTTCATGAC CCAGGACGGC CACTTCGACC CGGAGCTGTT CGACCTCTTC ATCCGTGAAG GGGTCTACCT GGACTACGCC CGCCAGTTCA TGCACCCGAG CGCCATCGAC GCGGTGGACG AGACCGCCAT CCCCGGCTAC CGGCCGTAG
|
Protein sequence | MDGLNPPPLT EQRLAQLLEV GVALSAEQDT DRLLERILSG ARELTEADAG TIYRVHEGQL HFDTVHNDTL GLHLGGTSGV PIDFSPLPLY LADDTPNTRN VAAYAAVSGE TVRIDDAYHA EGFDFSGTRA VDARTGYRSQ SFLTVPLRDH EQTIIGVLQL INAVREGQTV PFTGRDARIA ESLASQAAIA LTKQGLIDTQ RQLFESFTEV LARAIDRKNP TTGRHCERVP QLTLMIADAA CRTGEGALAD YRLSAEEHEE LRLAAWLHDC GKVTTPEAVV NKATKLERQT DRIEEVATRA AVVRQEAEAE RLRRRLAARE QGVDRDAEQA DADYRALVQR LDDDLAFLRS ANIGGEAMDD EACERVAAIA RTYSWTDAAG ERRPLLSAEE VEHLQIRRGT LSAAEMEQMR DHVRVSREML EQLTYPRHLQ RVPEIASQHH ERMDGGGYPD GITGGQMCRR ARMMALADVF EALTAADRPY KPGKKLSEAV RIMGFMTQDG HFDPELFDLF IREGVYLDYA RQFMHPSAID AVDETAIPGY RP
|
| |