Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1117 |
Symbol | |
ID | 4710067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1212585 |
End bp | 1213748 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639855589 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_001002695 |
Protein GI | 121997908 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.704778 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAGAC CCAGCTACGA GGGCAGCGGC CTGGTCAACC TGATGGCGTC CCTGGGCCGT GCCTTCGGGG CGGCGTCCAG CCACTATCCG GCCCTGGATC CTGAGCCGGA GCTGGGGCTG GAGGAGGCGC GCACGGTCAT CCTCTGGATC ATGGATGGCC TCGGCGATCA CTACCTGGCC AGGCAGCCGG GCAGCAGCCT GGCGCGGGAT CGGGTGCGGG TGCTGACCTC GGTCTTCCCC GCCACCACCT CGGCGGCCTT GACCAGCATC ATTACCGGGC GTCCGCCGCG GGGGCACGGG GTCACCGGCT GGTTTATGTA CGTCCACGAG CTGGGGGCGG TCACCGCCTG GCTCCCCTTC GGTCCGCGGG TGGGCAAGGG GCAGTGGTCC AGCATCGAGC CGGAGAGCGC CGAGCTGCTG CAGCGCGACC CGATCTGGGA TCGGTTTCAG GCCGAGACGC ACGTCGTTCA ACCCTCCTGG CTGGTCGACA CGCCGTATAG CCGGGCGGTC ACCGGGCGCT ATGCCCGCCG GCACGGCTAT CAGGGGTTGG ACGAGCTGCG CGAGGTGCTG GTGCGCATCG CCCGCGAGCC CGGTCGGCAG CGACGGTTCG TCTACGCCTA CTGGCCGGAC CTGGATACGC TGAGCCACCA GCACGGTGTC GACAGTGCAG CGGTGCGCGA CCAGTTCCGC TCCATCGACA TCGCCTGGCA GCGGCTGCTC GATGGCCTTC AGGGCACCGA CACCGTGATC CTCGGCACCG CCGACCACGG CCTGATCGAT ACTGCCCCCG AGCGGACCCT CTATCTGGGG GACCATCCGG AGTTGGCCGA GATGCTGGCC CTGCCGCTGT GCGGCGAGCC CCGGGCGGCC TACTGCTACC TGCGTCCGGG CACCGAACTC GACTTCCAGT CCTATTGCCG CGAGCGCCTG GGTACGGTCT GCCAGGTCGC CCGCTCCGAG GAGCTCCTGG CGGCGGGTTG GTTCGGGCCC ATGCCGGAAC ACCCGAAGCT GCGCCGGCGG ATCGGTGATT GGGTGCTGCT GCCGGCCGAT GGCTGGGTGA TCAAGGACCG GCTGGTGGGC GAGGGGCGCT TCGCCCAGGT GGGGGTGCAC GGCGGGGCGT CGGCGAGCGA GCAGTGGGTG CCGCTGATTG CCGCACGGCC GTGA
|
Protein sequence | MDRPSYEGSG LVNLMASLGR AFGAASSHYP ALDPEPELGL EEARTVILWI MDGLGDHYLA RQPGSSLARD RVRVLTSVFP ATTSAALTSI ITGRPPRGHG VTGWFMYVHE LGAVTAWLPF GPRVGKGQWS SIEPESAELL QRDPIWDRFQ AETHVVQPSW LVDTPYSRAV TGRYARRHGY QGLDELREVL VRIAREPGRQ RRFVYAYWPD LDTLSHQHGV DSAAVRDQFR SIDIAWQRLL DGLQGTDTVI LGTADHGLID TAPERTLYLG DHPELAEMLA LPLCGEPRAA YCYLRPGTEL DFQSYCRERL GTVCQVARSE ELLAAGWFGP MPEHPKLRRR IGDWVLLPAD GWVIKDRLVG EGRFAQVGVH GGASASEQWV PLIAARP
|
| |