Gene Information |
Locus tag | PCC7424_5215 |
Symbol | |
ID | 7109547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | - |
Start bp | 5790271 |
End bp | 5791017 |
Gene Length | 747 bp |
Protein Length | 248 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643483422 |
Product | HAD-superfamily hydrolase, subfamily IA, variant 3 |
Protein accession | YP_002380431 |
Protein GI | 218442102 |
COG category | [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
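The coordinate and length fields in the Gene Information table are mutually consistent, and the arithmetic can be checked directly. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon (both assumptions fit the values listed, but are not stated in the record). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS, assuming one stop codon at the end."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(5790271, 5791017)
print(length)                  # 747
print(protein_length(length))  # 248
```

Because the gene is on the minus strand, the listed start and end still describe the span on the forward strand of the replicon; the length calculation is unaffected by strand.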
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.00121216 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAAAAT TACAGGCTCT AATTTTTGAT GTTGATGGAA CATTAGCAGA AACAGAACGA GATGGTCATC GTTTGGCCTT TAATCAAGCC TTTAATCAAG CTCAATTAAC CTGGGATTGG TCGGTGTCAA TTTATGGTCA ATTACTAACA GTAGCCGGAG GAAAAGAACG GATTCGATTT TATTTAGAGC AATACAATCC TCAGTTTGAA AAACCAACTA ATTTAGCTCA ATTTATTACC CAATTACATC AGAGCAAAAC AGAATTTTAT CAAGAGTTGT TAAGTCAGGG AGAAATTCCT TTACGTCCAG GGGTAAAACG ATTAATTGAA GAAGCTCGTA GTCAAGGGAT AAGAATAGCG ATCGCCACAA CGAGCGCCTT ACCGAATGTA TTAGCCTTAT TAGAGCGTAC CCTTGATCCA ACGTGGTTTG AAGTCATTGC GGCGGGGGAT ATTGTGCCGG CGAAAAAACC CGCTCCAGAT ATTTATAACT ATGTTTTAGA TAAATTAGGA TTAACGCCAT CAGAGTGTTT AGTTTTTGAA GACTCTTTTC ATGGATTACA AGCGGCAACA AAAGCGGGAT TAAAAACGAT AGTAACTGTT AACGATTATA CTAAAAACCA GGATTTTAGT GAGGCTATTT TAGTGTTAGA TCATTTAGGT GAGTCTGATT TACCCTCTAC TGTAATTCGG GGAGATTTAA GAAATCATCC CTATGTTGAT TTAGCTTTAT TAGAAAAGTT AGTCTGA
|
Protein sequence | MAKLQALIFD VDGTLAETER DGHRLAFNQA FNQAQLTWDW SVSIYGQLLT VAGGKERIRF YLEQYNPQFE KPTNLAQFIT QLHQSKTEFY QELLSQGEIP LRPGVKRLIE EARSQGIRIA IATTSALPNV LALLERTLDP TWFEVIAAGD IVPAKKPAPD IYNYVLDKLG LTPSECLVFE DSFHGLQAAT KAGLKTIVTV NDYTKNQDFS EAILVLDHLG ESDLPSTVIR GDLRNHPYVD LALLEKLV
|
| |
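The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch only, not the method used to generate the record. It assumes the sequence is given 5' to 3' in the coding orientation (as it appears above, starting with ATG) and that whitespace between the 10-letter blocks is purely cosmetic. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as initiators, so a plain codon lookup is adequate for a gene that starts with ATG. The names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Compact codon table: bases ordered T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = "ATGGCAAAAT TACAGGCTCT AATTTTTGAT"
print(translate(prefix))  # MAKLQALIFD
```

Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein sequence fields to be compared against the values in the record.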