Gene Information |
Locus tag | Haur_4793 |
Symbol | |
ID | 5736637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6110182 |
End bp | 6111711 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281958 |
Product | phosphodiesterase |
Protein accession | YP_001547552 |
Protein GI | 159901305 |
COG category | [R] General function prediction only |
COG ID | [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG; [TIGR03319] conserved hypothetical protein YmdA/YtgF |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0108516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACTTTA TCCTTCCAGC TCTGGCTTTA CTCATTGGCC TAGCTGCAGG CGCAGGCGGT GCCTATGTTT GGTTAACGAA AAAATCTGGT GTAGCACTGG CCCAAGCTGA AGCAGCAGCA CAACTACAAT TAGAACAAGC CAAGACTGAG TACAAACAAT TGCAATTAAA AGCCCGCGAA GAACAACAAC ACCTGCGGGA AGTTGGTGAA AACGAACTCC GCGAAGCCCG CTTACGCTTG CAAAAACAAG AAGAACGGGT CAATCGCAAA GAAGAAACTC TCGAACGTAA GCTCGACCAA ACCGAAAAGC GCGATCGCCA GTTAGGCCAG CGCGAGCGAG CGGTCGAAAA CTTACACAGC GAAGCCGAAC GCCTACGCAC AGCTCAACAA GCTGAGCTAG AGCGGGTCGC AGGTCTCACT AACGAACAAG CACGTGAAAT TATTCTCTCT CGTGTTGAAC AAGAAACCCG TGCCGAGGCT GCTCGCCGCA TCCACGAAAT CGAAGCGGCA GCAAAGCTCG ATGGCGATCG GATTGCACGC AAAATCGTTG GGTTGGCGAT CCAACGCTGT GCATCAGATT ATGTCGCCGA AATTACTGTA AGCACGGTAC AACTGCCCAG CGAGGAACTC AAGGGTCGGA TTATCGGGCG CGAAGGGCGC AATATTCGCG CATTCGAGCA AATTACTGGT GTCGATATTA TTGTCGATGA TACGCCCGAA GCGGTCACAC TCTCGTGTCA CGACCCAGTA CGTCGTGAAG TAGCCCGCTT AACCCTGAAC CGATTGTTGA AAGATGGTCG GATTCACCCC GCCCGGATTG AGGAAGTCGC TGATAAAACC CGCCAAGACC TTGAGCAAAT AATCCGCGAA GAAGGCGAAC GCGTGGCCTA CGAAGCCAAC GTCCAAGGCC TTCATCCAGA TCTCTTGAAG ATTCTGGGGC GGTTGAAATA TCGTACTAGC TACGGCCAAA ATGTCTTGCA GCATTCGTTA GAATGTGCCT TTTTGGCAGG AGCGATGGCA GCAGAATTGG GTGCCAATGT GCAGGTTGCC AAGACCGCAG CGCTGCTCCA TGATATTGGG AAAGCCATTG ACCACGATGT ACAAGGGCCG CACGCGTTAA TCGGGGCTGA TATTGCCCGA CGGCTTGGCC GCAACGCGGC GGTAGTTCAT GCAATTGCTG CCCATCATAA CGATGAGGAA CCTCAAACTG TTGAGGCTTT TCTTGTTCAA GCCGCCGATG CCATCTCTGG TGGTCGCCCA GGCGCACGCC GCGAGACCAT CGACTTGTAT ATTAAGCGGC TCGAAGCTCT TGAACATGTC GCCACATCGT TTCCTGGTGT TCAACGGGCA TTCGCAATTC AAGCAGGCCG GGAAGTGCGG GTTATGGTGC AACCCGATGC GCTTGATGAT CTTGAAAGTA TTCATTTGGC CCGCGAAGTG GCCAAAAAGA TCGAGAGTAC CCTGCAATAT CCAGGTCAAA TCAAAGTTAC CGTCATCCGC GAAACCCGTG CGGTTGATTA TGCACGGTAA
|
Protein sequence | MDFILPALAL LIGLAAGAGG AYVWLTKKSG VALAQAEAAA QLQLEQAKTE YKQLQLKARE EQQHLREVGE NELREARLRL QKQEERVNRK EETLERKLDQ TEKRDRQLGQ RERAVENLHS EAERLRTAQQ AELERVAGLT NEQAREIILS RVEQETRAEA ARRIHEIEAA AKLDGDRIAR KIVGLAIQRC ASDYVAEITV STVQLPSEEL KGRIIGREGR NIRAFEQITG VDIIVDDTPE AVTLSCHDPV RREVARLTLN RLLKDGRIHP ARIEEVADKT RQDLEQIIRE EGERVAYEAN VQGLHPDLLK ILGRLKYRTS YGQNVLQHSL ECAFLAGAMA AELGANVQVA KTAALLHDIG KAIDHDVQGP HALIGADIAR RLGRNAAVVH AIAAHHNDEE PQTVEAFLVQ AADAISGGRP GARRETIDLY IKRLEALEHV ATSFPGVQRA FAIQAGREVR VMVQPDALDD LESIHLAREV AKKIESTLQY PGQIKVTVIR ETRAVDYAR
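
The coordinate, length, and sequence fields in this record can be cross-checked against one another. The following is a minimal sketch, not part of the original record: the function names (`gene_length`, `protein_length`, `translate`) are hypothetical helpers introduced for illustration. It assumes 1-based inclusive coordinates, a single terminal stop codon, and that the initiating GTG codon is read as methionine, which translation table 11 permits for alternative start codons.

```python
# Illustrative consistency check for the fields of this record.
# All function names are hypothetical; they are not from any database API.

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code; it differs in
# which codons may act as start codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, excluding one stop codon."""
    return gene_len_bp // 3 - 1


def translate(dna, start_as_met=True):
    """Translate a DNA string (spaces allowed) up to the first stop codon.

    If start_as_met is True, the first residue is reported as M, reflecting
    the assumption that the first codon is an initiation codon.
    """
    dna = dna.replace(" ", "").upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    if start_as_met and residues:
        residues[0] = "M"
    return "".join(residues)


if __name__ == "__main__":
    # Start bp and End bp from the record should give the listed Gene Length.
    print(gene_length(6110182, 6111711))
    # Gene Length should give the listed Protein Length.
    print(protein_length(1530))
    # The first 30 bases of the gene sequence should give the first
    # 10 residues of the protein sequence.
    print(translate("GTGGACTTTA TCCTTCCAGC TCTGGCTTTA"))
```

Under these assumptions the three values agree with the Gene Length, Protein Length, and the opening residues of the Protein sequence listed above.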