Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3188 |
Symbol | |
ID | 5735063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4033255 |
End bp | 4034211 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641280334 |
Product | ROK family protein |
Protein accession | YP_001545953 |
Protein GI | 159899706 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | [TIGR00744] ROK family protein (putative glucokinase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00496766 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATTTG CAATCGGGAT TGATCTCGGG GGCACGCATT TACGTGCGGC CTTAGTTGAC CGAGATGGTG AAATTCTTGC TCATGAACGG ATTCGCACCG AAGCGCATGA AGGTGCTGAA GCAGTTGTTG GTCGGATTAC TCAATTAATT AACGCCATGA TCGCTGCGGC GAATGGTGCA ACAATTGTCG GCGCTGGCAT CGCCGCACCT GGCCCACTCA ACCCCTTTAC CGGTACGGTC ATTACCATGC CCAACTTGCC AGGTTGGGAG AACTTCCCCA TCCGCGATCG AATCGCCGCC CAAGTTCCGT TTCCAGTCGT GCTCGGCAAT GATGCCAATT TGGCTGCTGT CGGCGAATGG TTATTCGGCG GTGGTCGTGG CATGCAAAAT ATGATTTACG TCACGATCAG CACGGGCGTT GGTGGTGGGG TCATTTGTGA TGGTCGGTTG TTGCTCGGTC ACAATGGCTT TGCCGCCGAA GTTGGTCACA TGGTGCTCGA CCCGCATGGC TTTGCACCCG CCACGGCCAC CCCAGCTGGT TCGTGGGAAG CGCTCGCATC AGGCACATTT TTGGCCTACC ACGCTGCCGA AGCGATGCGA GCAGGCACTG CCACCGTACT TAATCAATTA ACCACTCCCG ATGCCGTCAC CACCCATCAT TTAGATCTGG CGGCGCAACA AGGCGATGAG TTGGCAATTC GCTTAATCGA AAATGCTGGC TTTTGGTGTG GGATTGCCTT CGTCAATTTG CTGCATATGT TCAGCCCTGA AGCGATTTTC GTGGGCGGCG GGGTTTCCAA CTTAGGCGAT CGTTTGCTCA ACCCAGCTCG CGCCGAAATT ACCAAACGCG CCTTGCCCGG CTATCGCAAT GTGCCAATTC ATCAAACCAA GATGGGCGAT AATCTAGGGG TGCTTGGCGC TGCTGCCTAT GCATTTAGCT CAATCCAACA AGCCTAA
|
Protein sequence | MAFAIGIDLG GTHLRAALVD RDGEILAHER IRTEAHEGAE AVVGRITQLI NAMIAAANGA TIVGAGIAAP GPLNPFTGTV ITMPNLPGWE NFPIRDRIAA QVPFPVVLGN DANLAAVGEW LFGGGRGMQN MIYVTISTGV GGGVICDGRL LLGHNGFAAE VGHMVLDPHG FAPATATPAG SWEALASGTF LAYHAAEAMR AGTATVLNQL TTPDAVTTHH LDLAAQQGDE LAIRLIENAG FWCGIAFVNL LHMFSPEAIF VGGGVSNLGD RLLNPARAEI TKRALPGYRN VPIHQTKMGD NLGVLGAAAY AFSSIQQA
|
| |