Gene Haur_3188 details


Gene Information

Locus tag: Haur_3188
Symbol:
ID: 5735063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4033255
End bp: 4034211
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 55%
IMG OID: 641280334
Product: ROK family protein
Protein accession: YP_001545953
Protein GI: 159899706
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID: [TIGR00744] ROK family protein (putative glucokinase)
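
The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal illustration, not part of the record: the function names are hypothetical, and the sketch assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_length // 3 - 1


length = gene_length_bp(4033255, 4034211)
print(length, protein_length_aa(length))  # prints: 957 318
```

Under these assumptions the start and end positions reproduce the listed gene length of 957 bp and protein length of 318 aa.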


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00496766
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATTTG CAATCGGGAT TGATCTCGGG GGCACGCATT TACGTGCGGC CTTAGTTGAC 
CGAGATGGTG AAATTCTTGC TCATGAACGG ATTCGCACCG AAGCGCATGA AGGTGCTGAA
GCAGTTGTTG GTCGGATTAC TCAATTAATT AACGCCATGA TCGCTGCGGC GAATGGTGCA
ACAATTGTCG GCGCTGGCAT CGCCGCACCT GGCCCACTCA ACCCCTTTAC CGGTACGGTC
ATTACCATGC CCAACTTGCC AGGTTGGGAG AACTTCCCCA TCCGCGATCG AATCGCCGCC
CAAGTTCCGT TTCCAGTCGT GCTCGGCAAT GATGCCAATT TGGCTGCTGT CGGCGAATGG
TTATTCGGCG GTGGTCGTGG CATGCAAAAT ATGATTTACG TCACGATCAG CACGGGCGTT
GGTGGTGGGG TCATTTGTGA TGGTCGGTTG TTGCTCGGTC ACAATGGCTT TGCCGCCGAA
GTTGGTCACA TGGTGCTCGA CCCGCATGGC TTTGCACCCG CCACGGCCAC CCCAGCTGGT
TCGTGGGAAG CGCTCGCATC AGGCACATTT TTGGCCTACC ACGCTGCCGA AGCGATGCGA
GCAGGCACTG CCACCGTACT TAATCAATTA ACCACTCCCG ATGCCGTCAC CACCCATCAT
TTAGATCTGG CGGCGCAACA AGGCGATGAG TTGGCAATTC GCTTAATCGA AAATGCTGGC
TTTTGGTGTG GGATTGCCTT CGTCAATTTG CTGCATATGT TCAGCCCTGA AGCGATTTTC
GTGGGCGGCG GGGTTTCCAA CTTAGGCGAT CGTTTGCTCA ACCCAGCTCG CGCCGAAATT
ACCAAACGCG CCTTGCCCGG CTATCGCAAT GTGCCAATTC ATCAAACCAA GATGGGCGAT
AATCTAGGGG TGCTTGGCGC TGCTGCCTAT GCATTTAGCT CAATCCAACA AGCCTAA
 
Protein sequence
MAFAIGIDLG GTHLRAALVD RDGEILAHER IRTEAHEGAE AVVGRITQLI NAMIAAANGA 
TIVGAGIAAP GPLNPFTGTV ITMPNLPGWE NFPIRDRIAA QVPFPVVLGN DANLAAVGEW
LFGGGRGMQN MIYVTISTGV GGGVICDGRL LLGHNGFAAE VGHMVLDPHG FAPATATPAG
SWEALASGTF LAYHAAEAMR AGTATVLNQL TTPDAVTTHH LDLAAQQGDE LAIRLIENAG
FWCGIAFVNL LHMFSPEAIF VGGGVSNLGD RLLNPARAEI TKRALPGYRN VPIHQTKMGD
NLGVLGAAAY AFSSIQQA
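
The gene and protein sequences above can be cross-checked with a short script. The following is a minimal sketch rather than the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their display spacing.
fragment = "ATGGCATTTG CAATCGGGAT TGATCTCGGG"
print(translate(fragment))    # prints: MAFAIGIDLG
print(gc_fraction(fragment))  # prints: 0.5
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would allow the listed protein and GC content to be compared against the record.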