Gene Haur_1181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1181 
Symbol 
ID5733074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1355675 
End bp1356685 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content53% 
IMG OID641278321 
ProductROK family protein 
Protein accessionYP_001543957 
Protein GI159897710 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATCAAC GACAGCGCCG GAACGCCAAT GGCTGGCGGC TGGGAGTCGA TGTAGGGGGG 
ACGAAAATTG CCACCCTACT TGTCGATGGC GATAATCGAG TTTGGGCCAG CGTTCAACGC
CCAACCGATA CCGCAACTCC AGACCATGCC CTCAAGTCTC TTGCTGATGC GATTTGGGAA
ACCTTGAAAC AAGCCGACCT ACCCATTCGG CTCCTTGCGG GCATTGGGAT TGGCATTCCA
GGCCAAGTCG ATACGCAGAG CGGGATTGTT CGGCACGCCG TCAATCTTGG CTGGCAAGCA
GTTGATCTAC GTGGATTCAT CAACGCAACC TTTGGCACGG CCTGTGTGAT TGAAAACGAT
GTTCGAGCAG CAGCATTAGG CATTCAACGC TATTGGTTGG CTGGTTCGAT CGATTCGATG
TTATATGTTA GTATAGGCAC TGGCATAGCC GCTGGCATGA TTCTTGATGG TACTGTCTAC
CGTGGTAGTC ACGGAATGGC GGGCGAAATC GGCCATGCAC GCTTCGGATC ATCAACAATT
CGCTGTCGCT GTGGCAATTA TGGCTGTCTT GAAGCCATCG TTGCTGGGCC AGCAATTGCC
AACTATGCAC ACTCTCTGCT CTCAACATTC CCGCATAGCC AACTGCATCA ACTCGATTCG
ATAACCACTC CAGCAGTTTA CGCCGCTGCT GAAGCTGGCG ATGATTTAGC GTTGGCAGTT
GCCCACATGG TTGGCGAACA ACTTGCTCAA GCCCTCTATA CCATGGTGCT TGCCTACGAT
TGCGATCATA TTGTGCTTGG AGGCGGTGTT AGCCGCGCAG GCTCAGCCTT CTTCGCACCA
ATCGAACAAG CACTTGATGT CTTACGTCAG CAAAGTTCTC TAGCAACATC ATTACTTCCG
ACGGGGCGGG TTAAGCTCTT AGATCGTGAT TTTGCTGCTG GTGCATGGGG CGGAATCGCC
TTGCTCGATA GCCAAGCGTT GGCGCGGGTT GCGCAAACGC AACTGGCATA A
 
Protein sequence
MDQRQRRNAN GWRLGVDVGG TKIATLLVDG DNRVWASVQR PTDTATPDHA LKSLADAIWE 
TLKQADLPIR LLAGIGIGIP GQVDTQSGIV RHAVNLGWQA VDLRGFINAT FGTACVIEND
VRAAALGIQR YWLAGSIDSM LYVSIGTGIA AGMILDGTVY RGSHGMAGEI GHARFGSSTI
RCRCGNYGCL EAIVAGPAIA NYAHSLLSTF PHSQLHQLDS ITTPAVYAAA EAGDDLALAV
AHMVGEQLAQ ALYTMVLAYD CDHIVLGGGV SRAGSAFFAP IEQALDVLRQ QSSLATSLLP
TGRVKLLDRD FAAGAWGGIA LLDSQALARV AQTQLA