Gene Haur_1182 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1182 
Symbol 
ID5733075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1356745 
End bp1357905 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content53% 
IMG OID641278322 
ProductROK family protein 
Protein accessionYP_001543958 
Protein GI159897711 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGGAC TGCCCAAGAA AGCCAGTCGG GAGCAAAGCA AGCTTCACAA CTACCGACTT 
GTCTTCAAAG CCATCTACGA CGGTGGCGCG ATTAGTCGGG TGGATGTAGC ACGCTTAACC
AACCTCACGC CAACCACAGT TTCGAGTAAC GTCGCGATCC TACTCGAAGA AGAATTGGTG
CAAGAAGTCG GGTTAGCGCC CTCTGGTGGT GGAAAGCCAG CGACATTGCT GAGCGTATTG
GATGATGGTC GCCACTTGAT AGGTTTGGAT GTTGCGGGAC ACGAACTGCG GGGCACGATC
ATCAACTTAC GTGGGGCAAT TCGCCAACGC CAAACGCTCG CGCTGAATGG GGGCAATGTT
CTCGAACAAT TGTATCAACT GATTGATCAA TTGCTAGCGA ACACCCACAG CCCAGTTCTC
GGAATTGGCA TCGGCGCACC AGGGGTCATT AACACCACCG CTGGAGTTGT CCAACAAGCA
GTCAACCTTG GTTGGCACAA TCTCGCACTC CGCGATTTGT TGGGCAAGCG TTATGGGTTA
CCGGTCTATT TGGCCAACGA TAGTCATGTA ACGGCGATTG CTGAACACAC GTTTGGCAGC
CAGCGCAACG CGGCAAACCT TGTGGTGATC AACGTTGGGC GTGGGATCGG CGCAGGCATT
TTTATCAATG GTCGAATTGT TGGTGGTGAT GCTTGGGGAG CGGGTGAAAT CGGTCACGTC
GTGGTTCAAC CTCATGGAAC TCTCTGTCGT TGTGGCCATT ATGGCTGCCT CGAAACTGTT
GCCAGCACAA GTGCGCTGCT AACAAAACTT GATGCAACCC AACCACAATC ACAGCCATGG
ACGATTGCCG AGGTCCAAGC GGCCTTAGCC GCGAATGATC CGACTGTCCG AGCCTTGGTT
GACGAAGCTG CCTACTATCT TGGCATCGCC ATTGCAAATG TAGTGGGTTT GCTCAACGCT
CAATCAATTA TCCTTGCTGG GTCGCTGGCC CAACTTGGCA ATGATTTACT CCAACCGTTA
CGCCGTTCGC TAGCACAACA CGCTTTGCAG ACTTTGGTCG CCGCCACCGA TGTGCAAGTG
AGCACCCTCG GCAGCGATAT CGTTACCCTA GGTGCAGCAG CTCTGTTACT AGCCAATGAG
CTAGGCATTG TTCGGGATTA A
 
Protein sequence
MQGLPKKASR EQSKLHNYRL VFKAIYDGGA ISRVDVARLT NLTPTTVSSN VAILLEEELV 
QEVGLAPSGG GKPATLLSVL DDGRHLIGLD VAGHELRGTI INLRGAIRQR QTLALNGGNV
LEQLYQLIDQ LLANTHSPVL GIGIGAPGVI NTTAGVVQQA VNLGWHNLAL RDLLGKRYGL
PVYLANDSHV TAIAEHTFGS QRNAANLVVI NVGRGIGAGI FINGRIVGGD AWGAGEIGHV
VVQPHGTLCR CGHYGCLETV ASTSALLTKL DATQPQSQPW TIAEVQAALA ANDPTVRALV
DEAAYYLGIA IANVVGLLNA QSIILAGSLA QLGNDLLQPL RRSLAQHALQ TLVAATDVQV
STLGSDIVTL GAAALLLANE LGIVRD