Gene Haur_3662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3662 
Symbol 
ID5735523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4603950 
End bp4605638 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content53% 
IMG OID641280811 
Productribulokinase 
Protein accessionYP_001546426 
Protein GI159900179 
COG category[C] Energy production and conversion 
COG ID[COG1069] Ribulose kinase 
TIGRFAM ID[TIGR01234] L-ribulokinase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCGGC AAACATACGC GATCGGTGTC GATTTCGGCA CCGAGTCGGG CCGTGCTGTG 
CTCGTTGATG TGCGCAATGG TCAAGAAATC GCCACGGCAA TTTATCCCTA TGCCAATGGT
GTGATCGACG AGAAGCTGCC TGGCACCAAC ATTCGGCTGG AGCCAGATTG GGCACTACAA
GACCCCAATG ATTATCTCGA TGTGTTCAAA ATAACCATTC CTGCTATTCT CAAAGAAAGT
GGCGTTGACC CAGCCAATGT GATTGGGATT GGGGTCGATT TTACTGCTTG TACGATGTTG
CCAACCAAAG CCGATGGCAC GCCATTGTGT ATGCTGCCTG AGTGGCGCAA CACTCCTCAT
GCCTGGGTCA AATTGTGGAA ACATCACGCG GCCCAGCCAG AAGCCAACCA ACTTAACCAC
CTTGCTCGTG AGCTTGGCTA TAGCTTTCTT GATCGCTACG GCGGCAAAAT TAGCTCGGAG
TGGTTTTTTC CCAAGGCTTG GCAAATTCTC AACGAAGCCC CCGAAGTCTA TGCCGCCGCT
GATCGTTTGA TCGAAGCGAC TGACTGGGTA GTTTGGCAAT TAACTGGGGT CGAAACCCGC
AATGAATGCA CCGCAGGCTA CAAAGCCATG TGGTCGAAAT CCGAGGGCTT TCCACCCAAC
GAATTTTTCA AAGCACTCGA CGAACGTATG GAACAGATCG TCGATCAAAA AATGTCGCGC
ACGCTCTTGC CGCTTGGCGC AAAAGCTGGC GGTCTCAGCC AACAAGCCGC CGAATGGACG
GGCTTACTGG CAGGCACAGC AGTTGCCGTC GCCAATGTTG ATGCCCACGT CACCCTGCCA
GTTACTGGCA ACACCGAAAT CGGCACGATG GTGATGATTA TGGGCACCAG CACCTGCGAC
GTGATGAACG GCGAACATCG CGATGAATTG CCAATTGTCG AGGGCATGTG CGGGGTGGTT
GATGGCGGGA TCGTGCCAGG CATGCTGGGC TACGAGGCAG GCCAGAGCGG GGTTGGCGAT
ATTTTCGCTT GGTTTATTGA GCATGGCGTG CCTGGCGACT ATTTTGAGCA AGCTAAGGCC
GAAGACATCA ATATTCACAC CTTGCTCGAA CGTGAAGCCG CCAAACTTCA GCCTGGCGAG
AGCGGTCTCT TGGCGCTCGA TTGGTTCAAT GGCAATCGCT CAACTTTGGT CGATGTTGAA
CTCAACGGCT TGGTGTTGGG CATGACCTTG GCCACCAGCG CACCCGAAAT TTACCGTGCC
TTGCTTGAAG CGACGGCCTA TGGCAAACGC GAAATTATCG AAACCTTCAA TCAATCGGGC
GTGCCAATTC GCAAATTGAT TGCGGCTGGC GGCCTGCCCG AGAAAAATCA TCTGCTGATG
CAAATTTACG CTGATGTGAC CAACTATGAA ATTAGCGTGA TTGCCAGCAA ACAAGCCCCA
GCGCTTGGTT CGGCCATGCA CGGCGCAGTT GCTGCTGGCG TTGAAGCAGG TGGCTACGCC
GATATTGCCA GCGCCGCCAA ACAGATGGGC CGACTTAAAA CCGAAACGTT CAAGCCCATT
CCCGCCAATG TTGAAATTTA CGACCAGCTC TATGCTGAAT ATAAAGTGCT ATACAACTAC
TTTGGTCGTG GCGAAAACGA TGTGATGAAG CGCTTGCGAA TGCTCCGTCA CGCCGCACTC
ACGGCGTAG
 
Protein sequence
MSRQTYAIGV DFGTESGRAV LVDVRNGQEI ATAIYPYANG VIDEKLPGTN IRLEPDWALQ 
DPNDYLDVFK ITIPAILKES GVDPANVIGI GVDFTACTML PTKADGTPLC MLPEWRNTPH
AWVKLWKHHA AQPEANQLNH LARELGYSFL DRYGGKISSE WFFPKAWQIL NEAPEVYAAA
DRLIEATDWV VWQLTGVETR NECTAGYKAM WSKSEGFPPN EFFKALDERM EQIVDQKMSR
TLLPLGAKAG GLSQQAAEWT GLLAGTAVAV ANVDAHVTLP VTGNTEIGTM VMIMGTSTCD
VMNGEHRDEL PIVEGMCGVV DGGIVPGMLG YEAGQSGVGD IFAWFIEHGV PGDYFEQAKA
EDINIHTLLE REAAKLQPGE SGLLALDWFN GNRSTLVDVE LNGLVLGMTL ATSAPEIYRA
LLEATAYGKR EIIETFNQSG VPIRKLIAAG GLPEKNHLLM QIYADVTNYE ISVIASKQAP
ALGSAMHGAV AAGVEAGGYA DIASAAKQMG RLKTETFKPI PANVEIYDQL YAEYKVLYNY
FGRGENDVMK RLRMLRHAAL TA