Gene Information |
Locus tag | Haur_3662 |
Symbol | |
ID | 5735523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4603950 |
End bp | 4605638 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280811 |
Product | ribulokinase |
Protein accession | YP_001546426 |
Protein GI | 159900179 |
COG category | [C] Energy production and conversion |
COG ID | [COG1069] Ribulose kinase |
TIGRFAM ID | [TIGR01234] L-ribulokinase |
|
|
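The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon (the listed gene sequence ends in TAG, which is consistent with that assumption). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4603950, 4605638)
print(length_bp)                  # 1689
print(protein_length(length_bp))  # 562
```

Both values agree with the Gene Length (1689 bp) and Protein Length (562 aa) fields in the record.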
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCGGC AAACATACGC GATCGGTGTC GATTTCGGCA CCGAGTCGGG CCGTGCTGTG CTCGTTGATG TGCGCAATGG TCAAGAAATC GCCACGGCAA TTTATCCCTA TGCCAATGGT GTGATCGACG AGAAGCTGCC TGGCACCAAC ATTCGGCTGG AGCCAGATTG GGCACTACAA GACCCCAATG ATTATCTCGA TGTGTTCAAA ATAACCATTC CTGCTATTCT CAAAGAAAGT GGCGTTGACC CAGCCAATGT GATTGGGATT GGGGTCGATT TTACTGCTTG TACGATGTTG CCAACCAAAG CCGATGGCAC GCCATTGTGT ATGCTGCCTG AGTGGCGCAA CACTCCTCAT GCCTGGGTCA AATTGTGGAA ACATCACGCG GCCCAGCCAG AAGCCAACCA ACTTAACCAC CTTGCTCGTG AGCTTGGCTA TAGCTTTCTT GATCGCTACG GCGGCAAAAT TAGCTCGGAG TGGTTTTTTC CCAAGGCTTG GCAAATTCTC AACGAAGCCC CCGAAGTCTA TGCCGCCGCT GATCGTTTGA TCGAAGCGAC TGACTGGGTA GTTTGGCAAT TAACTGGGGT CGAAACCCGC AATGAATGCA CCGCAGGCTA CAAAGCCATG TGGTCGAAAT CCGAGGGCTT TCCACCCAAC GAATTTTTCA AAGCACTCGA CGAACGTATG GAACAGATCG TCGATCAAAA AATGTCGCGC ACGCTCTTGC CGCTTGGCGC AAAAGCTGGC GGTCTCAGCC AACAAGCCGC CGAATGGACG GGCTTACTGG CAGGCACAGC AGTTGCCGTC GCCAATGTTG ATGCCCACGT CACCCTGCCA GTTACTGGCA ACACCGAAAT CGGCACGATG GTGATGATTA TGGGCACCAG CACCTGCGAC GTGATGAACG GCGAACATCG CGATGAATTG CCAATTGTCG AGGGCATGTG CGGGGTGGTT GATGGCGGGA TCGTGCCAGG CATGCTGGGC TACGAGGCAG GCCAGAGCGG GGTTGGCGAT ATTTTCGCTT GGTTTATTGA GCATGGCGTG CCTGGCGACT ATTTTGAGCA AGCTAAGGCC GAAGACATCA ATATTCACAC CTTGCTCGAA CGTGAAGCCG CCAAACTTCA GCCTGGCGAG AGCGGTCTCT TGGCGCTCGA TTGGTTCAAT GGCAATCGCT CAACTTTGGT CGATGTTGAA CTCAACGGCT TGGTGTTGGG CATGACCTTG GCCACCAGCG CACCCGAAAT TTACCGTGCC TTGCTTGAAG CGACGGCCTA TGGCAAACGC GAAATTATCG AAACCTTCAA TCAATCGGGC GTGCCAATTC GCAAATTGAT TGCGGCTGGC GGCCTGCCCG AGAAAAATCA TCTGCTGATG CAAATTTACG CTGATGTGAC CAACTATGAA ATTAGCGTGA TTGCCAGCAA ACAAGCCCCA GCGCTTGGTT CGGCCATGCA CGGCGCAGTT GCTGCTGGCG TTGAAGCAGG TGGCTACGCC GATATTGCCA GCGCCGCCAA ACAGATGGGC CGACTTAAAA CCGAAACGTT CAAGCCCATT CCCGCCAATG TTGAAATTTA CGACCAGCTC TATGCTGAAT ATAAAGTGCT ATACAACTAC TTTGGTCGTG GCGAAAACGA TGTGATGAAG CGCTTGCGAA TGCTCCGTCA CGCCGCACTC ACGGCGTAG
|
Protein sequence | MSRQTYAIGV DFGTESGRAV LVDVRNGQEI ATAIYPYANG VIDEKLPGTN IRLEPDWALQ DPNDYLDVFK ITIPAILKES GVDPANVIGI GVDFTACTML PTKADGTPLC MLPEWRNTPH AWVKLWKHHA AQPEANQLNH LARELGYSFL DRYGGKISSE WFFPKAWQIL NEAPEVYAAA DRLIEATDWV VWQLTGVETR NECTAGYKAM WSKSEGFPPN EFFKALDERM EQIVDQKMSR TLLPLGAKAG GLSQQAAEWT GLLAGTAVAV ANVDAHVTLP VTGNTEIGTM VMIMGTSTCD VMNGEHRDEL PIVEGMCGVV DGGIVPGMLG YEAGQSGVGD IFAWFIEHGV PGDYFEQAKA EDINIHTLLE REAAKLQPGE SGLLALDWFN GNRSTLVDVE LNGLVLGMTL ATSAPEIYRA LLEATAYGKR EIIETFNQSG VPIRKLIAAG GLPEKNHLLM QIYADVTNYE ISVIASKQAP ALGSAMHGAV AAGVEAGGYA DIASAAKQMG RLKTETFKPI PANVEIYDQL YAEYKVLYNY FGRGENDVMK RLRMLRHAAL TA
|
| |
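The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned and how a GC percentage could be computed from the result. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its output is not the whole-gene GC content reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence shown above.
example_blocks = "ATGAGTCGGC AAACATACGC"
example_seq = clean_sequence(example_blocks)
print(len(example_seq))         # 20
print(gc_percent(example_seq))  # 50.0
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_percent` would give a value that can be compared against the GC content field.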