Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2689 |
Symbol | |
ID | 5734570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3446848 |
End bp | 3447843 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641279832 |
Product | glycosyl transferase family protein |
Protein accession | YP_001545455 |
Protein GI | 159899208 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0468436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTGTT CTATCATTAT TTTAAATTGG AATGGCCGAG CACTGCTCGC TGATTGCCTC AACGCCTTAT TGCCCCAATG CGATGCTTCA ATCGAAGTGT TGGTGGTGGA TAATGGCTCA CATGATGGCT CGGCGGCGTG GCTGCATCAA CATTATCCCC AAGTGCGCTT GTTGGCGCTC ACCAACAATC GTGGATTTAG CGGTGGGGTC AATGTTGGCT TGCATGTGGC GCGTGGCGAT GTGCTGTTGT TGTTGAATAA TGATGCAATC GTTGAGCCAA ATTTTATCTC GGCGATTCTC GCGCCGTTTC AGCACCAACC AACGCTTGCT GCTAGCGCTG GTCTGATGAC GTTCGCGCAT CGACCTGAAA TCATCGCCTC AGCCGGAATT CAGCTCTATC GCGATGGCGT GGCAACTGAT GCTGGGTTGT TGCAGCCAGT TGCTCAATTA GCCAGCCAAC CAAGCCCAAT TTGGGGTGGC AGCGGTGGAG CGGTAGCCTA TCGACGTGCC GCTTTAGCCG ATGTCGGCAT GTTCGATGAA GGCTATTTTG CCTATTTGGA AGATGTCGAT TTGGCTTGGC GTTTGCAGTT GCGCGAATGG CAAACCGTGC TAGCCCCGCA GGCCGTCGCT CGGCATATCT ACTCGGCCAC TGGCGGCGAA GGTTCGCCCT TTAAGGATTG GCTGATTGCG CGTAATCGCT GGCGGGTAAT TTTGCGTTGC TGGCCAACGC CGTTGTTGGC CCGCGCTCTC CCATTAATGC TAGCTTACGA TGGTTTGGCT TGTGCACAGG CTATCGTGCG CCGCCGCTGG ACAACGGTCA GCGGGCGTTT GCATGCCTTG CGCCAACTGC CCCAACTACG CCAACAGCGC CAAGCAATTC AAGCTCGCCG CACGGCTAGC ATCGCTGAGC TTGATCATTG GATTAAACCA GCCCGTTCAC CACTGGCAAT TTGGCGTGAA AATCAAGCAC TCGGCCAATT GATCGCTCAA CGCTAG
|
Protein sequence | MNCSIIILNW NGRALLADCL NALLPQCDAS IEVLVVDNGS HDGSAAWLHQ HYPQVRLLAL TNNRGFSGGV NVGLHVARGD VLLLLNNDAI VEPNFISAIL APFQHQPTLA ASAGLMTFAH RPEIIASAGI QLYRDGVATD AGLLQPVAQL ASQPSPIWGG SGGAVAYRRA ALADVGMFDE GYFAYLEDVD LAWRLQLREW QTVLAPQAVA RHIYSATGGE GSPFKDWLIA RNRWRVILRC WPTPLLARAL PLMLAYDGLA CAQAIVRRRW TTVSGRLHAL RQLPQLRQQR QAIQARRTAS IAELDHWIKP ARSPLAIWRE NQALGQLIAQ R
|
| |