Gene Information |
Locus tag | Haur_0991 |
Symbol | |
ID | 5732894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1136365 |
End bp | 1137405 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641278125 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001543767 |
Protein GI | 159897520 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
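
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. A minimal sketch of that check follows, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon,
    which is not translated into an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates of Haur_0991 as listed in the record.
gene_len, protein_len = cds_lengths(1136365, 1137405)
print(gene_len, protein_len)
```

With the coordinates listed for Haur_0991, the result agrees with the 1041 bp and 346 aa reported in the record.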
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00704021 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAGC CCCTCTACAT TTCGCCGCTC GGCAAGGGCG GCGTTGATAT TGGCATTCAA AATATCACCC GTTCGATGCA ACGAGCTGGC CATCAACCAA GTTTACGCCG TTTATCGCAC TTATTTAATT TTGCCCCGAT GGCCGCGAAA TTGGCGCTGG GCCGCAATTG GTGGCAAGGC CATGATCTGG TTCAGGCGCG TTCACGCTGC GCATGGGCGT TACGCGCACC TGGTTTGCCA TTGGTGACGA CGGTGCATCA CATCGTGACC GACCCATTGC TTCAGCCCTA TAGCTCACCG CAACAACGCT TGTTTTATCG CTTGGTTGAA AAAACCTACG ATGGCTTATC GGTACGCTTG GCTGATGAAG TGATTTGTGT CAGCCGTTAC ACGCAGGAGC AAACCCGCCT GACCTACGGC GACCGCCCGA CCACCGTGAT TTACGATGGG ATTGATACCG AAGTTTTTAC CCCAGCCCCA GATTTTCAGC GCACCAACGA TTTGCCTGAG CATCCCGCAC CGATTCGTTT GCTGTTTGTT GGCAATCGCA CGCGGCGCAA AGGCTTTGAT CTGCTGCCAA AAATTATGGA TCAACTGGGG CCGCAGTATG TGCTGTATTA CACCGAGTCG TTTCAAGGGG TGCAGGCAGC GCCGCCGCAT CCGCAGATGG TGCGGATTGG CACGCCCGAC CGTGATGGTT TGATTGCGGC CTATCGTTCC TGTGATGTAT TGTTGTTTCC GGCGCGAGTT GAAGGCTTTG GAATTGTTGC AGCCGAGGCG GGAGCTTGTG GCAAGCCTGT CATTACCACC AACGCCTCAG CGCTGCCCGA AGTGGTTAAT CATGGCGAAA CGGGGCTGCT CTGCGAATTG GATAATGTCC AAGCCTTTGT TCAAGCCATC CAAGAGTTGG GCGAAGATCC AGCACGGCGT TTGCAGATGG GCCAAGCCGC CCGCGAACGC GTGGCCAGCA ATTTTGGCTA CGATGTCTTG GCGCGAGAAC TAGCGGCGGT TTACCGTCGC GCCTTATTTG GCACTGCCTA G
|
Protein sequence | MLKPLYISPL GKGGVDIGIQ NITRSMQRAG HQPSLRRLSH LFNFAPMAAK LALGRNWWQG HDLVQARSRC AWALRAPGLP LVTTVHHIVT DPLLQPYSSP QQRLFYRLVE KTYDGLSVRL ADEVICVSRY TQEQTRLTYG DRPTTVIYDG IDTEVFTPAP DFQRTNDLPE HPAPIRLLFV GNRTRRKGFD LLPKIMDQLG PQYVLYYTES FQGVQAAPPH PQMVRIGTPD RDGLIAAYRS CDVLLFPARV EGFGIVAAEA GACGKPVITT NASALPEVVN HGETGLLCEL DNVQAFVQAI QELGEDPARR LQMGQAARER VASNFGYDVL ARELAAVYRR ALFGTA
|
| |
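
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that spacing, translate the nucleotide sequence and compute a GC fraction. It is an illustration under stated assumptions, not the database's own procedure: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG).

```python
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
# Standard codon table, built in TCAG order; '*' marks a stop codon.
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, copied from the record.
prefix = clean("ATGCTAAAGC CCCTCTACAT TTCGCCGCTC")
print(translate(prefix))
```

The translated prefix corresponds to the first block of the listed protein sequence, MLKPLYISPL. Passing the full gene sequence through `clean` and `translate` gives a way to check it against the protein sequence shown.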