Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4228 |
Symbol | |
ID | 5736082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5389144 |
End bp | 5390625 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281383 |
Product | undecaprenyl-phosphate galactose phosphotransferase |
Protein accession | YP_001546988 |
Protein GI | 159900741 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00880688 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGCG ATTGGTCTAC CAGCCAACCG ATTTTTTCGC AGCGCGATGT GCGTCAAACC ACATCGCGGC TGGCTCTGAC CTTGCTCGAT GGATGTTTGA TTTTGCTAGC CTTTGCCGTA GCGCACTGGC TGCGTTACGA TGTCCGCTTA GGCCGCGATA TTTACGACCC AGCTTCATAT CGCCAACTCT CGGCCTTCTA CCCGATGATG TTGGTGTTTA TGCTGACGCT GATTAGCACG TTGCACTGGC GCGGGTTTTA TCGCCTGCCC CGTTCAGCCT CAGCCTTCGA TTCATTTAGT ATTATCGTTA CCAGCACCAC GATTGCCCTT GCCCTCACGG TCATGTGGCT GTTTATCAAT CGCGCCGATT TATGGTCGCG CTTGATTATG GTGTTTGTTT GGTTCTGCGT AATTGTGGCG CTGACGCTGG GTCGGATTAG TTTGCGCATG CTGAGGCGTT GGGCTTGGCG ACGCGGCGTT GGCTTAGAGC AAGTCGTGGT GGTTGGCAAT CGTGGCCTAG CCAAACAGGT GATGGAAGAA TTGCAGTATA CGCTCGATCA TGGCCATCAT TTGTTGGGTT ATGTCGAAGG ACCGCCTGAT GATCGCGACG ATGTAGCCGC ACCAGGTGAG CAATTTCGCT GGCTTGGCAC GCTCAGCCAA TTCGAGCAGA TTATGCGTCA GCGCCATGTC GATCAAGTGA TTATTGCCTT GCCATTTTGG GCGCACACCA GCCTACCCGA AGTTGTGGCA ATTTGCCGTA AGTTCAACAT TGAATTTCGC GTCGCCCCCG ATTTATATGA ACTCAGCTTC GATCGCGTCA GCATTCAGCG CTTGAGCACG ATTCCCCTGT TGCGCTTGAA AAAGAATGTG ATTCGCGGCT GGAATTATGT GTTCAAACGC AGCACCGATT TGCTGATGAT TGGCCTGACT GCGCCGATTT GGGCCACAAT TTGGGGCTTG GCCGCCTTGA TGATCAAACT TTCTGATCCA CAAGCGCCGG TAGTGTTTCG CCAACCACGC ATCGGCAAGC ATGGCCAAAC ATTTATGGTC TACAAATTAC GCACGATGGT GCCCAATGCC GAGGCGCTCA AAAAGAGCCT GATGGATCAA AATGAGGCCG AGGGAGCACT GTTCAAAATC AAAGACGACC CACGAGTCAC CCGCTTAGGC CGGATTTTAC GCAAACTGAG CATCGATGAA CTGCCCCAAC TCTACAATGT GCTACGCGGC GAAATGAGCC TGGTTGGCCC ACGCCCGCAA GTGCCCGACG AAGTAGCCCA ATATCAAGAA TGGCACTATC GCCGTTTGGA AGTGACCCCA GGTTTGACTG GTTTATGGCA AGCCTCAGGC CGCTCCAACA CCACCTTCGA TGATATGGTG CGCTTGGATA TTTACTACAC CGAGCACTGG TCGCTCTGGC TTGATCTGCG GATTATGATC ATGACGATTC CAGCGGTGCT ATTTGGTCGT GGCGCATACT AA
|
Protein sequence | MMSDWSTSQP IFSQRDVRQT TSRLALTLLD GCLILLAFAV AHWLRYDVRL GRDIYDPASY RQLSAFYPMM LVFMLTLIST LHWRGFYRLP RSASAFDSFS IIVTSTTIAL ALTVMWLFIN RADLWSRLIM VFVWFCVIVA LTLGRISLRM LRRWAWRRGV GLEQVVVVGN RGLAKQVMEE LQYTLDHGHH LLGYVEGPPD DRDDVAAPGE QFRWLGTLSQ FEQIMRQRHV DQVIIALPFW AHTSLPEVVA ICRKFNIEFR VAPDLYELSF DRVSIQRLST IPLLRLKKNV IRGWNYVFKR STDLLMIGLT APIWATIWGL AALMIKLSDP QAPVVFRQPR IGKHGQTFMV YKLRTMVPNA EALKKSLMDQ NEAEGALFKI KDDPRVTRLG RILRKLSIDE LPQLYNVLRG EMSLVGPRPQ VPDEVAQYQE WHYRRLEVTP GLTGLWQASG RSNTTFDDMV RLDIYYTEHW SLWLDLRIMI MTIPAVLFGR GAY
|
| |