Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2249 |
Symbol | |
ID | 5734136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2866610 |
End bp | 2867881 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279390 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001545017 |
Protein GI | 159898770 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.143292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCCGCG TTGCAATGTT GAGTGTCCAT TCGAGTCCTC TAGCTGCTTT GGGTGGTAAA GAAGCTGGTG GTATGAATGT TTATGTTCGT GAATTAAGCC GTGAGTTGGG TCGCCAAGGG GTGGCGGTCG ATATTTATAC TCGGTCGCAA GACCCTCACA CACCGTTGAT TACCAATCTT GCGCCAAATG TGCGGGTGTT TGCGGTGCGT GTCGGCCCGG CTGCGCCCTA CGATAAGAAT TGGGTTTTGG ATTATTTACC AGAATTTGTC CATCGGATTC GCTGTGTTGC TGATGGCGAA GATATTCATT ACGATGTGAT TCATAGCCAT TATTGGCTCT CTGGCGTTGC CGCGCTAGAG TTACGCCAAG CTTGGGGTAC ACCAGTTATT CATATGTTTC ACACCTTGGG GGCGATGAAA AATACAATTG CCCGTGGCGA TGAAGCTGAA ACTGAACAAC GAATCGCGAT CGAACGCATG CTGTTGCACG AAGTTGATCG AGTTGTTGCG GCTACGCCGC TTGATCGTGC TCAGATGCTC GAACACTATG ACGCTGAGTG TGAGCGGATT GTGGTTGTGC CGTGTGGGGT TGATGTTGAG CATTTCCACC CGATTGCGCA TCAAATTGCC CGCAATGAAT TAGGCGTGCC GCCGCATCCT CATCGTATGT TGCTGTTTGT CGGGCGGATC GAGCCACTCA AGGGAATTGA TACGCTGCTA CGTTCGATGG CCTTGTTGGC TGAGCAACAG CCCTCGTTAC GTGGCGATAT TTGTTTGGCG ATTATTGGCG GCGATCGGCG CGAAACCCCA GATCAATGGA GCAGCGAAAT GCGGCGTTTG CGGCGTTTGC AGGGCGAATT AGGCATAGGC CATTTGGTCA CCTTCCAAGG ATCGCAAGAT CAGCGCAAAT TGCCTTTGTT TTATAGTGCT GCCGATATGG TGGTGGTGCC ATCGCACTAC GAATCGTTTG GCATGGTGGC GCTCGAAGCC ATGGCCTGTG GCACGCCAGT GATAGCTTCC AACGTTGGTG GTTTGCGCTA CACCGTGCGT GACGGCGAAA CGGGCCTACT GGTGCCGCGC GAAGATCCCG AAGCTTTAGC CGAAAAAATT AGTTTGCTCT TGAATGATGA GCCTTTGCGT TTACAATTAG GCCGCAACGG CGTGCAAGCA GCCCAACGCT ATAGCTGGGC CGCAGTTGCC CACGATATTC GTGAGTTGTA TGATCATGTT GTGTGTGGCG AACCATATGC CGATGTGGTT GGAGCCATGT AG
|
Protein sequence | MRRVAMLSVH SSPLAALGGK EAGGMNVYVR ELSRELGRQG VAVDIYTRSQ DPHTPLITNL APNVRVFAVR VGPAAPYDKN WVLDYLPEFV HRIRCVADGE DIHYDVIHSH YWLSGVAALE LRQAWGTPVI HMFHTLGAMK NTIARGDEAE TEQRIAIERM LLHEVDRVVA ATPLDRAQML EHYDAECERI VVVPCGVDVE HFHPIAHQIA RNELGVPPHP HRMLLFVGRI EPLKGIDTLL RSMALLAEQQ PSLRGDICLA IIGGDRRETP DQWSSEMRRL RRLQGELGIG HLVTFQGSQD QRKLPLFYSA ADMVVVPSHY ESFGMVALEA MACGTPVIAS NVGGLRYTVR DGETGLLVPR EDPEALAEKI SLLLNDEPLR LQLGRNGVQA AQRYSWAAVA HDIRELYDHV VCGEPYADVV GAM
|
| |