Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0636 |
Symbol | |
ID | 5732534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 733732 |
End bp | 734901 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277763 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001543412 |
Protein GI | 159897165 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0521552 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACCTG CAAGCCTCAG TGTACTAACC CCCACTCCAG TGTATCCAGC CCATGCTGGT TCAAAAAACT ATAGTTTGAA TGCCGTACAG CAATTAAGTC ATTATTATAC TGTCGATAGT TATTGTTTAG CCACCCAACC TGAAGCCGTC GATTGGGGGC CATTGCCGCA ATGGTGTCGT GATCTACGGG CGTTTACCCC AACCAAGCCA GCGCGAAAAG GCATCGATCC ACCAGCGGTG CATTTGGAAT TTTCGCAACC CATGTGCGAT TATTTACAAC AACGCTGGAT GCGCAATCTG CCTGATTTGT TGCAACTTGA AGGCACAACC ATGGCTCAGT ATGCGCCATT TGCCCGCCGT TTGGGGCTAA AAGCAATTAT CTGTACCATA CATCAAGTTG GGTTTGTGGC ACAATGGCGA CGTTTACAAC GTGAACACCA TTGGAAACTA CGTGCCCGCC GCTTAGCTGG TTTACTCAGC TTATGGCTAT ATGAGCAGCG AGCCTTGCGC CAGTGCGATT TGCTGGTTAC CTTGAGCACA ACCGATCAAC AAACCTTGAA TCGTTGGCAA CCAAAACTCA ACGTTGTCAC TGTGCCAGCA GGGGTTGATT TAAGCCAATG GCCATTATGT CGCCAGTCCC AAGCACCACA GCAGGTGCTG TTTGTCGGTA ACTATTTTCA CCCGCCAAAT GTTGAAGGAG CCTTGTGGTT AGCACGGGAG GTTTGGCCAT TGGTGCAAGC TCAACTGCCT GAAGCACGCT TGATGCTAGC AGGGCGCAGC CCAACCCCTG AAATTCAACA ACTTGCGAGT GCTACAATTC AAGTGCCTGG CACAATTGAT GATTTACAGG CCGTGTATCG GCAAAGCCAG GTGGTGGCAG CGCCAATTTT TTGGGGCAGC GGCGTGCGCA TCAAAATTTT AGAGGGCTTG GCGACAGGCT TGCCCTTAGT CACCACAACG TTGGCGGCTG AAGGCTTGCC GCTGAAACAC GAAGAGCATG CGCTGTTTGC CGAAACGCCG CAAACCTTTG CCGCAGCGCT TGTGCGCATT TTGAACTCGC CACGTTTGGC CGAACAACTC GGCGAAGCAG GCCGGCAATT GATCGCCCAA CAGTATGATT GGCAGGCAAT TGGGCGACAA TTAGCCCAGC ACTATCAGAA GTTACGCTAA
|
Protein sequence | MQPASLSVLT PTPVYPAHAG SKNYSLNAVQ QLSHYYTVDS YCLATQPEAV DWGPLPQWCR DLRAFTPTKP ARKGIDPPAV HLEFSQPMCD YLQQRWMRNL PDLLQLEGTT MAQYAPFARR LGLKAIICTI HQVGFVAQWR RLQREHHWKL RARRLAGLLS LWLYEQRALR QCDLLVTLST TDQQTLNRWQ PKLNVVTVPA GVDLSQWPLC RQSQAPQQVL FVGNYFHPPN VEGALWLARE VWPLVQAQLP EARLMLAGRS PTPEIQQLAS ATIQVPGTID DLQAVYRQSQ VVAAPIFWGS GVRIKILEGL ATGLPLVTTT LAAEGLPLKH EEHALFAETP QTFAAALVRI LNSPRLAEQL GEAGRQLIAQ QYDWQAIGRQ LAQHYQKLR
|
| |