Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1120 |
Symbol | |
ID | 5733012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1282621 |
End bp | 1283781 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278259 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001543896 |
Protein GI | 159897649 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.233547 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAT TATCTGCCTT GACCTACTAC GCGCCGCACT GGACGGGCTT GACCATGCAT GCCCAACGGG TGGCTGAAGG ACTCGCAGCT CGTGGACACC ACGTCACCGT CTTAACAATC CAGCATGAGC CAACGCTGCC AACTGAAGAA ATCTTGAATG GGGTGCATGT ATTGCGGCTT AAGCCTGCGG CTCAAATTAG CCGTGGCATG CTCGCACTCA ATTTTCCATT TGTCGCCGCC AAATTGATTC GTAGCCACGA TGTGGTGCAT GTGCATACGC CCCAACTTGA AGCCTTGCTG CTTTCAGGGT TATGCCGCGT GCTGAATAAG CCCTTGTTGA TGAGCCATCA TGGCGATTTG GTCATGCCAA CTGGCTTGAT TAATCGGGCG ATCGAGAAGG TCATGATTGG CCAGATGGTT TTAGCAGGCA AATTGGCGCG GCGCGTTAGT GCCTATAGTC GCGATTACGC CGCAAACTCC AGCTTTTTGC AAAAATTCAC CAAAAAATTG ACGTATATTT ACCCACCGGT TGATTTACCA ACCCCCAATC CAAGCCAAGT AGCCGCTTGG AAAGCCGAAC TTGGGATCAG CGATAAGCCG ATTGTTGGCT TTGCTGGGCG CTTCGTTGAA GAAAAAGGCT TTGATTTCTT GCTCAAAGCT ATGCCAATGA TCGCCGAGGT CTTTCCTGAG GTGCGCTTTG TGTTTGCGGG CGAGCACAAA ATGGTCTATG AAGATTTTTA TTCCACATGT TTGCCGCTGA TCGAGCAAAA CCGTGAACGA ATTGTGTTCC TTGGTTTGTT GCGTGATTCG CAAAAACTTG CCAATTTTTA TGCAATGTGC GACTTGTTTA CCTTGCCCAG CCGCACCGAT TGTTTGGCGA TGGTGCAGAT CGAGGCTTTA CTGGCGGGCA CGCCGTTGGT CACCAGCGAT ATTCCAGGCG CACGGGTTGT CGTGCAGGAA ACTGGCTTTG GGCGCTTGGT GCAAACTCAA AATCCTCGCG CCTTAGCCGA TGGGATTATT GAAGTGCTGA AAAACCCTGA AACCTATAAA GTGCAACCCG CCAAAGTTGA ACAAGTCTTT TCAGTCAAAA CCATTCTCGA TAGCTACGAG CGGACGATGG CCGAAATGTG TGGTCAGCCC GTTTCTGCCT CGGTTGTATA A
|
Protein sequence | MKILSALTYY APHWTGLTMH AQRVAEGLAA RGHHVTVLTI QHEPTLPTEE ILNGVHVLRL KPAAQISRGM LALNFPFVAA KLIRSHDVVH VHTPQLEALL LSGLCRVLNK PLLMSHHGDL VMPTGLINRA IEKVMIGQMV LAGKLARRVS AYSRDYAANS SFLQKFTKKL TYIYPPVDLP TPNPSQVAAW KAELGISDKP IVGFAGRFVE EKGFDFLLKA MPMIAEVFPE VRFVFAGEHK MVYEDFYSTC LPLIEQNRER IVFLGLLRDS QKLANFYAMC DLFTLPSRTD CLAMVQIEAL LAGTPLVTSD IPGARVVVQE TGFGRLVQTQ NPRALADGII EVLKNPETYK VQPAKVEQVF SVKTILDSYE RTMAEMCGQP VSASVV
|
| |