Gene Information |
Locus tag | Haur_2166 |
Symbol | |
ID | 5734053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2732635 |
End bp | 2735121 |
Gene Length | 2487 bp |
Protein Length | 828 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279307 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001544934 |
Protein GI | 159898687 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
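The start, end, gene length, and protein length listed above agree with each other if the coordinates are read as 1-based and inclusive and the CDS is taken to end in a stop codon. Both are assumptions, although the stated figures are consistent with them. A minimal Python sketch of that arithmetic follows; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values copied from the record for Haur_2166.
length = gene_length_bp(2732635, 2735121)
print(length)                     # 2487
print(protein_length_aa(length))  # 828
```

The strand (`-`) does not affect either number; it only determines which strand of the replicon carries the coding sequence.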
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAC GTCAGTTAAA TGTCGTTTTC TGGAGTGGTT GTGGTGGCGA GACCCAGCGT TATCGCTGTC AGCATGCAAT TGAGCAATTA CAGTATCGTG GGCATAAAGC CCAATTGTTT AATCAGATTG ATCAAGCGGC GATTGTAGCG GTTGCTGCTG CCGATTTGGT GGTTGTGCAT CGTCCTAAAG AAACTCACTT TTGGGAAACG ATTCAGCAAG CAGCACAAGG CAAACCCGTG GTCTATGAAA CCGACGACTT GCTGTTTGAC CCAGCCTTGA TCGATTCGAT GCCAATTGTG GCTGAGAGCA CGGGCTTTGA ACAGCAATTT TGGCGTGGCT ATGCCCGGGG CAATCCGCCA GTGTTTGCTC GTTGTGATGC AGCAATTGTC AGCACTACGC CCTTGGCTCA GGCGGCTGAA GCATTGCAAA AACCGGTTTG GGCGCATCGT AATGTGTTGG GCGACGATTG GATTGCATGG TGTGAGGCGG CCTATCGCGA GCGCCAAACT CAAGCCCATG TAACAATTGG CTATTTTAGT GGCACCTTTT CGCACGATGC CGACCTGCGT TTGATTGCCC CAGCTTTGCT AAAACTGTTG CAACAACAGC CCAAACTACG CTTAATGCTG GGTGGCAAAA TCACGGTGCC TGATATTTTA GCCCCGGTTG CCAACCAAAT TGAGCAATTG CCGTTTGTGC CGCTTGAGCA ATTGCCGCAA CTCATGTCCA AAGCCGATAT TATTTTGGCT CCCTTGGATG TGGATAATGC TTTTACTCGC TGCCGTAGCG AATTGAAGTA CCTCGAAGCC GCCGCCTTGC GCTTGCCCGT GGTGGCTAGC CCGATTCCGG CCTTTGCCGA GGCAATTCGG CATGGAGAAA CCGGCTTTTT AGCTACTTCC GAAGCCGAAT GGTATAGCCA ATTAAGCAAT TTGCTGGCCG ATGCCACGTT ACGTCAGCGG GTTGGGCAAG CTGCCTATAC CCATGTGCTT GGTCATTACA CAATTGCAAC CGCAGCGGCT GATTATGAAG CGATGCTGTT AGCCATCTTG CAGCAGTTTC CGACCAAACC TGCTCAGCCG GCATTGCAAC CCCTACTCAG CCAATTTCAG CGTGATTTGA CCTTCCACGA TCGCTCAGTC CATATGATCA CAGGCTGTGA TATTGGCAAC GCAGGCAATT ACCGCTGCCG CCATCGTCAA GAGCAACTCG ATTGGTTTGA TATGTATAGC GGCGTAACGA GCCTTTACAA TGAGCCATTT AAAATTGCCG ATAGCATTAA GTTTGGGATC TTGATTCTCC ATCGGGTTGC GCTTGATTCG AATATTGCCA CGTTGATTGA TGCGCATCAA GCCTTGGGCC ATCCGGTTAT TTTTGATACC GATGATTTGG TGTTTCGTAC CGATTTGCTG CATCATATCG ATGCAATTAA AGATTGGCCA GCTGACGAAG TGGCCCTCTA TCGCGATGGA GTCGAGCGTT ACCTCAAAAC CATGCTGCTG TGCGATGCAG TGATTGTTTC GACTGAGCCG TTGGCAGAGC AGGTACGAGC ATTCGACCTG AATGCCTATG TGGTGCGCAA TGCCTTGAGC CAAAACCAAA TCAGCTATGC CGAGCCAATT GCTGCTCAGC GCCAAGCTAA GCCACTGGCC CAACCGCATG ATCCGGTGTT GATTGGCTAC TTTAGCGGTA CAGCTACCCA TAATCGCGAT TTTATGCAAG CCGAGCAAGC AATTTTGCAT ATTTTGGCAA CCTATACCCA TGTGCGCTTG CGCATTGTGG GGCCATTGCA ATTATCGAAG 
GCCTTTGATC CATATATTGA TCGGATTGAG CGCCGCGAAC TTGTGCCGCT CGAACAACTG GCCGACGAAA TTGCTGCTGT TGATTTTGCG CTTGCTCCCT TGGAGCTTGA TAATCCATTT TGCCAATCCA AGAGCGAAGT TAAATATATG GAAGCTGCTT TGGTTGGTGT GCCCTTGATT GCAACCCCGA TTGAAGCTTT TCGTTATGCA ATTACCCATG GCATCAACGG TATGTTGGCG GCAAATGAGC AAGAATGGAT TGAGGCACTT GAAGCTTTGG TAACTGATCC ATCATTGCGC CAACGCCTAG GCCATGAGGC CTTGGCTGAT GCCCATGCCC GCTATAGCCC AAAAGCCCGT AGCCGCGAGT TGCACAATGT GCTTCAACAA ATTTGGAGCA TGTATACCCG CCAAATGTCC TTGATCAAGA GCAATGCCAT GTTGTTGGGT GCTAATAAAT CGCTCTCAAA GGGCAATGAA GTTTTAGTGA TGCATATCGA TGGCTTGATT ACCAGCAACC AAGCGTTGCA TTATCGAGTG CAGGAGCTTG AGCGCGACAA TGCTCAAGCC AAGGCTTATG CCCATCAATT AGAGATTCAG CTGCAACAGA TTGCCAATGG TATCTTCATG CAGTTCAGCG GTAAAGCCAA AGGGCTATTA CACCGTCTTA TCAACCGTAA AGGATAG
|
Protein sequence | MSKRQLNVVF WSGCGGETQR YRCQHAIEQL QYRGHKAQLF NQIDQAAIVA VAAADLVVVH RPKETHFWET IQQAAQGKPV VYETDDLLFD PALIDSMPIV AESTGFEQQF WRGYARGNPP VFARCDAAIV STTPLAQAAE ALQKPVWAHR NVLGDDWIAW CEAAYRERQT QAHVTIGYFS GTFSHDADLR LIAPALLKLL QQQPKLRLML GGKITVPDIL APVANQIEQL PFVPLEQLPQ LMSKADIILA PLDVDNAFTR CRSELKYLEA AALRLPVVAS PIPAFAEAIR HGETGFLATS EAEWYSQLSN LLADATLRQR VGQAAYTHVL GHYTIATAAA DYEAMLLAIL QQFPTKPAQP ALQPLLSQFQ RDLTFHDRSV HMITGCDIGN AGNYRCRHRQ EQLDWFDMYS GVTSLYNEPF KIADSIKFGI LILHRVALDS NIATLIDAHQ ALGHPVIFDT DDLVFRTDLL HHIDAIKDWP ADEVALYRDG VERYLKTMLL CDAVIVSTEP LAEQVRAFDL NAYVVRNALS QNQISYAEPI AAQRQAKPLA QPHDPVLIGY FSGTATHNRD FMQAEQAILH ILATYTHVRL RIVGPLQLSK AFDPYIDRIE RRELVPLEQL ADEIAAVDFA LAPLELDNPF CQSKSEVKYM EAALVGVPLI ATPIEAFRYA ITHGINGMLA ANEQEWIEAL EALVTDPSLR QRLGHEALAD AHARYSPKAR SRELHNVLQQ IWSMYTRQMS LIKSNAMLLG ANKSLSKGNE VLVMHIDGLI TSNQALHYRV QELERDNAQA KAYAHQLEIQ LQQIANGIFM QFSGKAKGLL HRLINRKG
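The two sequences above can be cross-checked by translating the gene sequence codon by codon. The sketch below is an illustration only, with hypothetical helper names. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG) and relies on translation table 11 assigning the same amino acids to internal codons as the standard code. Whitespace between the 10-base groups is ignored.

```python
# Codon table in NCBI order (T, C, A, G); the amino-acid assignments are
# those shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 18 bases of the gene sequence in the record.
print(translate("ATGAGTAAAC GTCAGTTAAA TGTCGTTTTC"))  # MSKRQLNVVF
print(translate("ATCAACCGTA AAGGATAG"))               # INRKG
```

The two fragments reproduce the first ten and last five residues of the listed protein. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content reported in the record. Table 11 also permits alternative start codons, which would need special handling at the first position; this gene starts with ATG, so the sketch does not cover that case.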