Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2678 |
Symbol | |
ID | 5734543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3433413 |
End bp | 3435938 |
Gene Length | 2526 bp |
Protein Length | 841 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279820 |
Product | glycosyl transferase family protein |
Protein accession | YP_001545444 |
Protein GI | 159899197 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCGGC CCAGCGTTTC GATCATTGTG ATCAATTTCA ATGGTAAAAA ACACCTCGTT GATTGTTTAA ATTCGTTGTT TGTTCAGCGC TACCCAAGCA GTGCGCTTGA AATAATTGTC GTCGATAATG CCTCGCACGA TGGCTCCTGC GACTTACTTC GTCAGCAATT TCCCAAGGTT CGTTTAATTG AAAATCGTGA GAATCTGGGT TTTGCTCCGG CAGTCAATCA GGCGGTACGG CTCAGTCAAG CCCAATATGT TGCCTTGATC AATAACGATG CCAAAGCCGA TCCCAACTGG ATCGAGCATT TAGTTGCCGA TATCGAAGCG CATAAAGCCG AAAAGGTAAT TGCGGTTGGG GCAAAAATGC TCGATTGGGA AGGCCAACAG ATTGATTTTA TCCAAGCGGC GTTGAATGTC TTTGGCCATG GCAATCAGCC ATTTACGCGC ATGCCAACTG CTAGCATCGC AGGCCAAGCT GGCCCACAAC TCTTTGCTTG TGGCGGGGCT ATGCTGGCCG ATCGGGCGTT TTTTTTGGCA ATTGGCGGCT TCGACGAAAG CTATTTTGCC TATTTTGAAG ATGTCGATTT TGGCTGGCGA GCATGGTTGT TGGGCTACCA AATTCGTTTT AATCCATCAG CGCTGGTCTA TCATCGCCAA CATGCCACCG CCAACACTAT GGGCGGACAT CAAATTCGCG CCTTACTTGA GCGCAACGCC CTACGCACCA TCATCAAACA TTATGCCGAT GAGCAATTGT GGCGCATTCT GCCAGCTGCC ATTTTGCTGA TTATTCAGCG TAGTTTGCTC GATGGCAGCG GTGGCTTTGA TCGCAAAGAA TTCGATTTAC GGCTGCGCAA ACAGGGCGAC CAAACCAGCA CCATGCAAGT TCCCAAGATT ATGTTGAGTT ACATCGCGGC TTTGGGCGAT GTGCTTGATG GCTGGGATAG CTTGTGGGCT GAGCGTGAAC GCTTGCAAAC GCTTCGCCAA CGTAGCGATG CTGAATTATT TAATTTGTTT GAACAACCAT TTGGCCTGAT CGATCTTGAT GTGCGTTTGC ATATGCAGCA GCAAACCATG GTTGAAAGCT TTAAATTGCG AGAACTTATG CCCAACCCAA CCACGAATGT GCTGATTGTC AGCATTGATC CCTTGCAAGC AGCCTTGGCT GGCCCGGCGA TTCGCAGCGT GCAAATTGCC AAACAACTGA GCCACTCCTG CAAAGTTGTG CTAGCAGCGC CCGATCAGGC CGATCTTGCC ATTCCAAATG TTCAAACCAT CGCCTTTCCT AGCAACGATG GCCGCAGCTT GGGTGAGTTG GCGCTAAATG CCGAGGTCAT TATTGTTCAA GGCTATAGTT TGCAAAAATA TCCCCAATTG CTGAATGCTG AACGCATTTT GGTGGTCGAT CTCTACGATC CCTTCCATTT TGAAGCCCTT GAATTAGCCG AACGCCGTGG CCTCAGTTTA GAACGAGCGC TTGAACTGAA TGATGCCAGC GTGGCAGCCT TGACGCAACA ACTAGCGCTT GGCGATTTCT TTATCTGTGC CAGCGAACGC CAACGTGATT TGTGGCTGGG AGCCTTGACC GTTAGCAAGC GCTTGACTCC CGAACACTAT CGCAATGATC CAACCTTACG CAAGTTGATC GATATTGTGC CATTTGGCTT GCCCAGTGAG CCACCCCAAG CCACTCAGCC AGTGATGCGT GGCGTAATTG AGGGCATTCA GCAAAACGAT GTAATTGCAT TATGGGGCGG CGGCATCTGG GAATGGCTTG ATCCATTGAC GATCATTCGG GCCATGGCCG AATTGCAGCA GAGCCACCCC CAATTAAAGC TGGTCTTTAT GGGTGGGCAA CACCCGAATA CCCAAGATGT TGGGGTGATG CAGCGGTATA GCGAAGCAGT TGAGCTAGCA AAACAGCTGG GTTTATACGC CAAAACGGTC TTTTTCAATC AAACATGGGT CGCCTATGAT CAACGGGTCA ACTATTTGCT TGAGGCCGAT TTAGGAGTTA GCGCTCATCA TAATCATACT GAAACCCGTT TTGCCTTTCG CACGCGATTG CTCGATTACC TCTGGGCCAG CTTGCCAATG ATCGTTTCGG CAGGCGATAG TTTGGCCGAT TTGGTGCAGC AACAACAGCT TGGTCAGGTT GTCGCAATCG AAGATGTCCA GGGCTGGGTC GCCGCTTTAA CCCATGCCGC CGATCATCCT TCCGATCGTC AGCAACGCCA AGCCCAATTT GCCAACATTC AACAAGCCTA TACCTGGGAA CAAGCTTGTG CGCCGTTGGT CGAGTTTTGT CGCCAACCAC AGTATGCTGC CGATAAACGC CGCAACGTCA AAGCCCAAGG CCAACAATCA GGCCAAACCA GCATGCGCTA CCGTATGGAT GAGCTTGATC GGGCGGTTGC CGAGAAAAAT GAGCATATCG CCCAGCTTGA GCAGCATATC AAAGCGCTAG AAAACGGTAA AGTCATGCGC TTGTTAAAGT GGGTCAATCG GTTGCGAAAA AGTTAA
|
Protein sequence | MDRPSVSIIV INFNGKKHLV DCLNSLFVQR YPSSALEIIV VDNASHDGSC DLLRQQFPKV RLIENRENLG FAPAVNQAVR LSQAQYVALI NNDAKADPNW IEHLVADIEA HKAEKVIAVG AKMLDWEGQQ IDFIQAALNV FGHGNQPFTR MPTASIAGQA GPQLFACGGA MLADRAFFLA IGGFDESYFA YFEDVDFGWR AWLLGYQIRF NPSALVYHRQ HATANTMGGH QIRALLERNA LRTIIKHYAD EQLWRILPAA ILLIIQRSLL DGSGGFDRKE FDLRLRKQGD QTSTMQVPKI MLSYIAALGD VLDGWDSLWA ERERLQTLRQ RSDAELFNLF EQPFGLIDLD VRLHMQQQTM VESFKLRELM PNPTTNVLIV SIDPLQAALA GPAIRSVQIA KQLSHSCKVV LAAPDQADLA IPNVQTIAFP SNDGRSLGEL ALNAEVIIVQ GYSLQKYPQL LNAERILVVD LYDPFHFEAL ELAERRGLSL ERALELNDAS VAALTQQLAL GDFFICASER QRDLWLGALT VSKRLTPEHY RNDPTLRKLI DIVPFGLPSE PPQATQPVMR GVIEGIQQND VIALWGGGIW EWLDPLTIIR AMAELQQSHP QLKLVFMGGQ HPNTQDVGVM QRYSEAVELA KQLGLYAKTV FFNQTWVAYD QRVNYLLEAD LGVSAHHNHT ETRFAFRTRL LDYLWASLPM IVSAGDSLAD LVQQQQLGQV VAIEDVQGWV AALTHAADHP SDRQQRQAQF ANIQQAYTWE QACAPLVEFC RQPQYAADKR RNVKAQGQQS GQTSMRYRMD ELDRAVAEKN EHIAQLEQHI KALENGKVMR LLKWVNRLRK S
|
| |