Gene Information |
Locus tag | Hoch_0783 |
Symbol | |
ID | 8543165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 1016369 |
End bp | 1018717 |
Gene Length | 2349 bp |
Protein Length | 782 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646385557 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003265292 |
Protein GI | 262194083 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
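The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the helper names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Hoch_0783
start_bp, end_bp = 1016369, 1018717
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record the sketch gives 2349 bp and 782 aa, matching the Gene Length and Protein Length fields.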
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTTC TCCAGGTCAT ACACGGCTAT CCGATGCGGT ACAACGCCGG CTCCGAGGTG TACACGCAGA CGCTGTGTCA CGGACTCGCA GACCGGCACG AGGTACACGT CTTCACACGC GAGGAGGATT CCTTCGCGCC CGACTATCGC ATGCGTCGCG AGCATGACCC CGATGATGCG CGCATCACGC TCCACCTCGT CAACAATCCG CGCAACAAGG ACCGCTATCG CGCGGCCGGT ATCGACCAAC GCTTTGCCGA GCTGCTCGAT CGCCTTCGAC CCGAGGTTGT GCACGTCGGA CACTTGAACC ATCTCTCGAC CTCGCTCCTG CGCGAGGCCG CGACGCGTTC GATTCCGATC CTCTATACAC TTCACGATTA CTGGGTGATG TGCCCGCGCG GGCAGTTCAT GCAGATGTTC CCCGAGGACG GAACCGACCT CTGGGCCGCC TGCGACGGGC AGGACGACCG CAAATGTGCG GAGCGCTGCT ATACGCGCTA CTTCAGCGGG GCGCCTGAAG AACGCGAGCA CGACCTGCGG TATTGGACCG ACTGGGTGGC GCGTCGTATG GCGCACATCC GGGAGATGGC AGACCTCGTC GACGTGTTCA TCGCTCCGGC GCGGTATCTA CGCGACCGCT ACCGCGACGC CTTTGGGCTG CCCGAGCGCA AGCTCGTCTA TCTCGACTAC GGCTTCGACC GCGCGCGTCT GGGCGGGCGC AAACGCACGG CGAACGAGGC ATTTACCTTC GGCTACATCG GCACGCACAT CCCGGCCAAG GGCATCCAAG TGCTACTGCG CGCGTTCGGC GAGCTGCGCG GCAGTCCGCG CCTGCGCATC TGGGGACGCC CCCGCGGCCA GGAGACCGCC GCGCTGAAGG CGCTGGCCGC CGACCTGCCG GGCGACGCGG CCGACCGTGT CGAATGGCTG TCCGAATACC GCAATCAGGA CATCGTCCGC GACGTATTCG ACCGCACGGA CGCAATCGTG GTTCCGTCGG TCTGGGTAGA GAACTCGCCG CTGGTCATCC ACGAGGCGCA GCAGGCGCGC GTGCCCGTGA TCGCGGCCGA CGTCGGCGGC ATGGCCGAAT ACGTCCACCA TGAGATCAAC GGCCTGCAGT TCGAACATCG CTGTGAGCGC TCGCTCGCGG CCCAGATGCA GCGCTTCGTG GACGATCCTG CGTGGGCGCG CGGCCTGGGA GAGCGCGGCT ACGCGTTCAG CGAGAGCGGC GATATCCCGG ATATCGACAC GCAGGTCGAC GATATCGAAC GGCTCTACGA GGCGGCGCTT GCAAGTCGCG ACAGCGCCCG GGTCGAGGTC GGGCCGGGGC CGTGGCGCAT CACCTTCGAC ACCAACCCGG ACACTTGCAA CATGCGCTGC GTGATGTGCG AAGAGCACTC GCCGCACAGC CCGTTGCAGA CGTTGCGCAA GGCGGAGGGG CGAGCGCGCC GGGAGATGCC CATCGAGCTC ATCCGCGAGG TCGTGGCCGA CGCAGCAGCG CACGGACTGC GCGAGATCAT CCCATCGACC ATGGGCGAGC CGCTGCTCTA CGAACACTTC GAAGCGATCC TGGCGCTGTG CGTAGAGCAC GGCGTTCGCC TCAATCTCAC GACCAACGGC AGCTTCCCGC GGCTCGGCGC GAGAGCGTGG GCCGAGCGCA TCGTGCCGGT CACTTCCGAT GTGAAGATCT CCTGGAACGG AGCCACCAAG GCGACGCAGG AGGCGATCAT GCTGGGTTCA GACTGGGAGG CGGTTTTAGA CAACGTCCGC GCTTTCCTCG CGGTCCGTGA CGGGCACGCG 
GCGCGCGGCG GCAACCGCTG CCGAGTGACC TTTCAGCTCA CCTTCCTCGA AGCCAATGTC GCCGAACTCG CCGACATCGT GCGGCTCGCG GTCTCGCTGG GAGTCGATCG GGTCAAGGGT CATCACCTGT GGGCGCACTT CGACGAGATC AAAGAACAGT CGATGCGCCG CAGCCCCGAG GCGATCCAGC GCTGGAACGC GGCCGTGCTC GCGGCGCGCG AGGCCGCCGC CGAGCGGCCT CTGCCCAACG GCAAGTACGT CCTGCTGGAG AATATATTTT TACTGGAGGA GCAAGCTACG GCGGACCTGG CGCCTGGCGG GCCGTGTCCG TTTCTCGGCA AGGAGGCATG GGTGAGCGCC GAAGGCCGCT TCGATCCCTG CTGCGCGCCC GATGCTCAGC GGCGTACGCT GGGCTCGTTC GGTAATCTCG GCGATAGCGG CATCATGGAG ATCTGGAACG GGCCCGCGTA CCGCGAGCTG GCCGCCAGCT ATCGCAATCG CGCCTTGTGT CTGCGTTGCA ACATGCGGAA ACCCGCGGAG GAGCCGTGA
|
Protein sequence | MKVLQVIHGY PMRYNAGSEV YTQTLCHGLA DRHEVHVFTR EEDSFAPDYR MRREHDPDDA RITLHLVNNP RNKDRYRAAG IDQRFAELLD RLRPEVVHVG HLNHLSTSLL REAATRSIPI LYTLHDYWVM CPRGQFMQMF PEDGTDLWAA CDGQDDRKCA ERCYTRYFSG APEEREHDLR YWTDWVARRM AHIREMADLV DVFIAPARYL RDRYRDAFGL PERKLVYLDY GFDRARLGGR KRTANEAFTF GYIGTHIPAK GIQVLLRAFG ELRGSPRLRI WGRPRGQETA ALKALAADLP GDAADRVEWL SEYRNQDIVR DVFDRTDAIV VPSVWVENSP LVIHEAQQAR VPVIAADVGG MAEYVHHEIN GLQFEHRCER SLAAQMQRFV DDPAWARGLG ERGYAFSESG DIPDIDTQVD DIERLYEAAL ASRDSARVEV GPGPWRITFD TNPDTCNMRC VMCEEHSPHS PLQTLRKAEG RARREMPIEL IREVVADAAA HGLREIIPST MGEPLLYEHF EAILALCVEH GVRLNLTTNG SFPRLGARAW AERIVPVTSD VKISWNGATK ATQEAIMLGS DWEAVLDNVR AFLAVRDGHA ARGGNRCRVT FQLTFLEANV AELADIVRLA VSLGVDRVKG HHLWAHFDEI KEQSMRRSPE AIQRWNAAVL AAREAAAERP LPNGKYVLLE NIFLLEEQAT ADLAPGGPCP FLGKEAWVSA EGRFDPCCAP DAQRRTLGSF GNLGDSGIME IWNGPAYREL AASYRNRALC LRCNMRKPAE EP
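The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. The sketch below is an assumption-laden illustration rather than the database's own method: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts), strips the spaces that group the sequence into blocks of ten, and stops at the first stop codon. The names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon -> amino acid mapping shared by translation tables 1 and 11,
# listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon (not included)."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record
print(translate("ATGAAAGTTC TCCAGGTCAT ACACGGCTAT"))
```

Applied to the first three blocks of the gene sequence, this yields `MKVLQVIHGY`, the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.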