Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_45520 |
Symbol | mdoH1 |
ID | 7763420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4620924 |
End bp | 4623479 |
Gene Length | 2556 bp |
Protein Length | 851 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643807400 |
Product | glucosyltransferase MdoH |
Protein accession | YP_002801641 |
Protein GI | 226946568 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2943] Membrane glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATC CGACGCCTCC CGAGACTCCG CTGGCCGAAT ACCTCGAACA TCTGCCGCTG GATGCGCTGG AGCGCGAACG TCTGGCCGAT GCCGATTCGC TGGCCGAACT GCATCGCCGG CTGGCCGGAG CGTCTCCGGC GCCGGCCGGC GATCCCGCTT TGGCCTCGGC CGCCAGGCGC CTGGGACTGG GTGACCGCGA GCGGGACGGG ATCGACATGG ATGACGCCGG CCGTCCCTGC CTCAAGGCCG CACCGCCGAT CCGGCGCACC AAGGTCGTTC CCGAGCCCTG GCGAACCAAT GTGCTGGTAC GTCTCTGGTA CCGGCTCACC GGCCGACGGA GTCCGCCCAG GCCGACCCGC GATCTGCCGA AGGCTCGTTG GCGGCGGGTC GGCTCGCTGC GCCGGCTCAT CCTGCTGGTG CTGATGCTGG GCCAGACCGC CCTGGCCACT TGGTACATGA AGGGCATCCT GCCCTATCAG GGCTGGGCCT TCGTCGATCT CGCCGAGATC AGCCGGCAGA GCTGGCAACA GAGCCTGCAA CAGGTCTTGC CGTATATCCT CCAGTTCGGC GTGCTGTTTC TCTTCACCAT CCTGTTCTGC TGGGTTTCGG CCGGCTTCTG GACGGCCTTG ATGGGCTTCT GGGAACTGCT CAGCGGGCGT GACCGTTACC GCATTTCCGG CAGCAGCGCG GGCGACGAGC CAATCGCCGC CGAGGCACGC ACGGCCATCG TCATGCCGAT CTGCAACGAA GACGTGGCGC GCGTGTTCGC CGGCCTGCGG GCGACTTACG AGTCGCTGGC CGCCACTGGC GAGCTGGCGC GCTTCGACTT TTTCGTGCTC AGCGACAGCA GTTCGGCCGA CATCGCCGTC GCCGAGCAGC AGGCCTGGCT CGAAGTGTGC CGCGAGACCG GCGGCTCCGG GCGCATCTTC TATCGCCGGC GCCGGCGCCG GGTGAAGCGC AAGAGCGGCA ACATCGACGA CTTCTGCCGG CGCTGGGGCA GTCAGTACCG CTACATGGTG GTGATGGATG CCGACAGCGT GATGAGCGGA GACTGCCTGG CCAAGCTGGT CCGACTGATG GAGGCCAATC CCGAGGCCGG GATCATCCAG ACCGCGCCGA AGGCCTCGGG CATGGACACG CTCTACGCGC GCTTGCAGCA GTTCGCCACC AGTGTCTACG GCCCGCTGTT CACCGCCGGC CTGCATTTCT GGCAGCTCGG CGAGTCGCAC TACTGGGGAC ACAACGCGAT CATCCGCGTC AAGCCGTTCA TCGAGCACTG CGCGTTGGCT CCGCTGTCGG GCAGGGGGGC GTTCGCCGGG GCGATCCTCT CCCACGACTT CGTCGAGGCC GCCCTGATGC GGCGCGCCGG CTGGGGGGTG TGGATCGCCT ACGACCTGCC GGGCAGCTAC GAGGAACTGC CGCCGAACCT GCTCGACGAG CTCAAGCGCG ACCGCCGCTG GTGCCACGGC AACCTGATGA ATTTCCGCCT GTTCCTGGTC AAGGGCATGC ACCCGGTGCA CCGCGCGGTG TTCCTCACCG GGGTGATGTC CTATCTGTCG GCGCCGCTGT GGTTCGCTTT CCTGGTGCTC TCCACCGCGC TACTGGCGGT ACATCAATTG ATGGAGCCGC AGTACTTCCT GGCGCCCAGG CAACTCTTCC CGATCTGGCC GCAATGGCAC CCGGAGCGTG CCATCGCGCT GTTCTCCACC ACCCTGACGC TGCTGTTCCT GCCCAAGCTG CTCAGCGTGA TTCTGGTCTG GGCGAAAGGG GCCAAGGCCT ACGGCGGAGC GTTCAAGGTG GCCTTGAGCA TGTTGCTGGA GATGCTCTTC TCGATGCTGG TGGCACCGGT ACGCATGCTT TTCCACACCC GCTTCGTGAT CGCTGCGTTT CTCGGCTGGT CGGTGCAGTG GAAGTCCCCG CAGCGCGACG ACGACGCCAC CACCTGGGGC GAGGCCGTGC GCCGGCATGG TGGGCAGACC CTGCTCGGCA TCGCCTGGGC GCTGCTGGTG GCCTGGCTGA ACCCGCGCTT CCTCTGGTGG CTGTCGCCCA TTCTGGGGTC GCTGATACTC TCCATTCCGG TTTCGGTGGT CACGAGCTGG GTCGGATGGG GCCTGCGCGC GCGTCGCGGC AGGTTATTCC TCATTCCCGA GGAGTACGAT ACGCCGCCCG AGTTGCGCGC CACCGAGCGC TACACCGGGG AGAACCGGCA GCGGGCGTTC GGGGACGGGT TCATCCGGGC GGCCGTCGAC CCGTGGCTCA ATGCCCTGGC CTGCGCCATG GGCACGGCGC GGCACGGCAC GGCCGAGGCC ATCGAGAGGC GGCGTCGCGA GCGCGTGGAA CAGGCGCTGG CCGCCGGACC GGAAGGACTG GACGGCGAGA GCCGCCTGGC CCTGCTGAGT GACCCGGTGG CCCTGGCGCG CCTGCATCTG CGCTTGTGGG AGGAGGGCCG AGAGAATTGG CTGGCGCCCT GGCGCCAGCC GTTGGCCGGT GCCTGCCGAA GCGGCGTGGC GGCCGGGAGC ACGCCGGAGG CGGCGGGATT GCTGCTGGCG AGATAG
|
Protein sequence | MNNPTPPETP LAEYLEHLPL DALERERLAD ADSLAELHRR LAGASPAPAG DPALASAARR LGLGDRERDG IDMDDAGRPC LKAAPPIRRT KVVPEPWRTN VLVRLWYRLT GRRSPPRPTR DLPKARWRRV GSLRRLILLV LMLGQTALAT WYMKGILPYQ GWAFVDLAEI SRQSWQQSLQ QVLPYILQFG VLFLFTILFC WVSAGFWTAL MGFWELLSGR DRYRISGSSA GDEPIAAEAR TAIVMPICNE DVARVFAGLR ATYESLAATG ELARFDFFVL SDSSSADIAV AEQQAWLEVC RETGGSGRIF YRRRRRRVKR KSGNIDDFCR RWGSQYRYMV VMDADSVMSG DCLAKLVRLM EANPEAGIIQ TAPKASGMDT LYARLQQFAT SVYGPLFTAG LHFWQLGESH YWGHNAIIRV KPFIEHCALA PLSGRGAFAG AILSHDFVEA ALMRRAGWGV WIAYDLPGSY EELPPNLLDE LKRDRRWCHG NLMNFRLFLV KGMHPVHRAV FLTGVMSYLS APLWFAFLVL STALLAVHQL MEPQYFLAPR QLFPIWPQWH PERAIALFST TLTLLFLPKL LSVILVWAKG AKAYGGAFKV ALSMLLEMLF SMLVAPVRML FHTRFVIAAF LGWSVQWKSP QRDDDATTWG EAVRRHGGQT LLGIAWALLV AWLNPRFLWW LSPILGSLIL SIPVSVVTSW VGWGLRARRG RLFLIPEEYD TPPELRATER YTGENRQRAF GDGFIRAAVD PWLNALACAM GTARHGTAEA IERRRRERVE QALAAGPEGL DGESRLALLS DPVALARLHL RLWEEGRENW LAPWRQPLAG ACRSGVAAGS TPEAAGLLLA R
|
| |