Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_30020 |
Symbol | |
ID | 7761903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3108502 |
End bp | 3109587 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643805875 |
Product | Glycosyl transferase, group 1 family protein |
Protein accession | YP_002800143 |
Protein GI | 226945070 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.349292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCC TCCTGCTCAC CCGCTATCCG CGCACGGGCG CCAGCAGCCG GCTGCGCACC CTGCAGTACC TGCCCCACCT GCGGGCCGCC GGCCACGAGG TCCGGGTGCA GAGCCTGTTC GACGAAGCCT GGCTGGAAGG CCTCTACCGG CACGGCCGGC GCCCGCCGGG ACGGACCGCC GCGCTCTATC TGAGCCGCCT GGCGGCGCTC GCCAGGGCCG CCGACCATGA CCTGCTGTGG ATCGAGAAGG AACTCTTCCC CTACCTGCCC GCCTGGATCG AACGCCAGTT GGGCACGCCC TGGATCGTCG ACTACGACGA TGCGGTGTTC CACAACTACG ACCTGGCGCG CAGCCCCCTG GTGCGCCGGC TGCTGGGCGA CAAGATCGAC GCCGTCATGC GCGGCGCCAG CGGCGTGATC GCCGGCAACC GCTATCTGGC CGAGCGCGCC CGTGCCGCCG GCGCGGCCCG GGTGACGCTC ATTCCCACCG TGGTCGACCT CGGCCGCTAC CGGCCGCGCA TAACGGCGCC GGCCGCGCAC CCGGTGATCG GCTGGATCGG CTCGCCCTCC ACCCAGAAGT ACCTGCTCGA CATCCGCGCG CCCCTGCAGC AGGCCTGCCG CGGCCATGGC GCACGCCTGC TGCTGGTGGG CGCGACGCCG GAAATCCGCG CCGGCCTGCC CGGCATCGAG GTACAGCTCG AACCCTGGAG CGAAGAGCGC GAGGCGGCGC TGATCCGGCG CATGGACATC GGCATCATGC CGCTGCCCGA CGGCCCCTGG GAACGCGGCA AGTGCGGCTA CAAGCTGATC CAGTACATGG CCTGCGGCGT GCCGCTGGTG GCCTCGCCGA CCGGCGCCAA CCGGGAGATC GTGGAGCACG GCGGGGCGGG CCTGCTGGCC GACTCCGCCG ACGCCTGGCA CGACGCCCTG TCCCGACTGC TCAGTTCGGC CTCCGAACGG GAACGGCTGG GCCGGGCCGG GCGCCAGGCC GTGGAGAACC GCTACTCGCT GCAGCGCCAG TTCCCGGTGC TGCTGGAAGC CCTGGGCGCG GCACTGCCGA GCGCCGGCAA GGCCCTGGTC AATTGA
|
Protein sequence | MKILLLTRYP RTGASSRLRT LQYLPHLRAA GHEVRVQSLF DEAWLEGLYR HGRRPPGRTA ALYLSRLAAL ARAADHDLLW IEKELFPYLP AWIERQLGTP WIVDYDDAVF HNYDLARSPL VRRLLGDKID AVMRGASGVI AGNRYLAERA RAAGAARVTL IPTVVDLGRY RPRITAPAAH PVIGWIGSPS TQKYLLDIRA PLQQACRGHG ARLLLVGATP EIRAGLPGIE VQLEPWSEER EAALIRRMDI GIMPLPDGPW ERGKCGYKLI QYMACGVPLV ASPTGANREI VEHGGAGLLA DSADAWHDAL SRLLSSASER ERLGRAGRQA VENRYSLQRQ FPVLLEALGA ALPSAGKALV N
|
| |