Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_20650 |
Symbol | |
ID | 7760991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 2055211 |
End bp | 2056371 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643804962 |
Product | Glycosyl transferase, family 2 |
Protein accession | YP_002799243 |
Protein GI | 226944170 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGACCT ATCATTTTGT CAGCACCCTC CTGTCCTGGC TGGGTACCGG ACTGGCAGTC GTCACTGCCG GCTACGCCGT CGTTACCCTG GGCGCCGCGC TCAGAGGGAT TCGCGACCGG GCCCCTGCGC TCGGTGCTGC CGATCGTACT CGGCCGGTCA GCATGCTCAA GCCCCTGCAT GGCGCGGAGC CGCGGTTGTA CGAGAATTTG CGCGACTTCT GTCGGCAGAC CCATCCGGAC TACCAGTTGA TATTCGGCGT ACGTGAAGCC GATGATCACG CCATCGCCGT GGTGCACAGA CTGTGCGCGG AGTTCCCGCA CCTGGACATC GATCTGGTCA TCGATCCGCG TGTACACGGC GCCAACCTGA AAGTCAGCAA CTTGCTGAAC ATGCTGCCGC TGGCCCGCCA TGACTGGCTG GTGCTGGCCG ACAGCGACAT CAGCGTGCCG GCGGATTACC TGGTGCGGGT GACGGCGCCG CTGGCAGATC CTGGCGTGGG TATCGTCACC TGTCTTTACT ACGGCGTGCC GCAGGAAAGC TTCTGGTCGC GCCTGGGCGC TCTGTTCATC GACGATTGGT TTGCGCCCTC GGTCCGCTTG TCGCATGTTT TCGGCTCCAC CCGTTTCGCC TTCGGTTCGA CCATCGCGCT GCGCCGCGAG GTATTGCAGG CTATTGGTGG CTTCGAGGTC TTGCGTGATA CTCTGGCCGA CGATTTCTGG TTGGGGGAAC TGACCCGGCG GGCCGGGTTG CGCACCGTGC TGTCGGATCT GCTGGTCGGT ACCGAAGTGA GCGAAACCCG CCTGATCGAG CTGTGGACGC ATGAGTTGCG CTGGTTGCGC ACGATCCGCG CGGTCGCGCC AACTGGTTTT GCGCTGAGCT TCGTCTGTTT CACTTGGCCG GTGTCCCTGC TCGGCCTGGC GCTGAACCCT TCCATGTTGA ATGCCTGGCT CGTCGCGGTA GCGGGTGGCG CGCGTGTTGC CCGCTTTTTC TTCGGCCAGA AAATCAGGCG TTCATCTGTG TCCTGGTACG AGGTTCTGCT GACTCCGTTT CGTGACCTGC TGCTGTTGTT GGAGTGGGCC ATGGCCCTGA CCAGTTGGCG AGTGGAGTGG CGTGGTCGGG TTTTGCATGC GTGCAAGGAT GGGCCCATGC GTTATCTTTG A
|
Protein sequence | MPTYHFVSTL LSWLGTGLAV VTAGYAVVTL GAALRGIRDR APALGAADRT RPVSMLKPLH GAEPRLYENL RDFCRQTHPD YQLIFGVREA DDHAIAVVHR LCAEFPHLDI DLVIDPRVHG ANLKVSNLLN MLPLARHDWL VLADSDISVP ADYLVRVTAP LADPGVGIVT CLYYGVPQES FWSRLGALFI DDWFAPSVRL SHVFGSTRFA FGSTIALRRE VLQAIGGFEV LRDTLADDFW LGELTRRAGL RTVLSDLLVG TEVSETRLIE LWTHELRWLR TIRAVAPTGF ALSFVCFTWP VSLLGLALNP SMLNAWLVAV AGGARVARFF FGQKIRRSSV SWYEVLLTPF RDLLLLLEWA MALTSWRVEW RGRVLHACKD GPMRYL
|
| |