Gene Information |
Locus tag | Avin_30080 |
Symbol | |
ID | 7761909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3115656 |
End bp | 3117953 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643805881 |
Product | Polysaccharide biosynthesis protein |
Protein accession | YP_002800149 |
Protein GI | 226945076 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.819591 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCGC CACACAAGGA ATCTTCGCAA CCCATCCGCC AGCGCCTGCT GGACCTTTCC CATGCCCAGA AGCGGCTCAT CCAGGTCGGC GTCGACCTCC TGCTCGTCTG GCTGGCGCTC TGGCTGGCGT TCTACATCCG CCTCGAAGAC ATGACCCCGA TCGAACCCCT CGGCGACCAC GCCTGGCTGT TCACCGCCGC CCCGGCCGTC GCCCTGCCGA TCTTCGTGCG CCTGGGCATG TACCGGGCGG TCATGCGTTA CCTGGGCAAC GAGGCTTTGC TGAGCATCGC CCGGGCGGTC ACCCTGTCGG CGCTGCTGCT GGCCCTGGTC ATCCTCCTCC ACGGCAAGAC GAGTCCGCCG ATCCCGCGCT CGGTGATCTT CAACTACTGG GGGCTGAGCC TGCTGTCGAT CGGCGGCCTG CGCATCGCCA TGCGCCAGTA TTTCACCGGC GACTGGTTCA ACCTGCGCGA GCTGTCCTTC CGGCGCCCGG AGAACGGCCC GCCACGGGTG GCCATCTACG GCGCGGGCTC GGCCGGCAAC CAGCTCGCCC ACGCCCTGCT CATGGGCCAC GCGCTGCGTC CGGTGGCCTT CATCGACGAC GATGCGCACC TGGCCGGCCG CACGATCGCC GGCCTGCCGG TCTACGCCCC CGACCGGCTG GAGCGGATGC TCGGGGAAAC CGGCGCCACG GAGATCGTCC TGGCGATCCC CTCGGCCAGC CGCAGCCGCC GGCGCAGGAT CCTCGAGATG CTGCAGGCCC ACCCGCTGCC GGTGCGCTCG ATGCCCAGCA TCGCCAAGCT GGCCTGCGGC CGCCTCACGG TGAACGACCT GCAGGAAGTG GACATCGCCG ACCTGCTCGG CCGCGATGCG CTGCCCCTGC GGCCCGAATT GCAGGAACGC TGCATCCGCG GCCAGGTGGT GATGGTGACC GGTGCCGGCG GCTCGATCGG CGCGGAACTG TGCCGGCAAA TCCTCGCCAA CGGACCGGCC ACGCTGATCC TCTTCGAGCA CTCGGAGTAC AACCTGTACA GCATCCACGG CGAACTGGAA CGGAGAATCC ACCGGGAAGC CCCGGGCCTG CGCCTGGTCC CCGTCCTCGG CTCGGTCCGC CACGGCAGCC GCCTGCTCGA CACGCTGCGC CACTGGCGGG TCGACACGCT CTACCACGCC GCGGCCTACA AGCACGTGCC GCTGGTCGAA GTGAACATCG GCGAAGGCGT GCTCAACAAC ACCTTCGGCA CCCTGCGCGC CGCCCAGGCA GCGATACGCG CCGGAGTACG CAGCTTCGTG CTGATCTCGA CCGACAAGGC GGTACGCCCC ACCAATGTGA TGGGCGGCAG CAAGCGCCTG GCCGAGATGG TGCTGCAGGC CTTCTCCGGG GAGCGCGAGG TGGAGCTGTT CGATGAGCCC GGCCTGTCGC CGCAACCCAA CCGCACCCGT TTCACCATGG TCCGCTTCGG CAACGTGCTG GGTTCGTCCG GCTCGGTGAT CCCGCTGTTC CGCGAACAGA TCCGCAACGG CGGGCCGATC ACCGTCACCC ACCCGGAGAT CACCCGCTAC TTCATGACCA TCCCCGAGGC GGCGCAACTG GTGATCCAGG CCGGCGCCAT GGGCGAAGGC GGCGACGTCT TCGTGCTGGA CATGGGCGAG CCGGTGAAGA TCCTCGACCT GGCGGAGAAG ATGGTGCGCC TGTCCGGCCT GTCGCTGCGC ACCAAGACCA GCCCGGACGG CGACATCGAG ATCCGCTTCG TCGGCCTGCG TCCCGGCGAG AAGCTCTACG AGGAACTGCT GATCGGCGAC 
GGCGCGACGC CGACCAGCCA CAGCCGGATC ATGCGCGCCC ACGAGGAGCA CCTGCCCTGG CGGGAGCTGA AGCCGCGCCT CGAAGCCTTG GCCGGGGCGC TCGAAGCCGA CGATTTCCCG CGCATCCGCG AGTTGCTGCA ACGCACCGTC AGCGGCTACC GGCCGAGCAG GGAAATCCTC GGCCGGCCCC TGCAGGAACA TCCGGCCAGT TGTCAGCCCG TTCCGCCTGG ATGCACCTGG TGGTTCCACA GGATGCCAGG CCGAACGGCG AAAACCCCTG ACGGGGCCTG TCCTGCCGTA ATGGCCGCTG CGAGTCGACC CGGCCTCGAC AAAACTCTCC CGCGAATCAC GGACGAGGCG ACATGGAGAC GACTCGATCT GCCCTGCCCT CGTTCGGCGG ACCGCTCTTT TCTCCTGGCC GAAACCCTCG CCCGCCAGGC CGAGACCGAT CCAGCCGCCA CAAACATATT GACCGGAAAA GAGCGACAGA ACAGATGA
|
Protein sequence | MKSPHKESSQ PIRQRLLDLS HAQKRLIQVG VDLLLVWLAL WLAFYIRLED MTPIEPLGDH AWLFTAAPAV ALPIFVRLGM YRAVMRYLGN EALLSIARAV TLSALLLALV ILLHGKTSPP IPRSVIFNYW GLSLLSIGGL RIAMRQYFTG DWFNLRELSF RRPENGPPRV AIYGAGSAGN QLAHALLMGH ALRPVAFIDD DAHLAGRTIA GLPVYAPDRL ERMLGETGAT EIVLAIPSAS RSRRRRILEM LQAHPLPVRS MPSIAKLACG RLTVNDLQEV DIADLLGRDA LPLRPELQER CIRGQVVMVT GAGGSIGAEL CRQILANGPA TLILFEHSEY NLYSIHGELE RRIHREAPGL RLVPVLGSVR HGSRLLDTLR HWRVDTLYHA AAYKHVPLVE VNIGEGVLNN TFGTLRAAQA AIRAGVRSFV LISTDKAVRP TNVMGGSKRL AEMVLQAFSG EREVELFDEP GLSPQPNRTR FTMVRFGNVL GSSGSVIPLF REQIRNGGPI TVTHPEITRY FMTIPEAAQL VIQAGAMGEG GDVFVLDMGE PVKILDLAEK MVRLSGLSLR TKTSPDGDIE IRFVGLRPGE KLYEELLIGD GATPTSHSRI MRAHEEHLPW RELKPRLEAL AGALEADDFP RIRELLQRTV SGYRPSREIL GRPLQEHPAS CQPVPPGCTW WFHRMPGRTA KTPDGACPAV MAAASRPGLD KTLPRITDEA TWRRLDLPCP RSADRSFLLA ETLARQAETD PAATNILTGK ERQNR
|
| |
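The coordinate, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration and not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above (Avin_30080 on NC_012560).
length = gene_length(3115656, 3117953)
print(length)                  # 2298
print(protein_length(length))  # 765
```

Under those assumptions, both results agree with the Gene Length (2298 bp) and Protein Length (765 aa) fields listed in the record.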