Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_04970 |
Symbol | |
ID | 7759454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 470163 |
End bp | 472154 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643803418 |
Product | Phytase domain protein |
Protein accession | YP_002797726 |
Protein GI | 226942653 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4247] 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCCCG CGTTTTCCCG CGTCCTCCCG GCCGGCCTGC TGCTCTGCGC CGGCCTGGCC CAGGCCGCCG ACCCGCCGCG CCTCGAACTG CAACCCTGGA AGGCGCCGGC CGGCGTCGAG ATCGCCGACC TGCGCCTGGT CCCGGACGGA GCGGCCGGCG CCGGGCTGCG TCTGGCCGCC AGCGAACGCC AGGGGCTGCT GCTGCTCGAC GGCGAGGGCC GCGAACTGGC GCGCCAGGGC GGCAGCTACG CCAGCCTGGA CAGCCGCCTC GCCGGTTCCC GGCTGATGGT CGCCGCGCTG GACGAGACCG CCCAGCGGGT CGCACTGTTC GCCCTCGACC CGGCCAGCCG CCAGTGGGGC CAGCCGCTGT GGCTTCCGGC GCGCGACTAC GGCCTCGCCG GGCTGTGCCT GTACCGCGAC CAGGCGGCCA ATCTCCACCT GTTCCTGCTC AGCGAGGAGG GGCGGGGCGA GCAGTGGCTG GTCGGCAGCG GCGAGCGACT GGCCGGCGAG CCGCGCCTGG AACGCAGCCT GCCACTGCCG GCCGGGGCCG GGCACTGTCA GGTCGAGGAT GGCGCGGGCC TACTGTTCGT CAACGAGGAA GACGTCGGCC TGTGGGCCTA TCCGGCGCAT CCCGAGGCCG ACGGCACGCG CCGGCCGGTG GACATGCTCG ATCCCTTCGG CTCGCTGGGC GGACGCGCCG GGGCCGTGGC CGCGCTGCCC GGCGGTCTGC TGGCGCTGGA CCCGCGGCGC GCTGAGCTGC ACCTCTACCA GTGGCAGGCG TGGGGCTGGC AGGCGCTGGG CGCGCTGCCG CTGGCCGGCC TGGCGGCGCC CGAGCGGCTG GCCGCGCGAC CCACCGCCAA CGGGCTGGAA CTGCTGGTCC GCGACGACGA CGGCCGCCTG TTCGCCGGCA CGCTGGACTG GCGGGCGAGC CCGCCGGCCC TGCCGAAGGC GCTGCCCGAG GTCGCCGCGC TGCGCCAGAG CGAGCCGGTC GGCCGCCATG GCGACGCCGC CGACGACCCG GCGATCTGGG TCCATCCCGG CGATCCGGCG CGCTCGCGGG TGCTCGGCAC CGACAAGAAG CAGGGCCTGC AGGTCTACGA CCTCGACGGC AAACTGCTGC AGGAGTTGCC GGTGGGGCGC CTGAACAACG TCGACCTGCG CCCGGACTTC GCGCTCGGCG GTACGCGGGT CGACCTGGCC GTGGCCAGCC ACCGCGACCG CAACAGTATC GTCGCGTTCG CCATCGACCG CGCCAGCGGC GAGCTGCGCG AGGCCGGCGA AATCTCCACG CCGCTGGCGG AGATCTACGG CATCTGCCTG TTCCAGCCGG CGCCGGGCGA GTTGTACGCC TTCGCCAACG GCAAGGACGG CAGCTTCCGG CAGTACCGCC TGTACGACGC CGGCGGCCGG GTGGCGGGCG AGCCGCTGCG CGGCTTCCGG GTCGCCAGCC AGCCCGAGGG CTGCGTCGCC GACGACCGCC GCCAGCGCCT GTTCCTCGGC GAGGAGGACA CCGGAGTGTG GGCGCTGGAT GCCCGCCCGG ACGCGCCCGT CGAGCTGCAA AGCGTGATCC GCGTCGGCGC GGACCTGCAG GCCGATGTCG AGGGGCTGGC CCTCTACCGG GGCGCGGCCC ACGACTATCT GGTGGTCTCC AGCCAGGGCA ACGACAGCTA TCTGGTGCTC GACGCCGAGC CGCCGCATGC GCTCAAAGGC GCCTTCCGGG TCGGCCTGAA CGTCGAGCTG GGCATCGACG GCGCCTCCGA GACCGACGGC CTGGAGATCG TTTCGGCCGA CCTCGGCGGT CCCTGGAGCA CCGGCCTGCT GGTGGTGCAG GACGGCCGCA AGCGCATGCC CGAGCGGACC CAGAACTTCA AGTTCGTGCC CTGGAGCGCG GTCGCCGAGC GCCTGGGCCT GGCGCCGCCG GCGGCCGGCG AAAACGACGT CGAACCGCAG TCGCCGGCCG ACGCGGACGG ACCCACGGGA GTCGCACCAT GA
|
Protein sequence | MIPAFSRVLP AGLLLCAGLA QAADPPRLEL QPWKAPAGVE IADLRLVPDG AAGAGLRLAA SERQGLLLLD GEGRELARQG GSYASLDSRL AGSRLMVAAL DETAQRVALF ALDPASRQWG QPLWLPARDY GLAGLCLYRD QAANLHLFLL SEEGRGEQWL VGSGERLAGE PRLERSLPLP AGAGHCQVED GAGLLFVNEE DVGLWAYPAH PEADGTRRPV DMLDPFGSLG GRAGAVAALP GGLLALDPRR AELHLYQWQA WGWQALGALP LAGLAAPERL AARPTANGLE LLVRDDDGRL FAGTLDWRAS PPALPKALPE VAALRQSEPV GRHGDAADDP AIWVHPGDPA RSRVLGTDKK QGLQVYDLDG KLLQELPVGR LNNVDLRPDF ALGGTRVDLA VASHRDRNSI VAFAIDRASG ELREAGEIST PLAEIYGICL FQPAPGELYA FANGKDGSFR QYRLYDAGGR VAGEPLRGFR VASQPEGCVA DDRRQRLFLG EEDTGVWALD ARPDAPVELQ SVIRVGADLQ ADVEGLALYR GAAHDYLVVS SQGNDSYLVL DAEPPHALKG AFRVGLNVEL GIDGASETDG LEIVSADLGG PWSTGLLVVQ DGRKRMPERT QNFKFVPWSA VAERLGLAPP AAGENDVEPQ SPADADGPTG VAP
|
| |