Gene Avin_04970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_04970 
Symbol 
ID7759454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp470163 
End bp472154 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content75% 
IMG OID643803418 
ProductPhytase domain protein 
Protein accessionYP_002797726 
Protein GI226942653 
COG category[I] Lipid transport and metabolism 
COG ID[COG4247] 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCCCG CGTTTTCCCG CGTCCTCCCG GCCGGCCTGC TGCTCTGCGC CGGCCTGGCC 
CAGGCCGCCG ACCCGCCGCG CCTCGAACTG CAACCCTGGA AGGCGCCGGC CGGCGTCGAG
ATCGCCGACC TGCGCCTGGT CCCGGACGGA GCGGCCGGCG CCGGGCTGCG TCTGGCCGCC
AGCGAACGCC AGGGGCTGCT GCTGCTCGAC GGCGAGGGCC GCGAACTGGC GCGCCAGGGC
GGCAGCTACG CCAGCCTGGA CAGCCGCCTC GCCGGTTCCC GGCTGATGGT CGCCGCGCTG
GACGAGACCG CCCAGCGGGT CGCACTGTTC GCCCTCGACC CGGCCAGCCG CCAGTGGGGC
CAGCCGCTGT GGCTTCCGGC GCGCGACTAC GGCCTCGCCG GGCTGTGCCT GTACCGCGAC
CAGGCGGCCA ATCTCCACCT GTTCCTGCTC AGCGAGGAGG GGCGGGGCGA GCAGTGGCTG
GTCGGCAGCG GCGAGCGACT GGCCGGCGAG CCGCGCCTGG AACGCAGCCT GCCACTGCCG
GCCGGGGCCG GGCACTGTCA GGTCGAGGAT GGCGCGGGCC TACTGTTCGT CAACGAGGAA
GACGTCGGCC TGTGGGCCTA TCCGGCGCAT CCCGAGGCCG ACGGCACGCG CCGGCCGGTG
GACATGCTCG ATCCCTTCGG CTCGCTGGGC GGACGCGCCG GGGCCGTGGC CGCGCTGCCC
GGCGGTCTGC TGGCGCTGGA CCCGCGGCGC GCTGAGCTGC ACCTCTACCA GTGGCAGGCG
TGGGGCTGGC AGGCGCTGGG CGCGCTGCCG CTGGCCGGCC TGGCGGCGCC CGAGCGGCTG
GCCGCGCGAC CCACCGCCAA CGGGCTGGAA CTGCTGGTCC GCGACGACGA CGGCCGCCTG
TTCGCCGGCA CGCTGGACTG GCGGGCGAGC CCGCCGGCCC TGCCGAAGGC GCTGCCCGAG
GTCGCCGCGC TGCGCCAGAG CGAGCCGGTC GGCCGCCATG GCGACGCCGC CGACGACCCG
GCGATCTGGG TCCATCCCGG CGATCCGGCG CGCTCGCGGG TGCTCGGCAC CGACAAGAAG
CAGGGCCTGC AGGTCTACGA CCTCGACGGC AAACTGCTGC AGGAGTTGCC GGTGGGGCGC
CTGAACAACG TCGACCTGCG CCCGGACTTC GCGCTCGGCG GTACGCGGGT CGACCTGGCC
GTGGCCAGCC ACCGCGACCG CAACAGTATC GTCGCGTTCG CCATCGACCG CGCCAGCGGC
GAGCTGCGCG AGGCCGGCGA AATCTCCACG CCGCTGGCGG AGATCTACGG CATCTGCCTG
TTCCAGCCGG CGCCGGGCGA GTTGTACGCC TTCGCCAACG GCAAGGACGG CAGCTTCCGG
CAGTACCGCC TGTACGACGC CGGCGGCCGG GTGGCGGGCG AGCCGCTGCG CGGCTTCCGG
GTCGCCAGCC AGCCCGAGGG CTGCGTCGCC GACGACCGCC GCCAGCGCCT GTTCCTCGGC
GAGGAGGACA CCGGAGTGTG GGCGCTGGAT GCCCGCCCGG ACGCGCCCGT CGAGCTGCAA
AGCGTGATCC GCGTCGGCGC GGACCTGCAG GCCGATGTCG AGGGGCTGGC CCTCTACCGG
GGCGCGGCCC ACGACTATCT GGTGGTCTCC AGCCAGGGCA ACGACAGCTA TCTGGTGCTC
GACGCCGAGC CGCCGCATGC GCTCAAAGGC GCCTTCCGGG TCGGCCTGAA CGTCGAGCTG
GGCATCGACG GCGCCTCCGA GACCGACGGC CTGGAGATCG TTTCGGCCGA CCTCGGCGGT
CCCTGGAGCA CCGGCCTGCT GGTGGTGCAG GACGGCCGCA AGCGCATGCC CGAGCGGACC
CAGAACTTCA AGTTCGTGCC CTGGAGCGCG GTCGCCGAGC GCCTGGGCCT GGCGCCGCCG
GCGGCCGGCG AAAACGACGT CGAACCGCAG TCGCCGGCCG ACGCGGACGG ACCCACGGGA
GTCGCACCAT GA
 
Protein sequence
MIPAFSRVLP AGLLLCAGLA QAADPPRLEL QPWKAPAGVE IADLRLVPDG AAGAGLRLAA 
SERQGLLLLD GEGRELARQG GSYASLDSRL AGSRLMVAAL DETAQRVALF ALDPASRQWG
QPLWLPARDY GLAGLCLYRD QAANLHLFLL SEEGRGEQWL VGSGERLAGE PRLERSLPLP
AGAGHCQVED GAGLLFVNEE DVGLWAYPAH PEADGTRRPV DMLDPFGSLG GRAGAVAALP
GGLLALDPRR AELHLYQWQA WGWQALGALP LAGLAAPERL AARPTANGLE LLVRDDDGRL
FAGTLDWRAS PPALPKALPE VAALRQSEPV GRHGDAADDP AIWVHPGDPA RSRVLGTDKK
QGLQVYDLDG KLLQELPVGR LNNVDLRPDF ALGGTRVDLA VASHRDRNSI VAFAIDRASG
ELREAGEIST PLAEIYGICL FQPAPGELYA FANGKDGSFR QYRLYDAGGR VAGEPLRGFR
VASQPEGCVA DDRRQRLFLG EEDTGVWALD ARPDAPVELQ SVIRVGADLQ ADVEGLALYR
GAAHDYLVVS SQGNDSYLVL DAEPPHALKG AFRVGLNVEL GIDGASETDG LEIVSADLGG
PWSTGLLVVQ DGRKRMPERT QNFKFVPWSA VAERLGLAPP AAGENDVEPQ SPADADGPTG
VAP