Gene Avin_20770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_20770 
Symbol 
ID7761002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2068901 
End bp2070034 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content58% 
IMG OID643804972 
ProductSqualene/phytoene synthase 
Protein accessionYP_002799253 
Protein GI226944180 
COG category[I] Lipid transport and metabolism 
COG ID[COG1562] Phytoene/squalene synthetase 
TIGRFAM ID[TIGR01559] farnesyl-diphosphate farnesyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00645323 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGAAT CCCAAACAGC TCTAGCTATC GAGTCCTCAA CAACCAGTGC CGATGCTTAT 
CAAAACGCTA TCCTTTCAAA AGTCTCCAGA ACTTTCGCAC TGACCATTCC CCAATTGCCG
CCGCTATTAC GCCGTGCGGT GACCAATGCT TATCTGTTGT GTCGCATCGC AGATACCATC
GAGGATGAAC CGGCGTTTTC CGCCGAGGAA AAGCGTCGTT ACGAGGATGC ATTCATCGAT
GCGGTGACTG GCCGCATCGC ACCGCAATAT TTCTCGACCG AACTGGCGTC ACGATTCTCC
ACGGAAACCT CGGAAGCCGA GCGTGACCTG GTGAGCCAAT TGCCGTTGGT GTTACAGGTT
ACCAATAGCT TGAAGCCGGC ACAGCGCATG GCGATCGTCA ATTGCCTGAA AGTGATGTCC
CACGGCATGC ACGACTTCCA GCGCAACGTA GGCCAGCATG GACTGGAAAC GCTGTGCGAC
ATGGATTGCT ACTGCTACTG CGTAGCCGGC GTAGTGGGCG AAATGCTGAC GGAACTGCTC
ATCGATTTCG ATCCCGCCCT GGCCAGCCAG CGTGACCCTC TGATGCGTCT GGCGATCTCT
TTCGGCCAAG GGCTGCAGAT GACCAACATC CTCAAGGATC AGTGGGAAGA TTACCGCCGT
GGCGTCTGCT GGCTGCCGCA GGACGTCTTC GCCCGATATG GCGTGCGATT GGAGGAGTTG
CAAGCGGGCC GGCAGGATGC GAACTATATG AGCGCACTGA CCGAGCTCAT CGGGGTGGCT
CACGCCCACC TGCGCGACGC GTTGGAATAT ACGCTGATGA TTCCGAACAG ACACTCCGGG
TTCCGCCGCT TCTGCTTGTG GAGTATCGGC CTCGCCGTGC TGACACTGCG CAAGCTGCAG
CAAAACCCCC ATTTCTCCGC CGGCGAGCAA GTGAAGGTAT CGCGCAAGGC GGTAGCCTAC
ACCATCGCGC TCACGCGACT GACAGGCAAT TACAATACCG GTTTGCGCTG GCTGTTCGCA
GCATCCGCAC GCAAGCTTCC GTTGACGCCG CTGTCCGCGG AATGGAGCAC CTCTCCCCAC
CCACACCTTG CCTGGCCGAA GAGCGCCATC TCCTACTTCG CCGAATCGGC CTAG
 
Protein sequence
MLESQTALAI ESSTTSADAY QNAILSKVSR TFALTIPQLP PLLRRAVTNA YLLCRIADTI 
EDEPAFSAEE KRRYEDAFID AVTGRIAPQY FSTELASRFS TETSEAERDL VSQLPLVLQV
TNSLKPAQRM AIVNCLKVMS HGMHDFQRNV GQHGLETLCD MDCYCYCVAG VVGEMLTELL
IDFDPALASQ RDPLMRLAIS FGQGLQMTNI LKDQWEDYRR GVCWLPQDVF ARYGVRLEEL
QAGRQDANYM SALTELIGVA HAHLRDALEY TLMIPNRHSG FRRFCLWSIG LAVLTLRKLQ
QNPHFSAGEQ VKVSRKAVAY TIALTRLTGN YNTGLRWLFA ASARKLPLTP LSAEWSTSPH
PHLAWPKSAI SYFAESA