Gene Avin_20760 details

Gene Information

Locus tag: Avin_20760
Symbol:
ID: 7761001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2066888
End bp: 2068684
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 63%
IMG OID: 643804971
Product: squalene-hopene cyclase
Protein accession: YP_002799252
Protein GI: 226944179
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases
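
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS includes its terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2066888, 2068684)
print(length)                     # 1797
print(protein_length_aa(length))  # 598
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.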


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.577107
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCATT TCACGGATGA GATTGACGAA GACCTGCAGG AAAGGATGGC GCGCTACTTG 
CGCGCTACCC AGGTCCAGGA GACTCATGGC GGCTGGCCGC AATACGTGGG AGGCGCGATC
GACCTGTCGT GCACGGTCAA GGCCTATTAT GCGCTCAAGG CCGCCGGCGA CTCCCCCGAA
GCGCCACATA TGCGTCGGGC GCGCGAGGCG GTGCTGGCGC TTGGCGGCGC CGCCAAAAGC
AACGTATTCA CCCGCATCCT ACTGGCGATG TTCGAGCAGG TCCCCTGGCG TGCCGTACCC
TATCTGCCGG TCGAAATCAT GCTGTTGCCG CGCTGGGCGC CAATCCATAT CGAAAAGATG
TCCTACTGGG CGCGGACCAC CCTGGTGCCC TTGACGATCC TCTGTTCGCT CAAGGCCCGG
GCTGCCAACC CCAAGCGAGT GGACATCCGC GAGCTGTTCG TCACTGCGCC GGAGCAGGAG
CGCCACTATT TCCTCCGTGG TGGCCTGCTG AACCGCATTT TCCTGGGACT GGACAAATTT
GCTCGTACGC TTGACCGCTG GATGCCGAAA TCGCTGCGTC AGCATGCTAT CCGCAAGGCC
GAGGCCTGGT TTTTGCCGCG CATGAATGGC GAAGACGGCC TGGGGGCGAT TTTCCCGCCG
ATGGTCAATT GCTACGAAGC GATGATCCTG CTCGGCTATC CCAAGGATCA TCCAGCCCGA
AAGACCTGCT TGCGCTCGAT CCAGAAGCTG ATTGTCCATC GTGACGACGG CTCGGCCTAC
TGCCAGCCTT GCGTTTCGCC GGTATGGGAT ACCGCCTGGA GTGCCATGGC CCTGATCCAC
AGTGGTGACG ATACGGCAAC CCAAACGGCC ATCGCACGGG CCGGCGACTG GCTCGTCCAG
CGGCAGGAGC TGGATTGCAG GGGCGACTGG GAAGCGCAGG CCCCCCAAGC GGCTCCAGGC
GGCTGGGCCT TCCAGTATGC CAACGGCTAT TATCCGGACA TCGACGACAC GGCACTGGTG
GCAGCTCTGT TGCACATATC GGACCGTCGT CGCGGACAGC CCGGGCAACA TGCCTTCAAC
ATCGATCGCG CCGTGGATTG GATGCTGGCC TTGCAGTCGA GGAATGGCGG CTTTGCCGCC
TTCGATGCCG ACAACACCCA CTACTACCTC AACGCCATTC CCTTCGCAGA CCATGGAGCC
TTGCTCGACC CCCCGACCGA GGACGTCTCC GGCCGCGTGG CCGCCTGCCT CGGCATATTG
AAGCGCGATC AAGACCGTGA CGGCCTGCGC CGCTGCATCG ACTACCTGCG CACGACCCAA
CAGCCCGACG GCAGTTGGTG GGGCCGCTGG GGCAGCAACT ATATCTACGG CACCTGGAGC
GCGCTTTCCG GCCTTGCCTT GGCCGGCGAG GACCTCCGCC AACCCTACCT GCGCAAATCC
GTCGACTGGC TACGCACGCG CCAGCACCCC GATGGCGGCT GGGGCGAGAC CAACGACAGC
TACATCGACC CGCACCTGGC CGGTACCAAC GCAGGAATCA GCACGCCACA CTCGACGGCA
TGGGCAGTAC TGGCCCAACT GGCCATGGGC GAAGTGGAGT CCGACTCGGT GAGACGCGGT
ATCGCTTTTC TGCTCGCCTG CCAGCAAACC GACGGACTCT GGTCCCATCC CTCGCACAAC
GCCCCCGGTT TTCCACGGGT TTACTACCTC AAGTATCACG GTTATGCCGC CTATTTCCCT
CTATACGCGC TGGCCCGCTA TCGGCATCTG TTGAATCGCT CCAGGGAGCA GCGGTGA
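
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any statistics are computed. The following is a minimal sketch of how a GC percentage like the one reported in this record could be recomputed; the helper names are hypothetical, and the database's exact rounding rule is not stated in the record.

```python
def join_blocks(blocked_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Usage with the first two blocks of the listing above.
example = join_blocks("ATGATGCATT TCACGGATGA")
print(len(example))  # 20
print(f"{gc_fraction(example):.0%}")
```

Pasting the full listing into `join_blocks` should give a string whose length equals the Gene Length field, which is a quick check that nothing was lost in copying.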
 
Protein sequence
MMHFTDEIDE DLQERMARYL RATQVQETHG GWPQYVGGAI DLSCTVKAYY ALKAAGDSPE 
APHMRRAREA VLALGGAAKS NVFTRILLAM FEQVPWRAVP YLPVEIMLLP RWAPIHIEKM
SYWARTTLVP LTILCSLKAR AANPKRVDIR ELFVTAPEQE RHYFLRGGLL NRIFLGLDKF
ARTLDRWMPK SLRQHAIRKA EAWFLPRMNG EDGLGAIFPP MVNCYEAMIL LGYPKDHPAR
KTCLRSIQKL IVHRDDGSAY CQPCVSPVWD TAWSAMALIH SGDDTATQTA IARAGDWLVQ
RQELDCRGDW EAQAPQAAPG GWAFQYANGY YPDIDDTALV AALLHISDRR RGQPGQHAFN
IDRAVDWMLA LQSRNGGFAA FDADNTHYYL NAIPFADHGA LLDPPTEDVS GRVAACLGIL
KRDQDRDGLR RCIDYLRTTQ QPDGSWWGRW GSNYIYGTWS ALSGLALAGE DLRQPYLRKS
VDWLRTRQHP DGGWGETNDS YIDPHLAGTN AGISTPHSTA WAVLAQLAMG EVESDSVRRG
IAFLLACQQT DGLWSHPSHN APGFPRVYYL KYHGYAAYFP LYALARYRHL LNRSREQR
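
The protein sequence corresponds to a translation of the gene sequence under translation table 11. For sense and stop codons, table 11 assigns the same amino acids as the standard genetic code; it differs in which codons may act as start codons. The sketch below builds the codon table from the conventional TCAG ordering and translates up to the first stop codon. The names are hypothetical, and ambiguous bases such as N are not handled (they would raise a `KeyError`).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, with each codon position ordered T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    sequence = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGATGCATT TCACGGATGA GATTGACGAA"))  # MMHFTDEIDE
```

The printed residues match the first block of the protein sequence, and the final codons of the gene (`CAG CGG TGA`) translate to `QR` followed by a stop, matching its last two residues.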