Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_20760 |
Symbol | |
ID | 7761001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2066888 |
End bp | 2068684 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643804971 |
Product | squalene-hopene cyclase |
Protein accession | YP_002799252 |
Protein GI | 226944179 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.577107 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCATT TCACGGATGA GATTGACGAA GACCTGCAGG AAAGGATGGC GCGCTACTTG CGCGCTACCC AGGTCCAGGA GACTCATGGC GGCTGGCCGC AATACGTGGG AGGCGCGATC GACCTGTCGT GCACGGTCAA GGCCTATTAT GCGCTCAAGG CCGCCGGCGA CTCCCCCGAA GCGCCACATA TGCGTCGGGC GCGCGAGGCG GTGCTGGCGC TTGGCGGCGC CGCCAAAAGC AACGTATTCA CCCGCATCCT ACTGGCGATG TTCGAGCAGG TCCCCTGGCG TGCCGTACCC TATCTGCCGG TCGAAATCAT GCTGTTGCCG CGCTGGGCGC CAATCCATAT CGAAAAGATG TCCTACTGGG CGCGGACCAC CCTGGTGCCC TTGACGATCC TCTGTTCGCT CAAGGCCCGG GCTGCCAACC CCAAGCGAGT GGACATCCGC GAGCTGTTCG TCACTGCGCC GGAGCAGGAG CGCCACTATT TCCTCCGTGG TGGCCTGCTG AACCGCATTT TCCTGGGACT GGACAAATTT GCTCGTACGC TTGACCGCTG GATGCCGAAA TCGCTGCGTC AGCATGCTAT CCGCAAGGCC GAGGCCTGGT TTTTGCCGCG CATGAATGGC GAAGACGGCC TGGGGGCGAT TTTCCCGCCG ATGGTCAATT GCTACGAAGC GATGATCCTG CTCGGCTATC CCAAGGATCA TCCAGCCCGA AAGACCTGCT TGCGCTCGAT CCAGAAGCTG ATTGTCCATC GTGACGACGG CTCGGCCTAC TGCCAGCCTT GCGTTTCGCC GGTATGGGAT ACCGCCTGGA GTGCCATGGC CCTGATCCAC AGTGGTGACG ATACGGCAAC CCAAACGGCC ATCGCACGGG CCGGCGACTG GCTCGTCCAG CGGCAGGAGC TGGATTGCAG GGGCGACTGG GAAGCGCAGG CCCCCCAAGC GGCTCCAGGC GGCTGGGCCT TCCAGTATGC CAACGGCTAT TATCCGGACA TCGACGACAC GGCACTGGTG GCAGCTCTGT TGCACATATC GGACCGTCGT CGCGGACAGC CCGGGCAACA TGCCTTCAAC ATCGATCGCG CCGTGGATTG GATGCTGGCC TTGCAGTCGA GGAATGGCGG CTTTGCCGCC TTCGATGCCG ACAACACCCA CTACTACCTC AACGCCATTC CCTTCGCAGA CCATGGAGCC TTGCTCGACC CCCCGACCGA GGACGTCTCC GGCCGCGTGG CCGCCTGCCT CGGCATATTG AAGCGCGATC AAGACCGTGA CGGCCTGCGC CGCTGCATCG ACTACCTGCG CACGACCCAA CAGCCCGACG GCAGTTGGTG GGGCCGCTGG GGCAGCAACT ATATCTACGG CACCTGGAGC GCGCTTTCCG GCCTTGCCTT GGCCGGCGAG GACCTCCGCC AACCCTACCT GCGCAAATCC GTCGACTGGC TACGCACGCG CCAGCACCCC GATGGCGGCT GGGGCGAGAC CAACGACAGC TACATCGACC CGCACCTGGC CGGTACCAAC GCAGGAATCA GCACGCCACA CTCGACGGCA TGGGCAGTAC TGGCCCAACT GGCCATGGGC GAAGTGGAGT CCGACTCGGT GAGACGCGGT ATCGCTTTTC TGCTCGCCTG CCAGCAAACC GACGGACTCT GGTCCCATCC CTCGCACAAC GCCCCCGGTT TTCCACGGGT TTACTACCTC AAGTATCACG GTTATGCCGC CTATTTCCCT CTATACGCGC TGGCCCGCTA TCGGCATCTG TTGAATCGCT CCAGGGAGCA GCGGTGA
|
Protein sequence | MMHFTDEIDE DLQERMARYL RATQVQETHG GWPQYVGGAI DLSCTVKAYY ALKAAGDSPE APHMRRAREA VLALGGAAKS NVFTRILLAM FEQVPWRAVP YLPVEIMLLP RWAPIHIEKM SYWARTTLVP LTILCSLKAR AANPKRVDIR ELFVTAPEQE RHYFLRGGLL NRIFLGLDKF ARTLDRWMPK SLRQHAIRKA EAWFLPRMNG EDGLGAIFPP MVNCYEAMIL LGYPKDHPAR KTCLRSIQKL IVHRDDGSAY CQPCVSPVWD TAWSAMALIH SGDDTATQTA IARAGDWLVQ RQELDCRGDW EAQAPQAAPG GWAFQYANGY YPDIDDTALV AALLHISDRR RGQPGQHAFN IDRAVDWMLA LQSRNGGFAA FDADNTHYYL NAIPFADHGA LLDPPTEDVS GRVAACLGIL KRDQDRDGLR RCIDYLRTTQ QPDGSWWGRW GSNYIYGTWS ALSGLALAGE DLRQPYLRKS VDWLRTRQHP DGGWGETNDS YIDPHLAGTN AGISTPHSTA WAVLAQLAMG EVESDSVRRG IAFLLACQQT DGLWSHPSHN APGFPRVYYL KYHGYAAYFP LYALARYRHL LNRSREQR
|
| |