Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1740 |
Symbol | |
ID | 4072007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2111617 |
End bp | 2113590 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637983748 |
Product | squalene cyclase |
Protein accession | YP_590815 |
Protein GI | 94968767 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGACA GACATATTCA ATCGGAAATA ACGTTCGGCA AGATTGACGG CATCCGCGAG CGCATTCAAC AGGCGATGGA TGCCGCCAAA CGGTATCTCT TCTCGAAGCA GGATCCCGAA GGTTTCTGGT GCGGTGAACT CGAAGCCGAC ACCACGCTGC AATCCGATTA CATCGTGATG CATACACTGC TCGGCACCGG CGATCCGGTG AAGATGCAGA AGGCCGGGAA GCAGATCCTG CAGCACCAGA ATCCAGACGG CGGGTGGAAT ATCTACCCCG ACGGCCCTTC GAACATCAGC GCGGCTGTGA AGGCGTATTT CTCGCTCAAG CTGATCGGTC ACAAGCCCGA TGAGCCGGAG ATGACGAAGG CCCGGGAGTG GATCCTGGCC CATGGCGGCG TGACTGCCTG CAACACATTT TCGAAGATGT ATCTCTGCTT TTTCGGCCAG TACGACTACG ACACGGTTCC AGCTATCCCG CCGGAGATCG TGCTCTTTCC GAACTGGTTC TGGTTCAACC TGTACGAGAT ATCGTCGTGG TCGCGCGGCA TTCTTGTCCC GCTCGCAATC TGCTACGCGA AAAAACCATT CAAGAAAATT CCCGACGAAG CAAACATCGA CGAGCTGTTC GTCGAAGGGC GTCACGCCAA TTTGCATTTG ACCTGGGACA AGAAGCCTTT TTCGTGGCGG AACTTCTTCC TCGTGCTTAA CAACATGGTG CACTTTTTCG AGCGCGTGCA CGTGCGTCCG CTGCGCAAGC TCGCAATGAA GCGGGCAGAG AAATGGATGC TCGAGCGCCT CGAGATGAGT GACGGACTCG GTGGGATCTA TCCTGCGATT TTGAATTCAA TCATTGCGCT GCGCGCACTT GGATATTCCA CGGACGATCC GCAGGTGATC CGTGCGATGG ACGAGTTCGA GAAGCTCGGT ATCGAGGAAG ACGACACATT CCGCATGCAA CCGTGCATGT CACCAGTGTG GGACACGGCA TACGCGCTGT ATGCGCTTGG CGAAGCCGGC GTGCCGGGCA GCGATCCACG CATGCAAAAG GCCGCTGAGT GGATGCTGAA GAAGCAGGTG ACGCACAAGG GCGATTGGGC GGTGAAAGTC CGCAACGTGC AGCCGGGTGG CTGGTACTTC GAGTTCAATA ACGAGTTCTA TCCCGACGTG GACGACACCG CTCAGGTGAT TCTGTCGCTG AACCACGTGC GGACATCCAA CGAACGCTAC CAGGACGACA CCGTCAAGCG CGCTCTCGAC TGGCAACTCG CCATGCAGTG CAAGAACGGC GGCTGGGCCT CGTTTGATAA AGACAACAAC AAGATGGTCT TCCAGTACAT CCCGTTCGCT GACCACAACG CCATGCTCGA TCCGGCGACG GTCGACATTA CGGGACGCGT CCTGGAAGCC CTCTCGCATC ACGGGTACTC GCTGAAGGAC AAAGTTGTGC AGCGCGCCGT CAAGTTCATT CAGAGTGAGC AAGAACCTGA CGGTTCCTGG TTTGGTCGCT GGGGCGTGAA CTACATCTAC GGCACCATGC TTTGCCTCCG CGGACTCGCG GCGGTCGGCG TGGATCACCA CGAACCCATG GTGCAACAGG CAGCCGAATG GTTGCGTATG GTGCAGAACC CCGACGGTGG TTGGGGCGAA AGCGTGGGCT CATACGACGA TCCGAAATTG CGCGGGCAGG GACCGAGCAC GGCCTCGCAG ACAGCATGGG CTGTAATGGG GTTGCTGGCG GCGAATGATC TGCGGAGCGA TTCGGTGACG CGCGGTATCG CGTGGCTGCT TGAGAACCAA AAGCCGAATG GATCTTGGTG GGAAAAGTGG ATCACTGGCA CCGGTTTCCC GCGCGTGTTC TACCTCAAAT ACACCATGTA CGCCGAGTAT TTCCCGCTCA TCGCTTTTGC CGAGTATCTG CGGCGTTTGA ATACGCCGCT CGATGAGAAG GTGAAGCTCG GACCACAGGC GTAG
|
Protein sequence | MDDRHIQSEI TFGKIDGIRE RIQQAMDAAK RYLFSKQDPE GFWCGELEAD TTLQSDYIVM HTLLGTGDPV KMQKAGKQIL QHQNPDGGWN IYPDGPSNIS AAVKAYFSLK LIGHKPDEPE MTKAREWILA HGGVTACNTF SKMYLCFFGQ YDYDTVPAIP PEIVLFPNWF WFNLYEISSW SRGILVPLAI CYAKKPFKKI PDEANIDELF VEGRHANLHL TWDKKPFSWR NFFLVLNNMV HFFERVHVRP LRKLAMKRAE KWMLERLEMS DGLGGIYPAI LNSIIALRAL GYSTDDPQVI RAMDEFEKLG IEEDDTFRMQ PCMSPVWDTA YALYALGEAG VPGSDPRMQK AAEWMLKKQV THKGDWAVKV RNVQPGGWYF EFNNEFYPDV DDTAQVILSL NHVRTSNERY QDDTVKRALD WQLAMQCKNG GWASFDKDNN KMVFQYIPFA DHNAMLDPAT VDITGRVLEA LSHHGYSLKD KVVQRAVKFI QSEQEPDGSW FGRWGVNYIY GTMLCLRGLA AVGVDHHEPM VQQAAEWLRM VQNPDGGWGE SVGSYDDPKL RGQGPSTASQ TAWAVMGLLA ANDLRSDSVT RGIAWLLENQ KPNGSWWEKW ITGTGFPRVF YLKYTMYAEY FPLIAFAEYL RRLNTPLDEK VKLGPQA
|
| |