Gene Acid345_1740 details


Gene Information

Locus tag: Acid345_1740
Symbol:
ID: 4072007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2111617
End bp: 2113590
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 57%
IMG OID: 637983748
Product: squalene cyclase
Protein accession: YP_590815
Protein GI: 94968767
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases
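
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2111617
end_bp = 2113590
gene_length_bp = 1974
protein_length_aa = 657

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
codons = span // 3
expected_residues = codons - 1

print(span, codons, expected_residues)  # 1974 658 657
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus one matches the listed protein length.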


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGACA GACATATTCA ATCGGAAATA ACGTTCGGCA AGATTGACGG CATCCGCGAG 
CGCATTCAAC AGGCGATGGA TGCCGCCAAA CGGTATCTCT TCTCGAAGCA GGATCCCGAA
GGTTTCTGGT GCGGTGAACT CGAAGCCGAC ACCACGCTGC AATCCGATTA CATCGTGATG
CATACACTGC TCGGCACCGG CGATCCGGTG AAGATGCAGA AGGCCGGGAA GCAGATCCTG
CAGCACCAGA ATCCAGACGG CGGGTGGAAT ATCTACCCCG ACGGCCCTTC GAACATCAGC
GCGGCTGTGA AGGCGTATTT CTCGCTCAAG CTGATCGGTC ACAAGCCCGA TGAGCCGGAG
ATGACGAAGG CCCGGGAGTG GATCCTGGCC CATGGCGGCG TGACTGCCTG CAACACATTT
TCGAAGATGT ATCTCTGCTT TTTCGGCCAG TACGACTACG ACACGGTTCC AGCTATCCCG
CCGGAGATCG TGCTCTTTCC GAACTGGTTC TGGTTCAACC TGTACGAGAT ATCGTCGTGG
TCGCGCGGCA TTCTTGTCCC GCTCGCAATC TGCTACGCGA AAAAACCATT CAAGAAAATT
CCCGACGAAG CAAACATCGA CGAGCTGTTC GTCGAAGGGC GTCACGCCAA TTTGCATTTG
ACCTGGGACA AGAAGCCTTT TTCGTGGCGG AACTTCTTCC TCGTGCTTAA CAACATGGTG
CACTTTTTCG AGCGCGTGCA CGTGCGTCCG CTGCGCAAGC TCGCAATGAA GCGGGCAGAG
AAATGGATGC TCGAGCGCCT CGAGATGAGT GACGGACTCG GTGGGATCTA TCCTGCGATT
TTGAATTCAA TCATTGCGCT GCGCGCACTT GGATATTCCA CGGACGATCC GCAGGTGATC
CGTGCGATGG ACGAGTTCGA GAAGCTCGGT ATCGAGGAAG ACGACACATT CCGCATGCAA
CCGTGCATGT CACCAGTGTG GGACACGGCA TACGCGCTGT ATGCGCTTGG CGAAGCCGGC
GTGCCGGGCA GCGATCCACG CATGCAAAAG GCCGCTGAGT GGATGCTGAA GAAGCAGGTG
ACGCACAAGG GCGATTGGGC GGTGAAAGTC CGCAACGTGC AGCCGGGTGG CTGGTACTTC
GAGTTCAATA ACGAGTTCTA TCCCGACGTG GACGACACCG CTCAGGTGAT TCTGTCGCTG
AACCACGTGC GGACATCCAA CGAACGCTAC CAGGACGACA CCGTCAAGCG CGCTCTCGAC
TGGCAACTCG CCATGCAGTG CAAGAACGGC GGCTGGGCCT CGTTTGATAA AGACAACAAC
AAGATGGTCT TCCAGTACAT CCCGTTCGCT GACCACAACG CCATGCTCGA TCCGGCGACG
GTCGACATTA CGGGACGCGT CCTGGAAGCC CTCTCGCATC ACGGGTACTC GCTGAAGGAC
AAAGTTGTGC AGCGCGCCGT CAAGTTCATT CAGAGTGAGC AAGAACCTGA CGGTTCCTGG
TTTGGTCGCT GGGGCGTGAA CTACATCTAC GGCACCATGC TTTGCCTCCG CGGACTCGCG
GCGGTCGGCG TGGATCACCA CGAACCCATG GTGCAACAGG CAGCCGAATG GTTGCGTATG
GTGCAGAACC CCGACGGTGG TTGGGGCGAA AGCGTGGGCT CATACGACGA TCCGAAATTG
CGCGGGCAGG GACCGAGCAC GGCCTCGCAG ACAGCATGGG CTGTAATGGG GTTGCTGGCG
GCGAATGATC TGCGGAGCGA TTCGGTGACG CGCGGTATCG CGTGGCTGCT TGAGAACCAA
AAGCCGAATG GATCTTGGTG GGAAAAGTGG ATCACTGGCA CCGGTTTCCC GCGCGTGTTC
TACCTCAAAT ACACCATGTA CGCCGAGTAT TTCCCGCTCA TCGCTTTTGC CGAGTATCTG
CGGCGTTTGA ATACGCCGCT CGATGAGAAG GTGAAGCTCG GACCACAGGC GTAG
 
Protein sequence
MDDRHIQSEI TFGKIDGIRE RIQQAMDAAK RYLFSKQDPE GFWCGELEAD TTLQSDYIVM 
HTLLGTGDPV KMQKAGKQIL QHQNPDGGWN IYPDGPSNIS AAVKAYFSLK LIGHKPDEPE
MTKAREWILA HGGVTACNTF SKMYLCFFGQ YDYDTVPAIP PEIVLFPNWF WFNLYEISSW
SRGILVPLAI CYAKKPFKKI PDEANIDELF VEGRHANLHL TWDKKPFSWR NFFLVLNNMV
HFFERVHVRP LRKLAMKRAE KWMLERLEMS DGLGGIYPAI LNSIIALRAL GYSTDDPQVI
RAMDEFEKLG IEEDDTFRMQ PCMSPVWDTA YALYALGEAG VPGSDPRMQK AAEWMLKKQV
THKGDWAVKV RNVQPGGWYF EFNNEFYPDV DDTAQVILSL NHVRTSNERY QDDTVKRALD
WQLAMQCKNG GWASFDKDNN KMVFQYIPFA DHNAMLDPAT VDITGRVLEA LSHHGYSLKD
KVVQRAVKFI QSEQEPDGSW FGRWGVNYIY GTMLCLRGLA AVGVDHHEPM VQQAAEWLRM
VQNPDGGWGE SVGSYDDPKL RGQGPSTASQ TAWAVMGLLA ANDLRSDSVT RGIAWLLENQ
KPNGSWWEKW ITGTGFPRVF YLKYTMYAEY FPLIAFAEYL RRLNTPLDEK VKLGPQA
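
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It is a minimal example, not the pipeline used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and it treats the 10-base spacing in the listing as formatting to be stripped. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Amino-acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the listing and upper-case it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 60 bases of the gene sequence, copied from the listing above.
first_block = "ATGGACGACA GACATATTCA ATCGGAAATA ACGTTCGGCA AGATTGACGG CATCCGCGAG"
print(translate(first_block))  # MDDRHIQSEITFGKIDGIRE
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be checked in the same way.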