Gene Information |
Locus tag | Acid345_1747 |
Symbol | |
ID | 4072014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2119388 |
End bp | 2120833 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983755 |
Product | carotene 7,8-desaturase |
Protein accession | YP_590822 |
Protein GI | 94968774 |
COG category | [S] Function unknown |
COG ID | [COG3349] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03467] squalene-associated FAD-dependent desaturase |
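
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above.
length = gene_length(2119388, 2120833)
print(length)                  # 1446
print(protein_length(length))  # 481
```

Both results agree with the Gene Length and Protein Length fields in the record.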
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.969616 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCGCCA CTCTCACCGC AACCCAATTG AAAGCAGAGC AACCGACGGG CAAGAAGGCA CGTGTGGCCA TTGTCGGCGG CGGTCTCGCC GGCCTCGCAG CCGGATGCGC CCTCGCGGAC GCCGGCTTCT CGGTGAAGCT CTTCGAGCGC AAGCCGTTCC TCGGTGGACG CGCGTCGTCC TATCAGCATC CCGCTACCGG CGAAGTCGTG GACAACTGCC AGCACGTGCT GCTCGGCTGC TGCACCAATC TCCTCGACTT TTACAAACGC CTCGGTGTCG AAGACCAGAT CCGCTGGTTT GAGCAGTTGA CCTTCATGCT CCCCAACGGC AAAGCTGGAA CCATCGAGCC TTCCGGCCTG CCCGCGCCGC TCCACGCCTC GCCGGCGTTT TTGAAGTTCA AAGTGCTCAG TCTCGGCGAT AAGCTTTCTA TCGCGCGCGC CATGCTCGCG CTCATGCGCG GACTGCCGAA AGAATCCGGA GACAATTTCC TTTCCTGGCT CAAGCGCCAC GGCCAAACCG AGCACGCCAT CAATCGCTTC TGGGCGCCGG TGTTGATTAG CGCGCTCAAT GATGATCTCG ACCAGGTTTC CGTTCGGTAC GCCGCGATGG TTTTTCGCGA GTCGTTCCTG AAGTCGGCGG AGGCAGGGAA GATGGGCGTT CCTGCGGCGC CACTGAGCGA CATCTACGGG CGTGCCGGCG AATACATTGA GAAGCGTGGT GGCGAAGTCG TGTTGCGAGC CTCGGTAGAC CAACTCACTT TGCAGGATTC GCGCGTTCTG TTGCGCGTGA ACGGCGAGCA GATCGAAAGC GATTATGTCG TTCTCGCAGC GCCCTTTTTC GAATCCGTAA AACTGCTCCC AGAAGCCGAC AGTGAGGGGC TCCGCTCACA GATTGGCGAA CTCAAGACCG TGCCGATCAC CGGCATTCAC TTTTGGTTCG ACCGCGAAGT CACGCCGCTG GAGCACGCTG TGCTTCTCGA TCGAACCATC CAGTGGATGT TCCAAAAGTC GAAGCTCCTC CGCGGCCAAC GTGACGAAGG CGCGCCTCTC GCTGCCGGTA GCCACATTGA ACTCGTAGTG AGCTCTTCCA AATCGCTGCT GACCATGGGC CGGAATGAGA TTCTCGATCT CGCGTTGAAA GAGTTCTACG AGTTCTTTCC GCAGGCTAAA GAAGCGCGCG TCCTGAAGTC GGCTGTGATC AAAGAGGTGC ACGCCACGTT TTCACCCGCG CCGCAGGGAG ATCGCTACCG CCCGCTCCCA ATCACGCCGT GGCCGCGTAT TTTTCTAAGT GGCGATTGGA CTGCCACTGG CTGGCCTGCC ACCATGGAAG GTGCAGTGCG CGGCGGATAC CTTACGGCAG AAGCGTTGAG TTTCGCCACG GGAAATCAAC GCAAGTTCCT GGTGCCTGAT CTCGGCGCCA AGGGCCTTAT GAAACTGTTC CCTTAG
Protein sequence | MSATLTATQL KAEQPTGKKA RVAIVGGGLA GLAAGCALAD AGFSVKLFER KPFLGGRASS YQHPATGEVV DNCQHVLLGC CTNLLDFYKR LGVEDQIRWF EQLTFMLPNG KAGTIEPSGL PAPLHASPAF LKFKVLSLGD KLSIARAMLA LMRGLPKESG DNFLSWLKRH GQTEHAINRF WAPVLISALN DDLDQVSVRY AAMVFRESFL KSAEAGKMGV PAAPLSDIYG RAGEYIEKRG GEVVLRASVD QLTLQDSRVL LRVNGEQIES DYVVLAAPFF ESVKLLPEAD SEGLRSQIGE LKTVPITGIH FWFDREVTPL EHAVLLDRTI QWMFQKSKLL RGQRDEGAPL AAGSHIELVV SSSKSLLTMG RNEILDLALK EFYEFFPQAK EARVLKSAVI KEVHATFSPA PQGDRYRPLP ITPWPRIFLS GDWTATGWPA TMEGAVRGGY LTAEALSFAT GNQRKFLVPD LGAKGLMKLF P
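
The relationship between the two sequences above can be illustrated with a short sketch that strips the 10-character grouping, computes the GC fraction, and translates codons until the first stop. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code and differing only in its permitted start codons. The sketch does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The helper names are hypothetical, and only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace grouping used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 1 until the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in the record.
prefix = "ATGAGCGCCA CTCTCACCGC AACCCAATTG"
print(translate(prefix))  # MSATLTATQL
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.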