Gene Acid345_1747 details

Gene Information

Locus tag: Acid345_1747
Symbol:
ID: 4072014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2119388
End bp: 2120833
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 60%
IMG OID: 637983755
Product: carotene 7,8-desaturase
Protein accession: YP_590822
Protein GI: 94968774
COG category: [S] Function unknown
COG ID: [COG3349] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03467] squalene-associated FAD-dependent desaturase
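
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information entries above.
start_bp = 2119388
end_bp = 2120833
gene_length_bp = 1446
protein_length_aa = 481

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons (3 bases each).
codon_count, leftover = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not translated.
translated_codons = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("whole number of codons:", leftover == 0)
print("codons minus stop matches protein length:", translated_codons == protein_length_aa)
```

Under these assumptions, the recorded gene length and protein length follow directly from the coordinates.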


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.969616
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCCA CTCTCACCGC AACCCAATTG AAAGCAGAGC AACCGACGGG CAAGAAGGCA 
CGTGTGGCCA TTGTCGGCGG CGGTCTCGCC GGCCTCGCAG CCGGATGCGC CCTCGCGGAC
GCCGGCTTCT CGGTGAAGCT CTTCGAGCGC AAGCCGTTCC TCGGTGGACG CGCGTCGTCC
TATCAGCATC CCGCTACCGG CGAAGTCGTG GACAACTGCC AGCACGTGCT GCTCGGCTGC
TGCACCAATC TCCTCGACTT TTACAAACGC CTCGGTGTCG AAGACCAGAT CCGCTGGTTT
GAGCAGTTGA CCTTCATGCT CCCCAACGGC AAAGCTGGAA CCATCGAGCC TTCCGGCCTG
CCCGCGCCGC TCCACGCCTC GCCGGCGTTT TTGAAGTTCA AAGTGCTCAG TCTCGGCGAT
AAGCTTTCTA TCGCGCGCGC CATGCTCGCG CTCATGCGCG GACTGCCGAA AGAATCCGGA
GACAATTTCC TTTCCTGGCT CAAGCGCCAC GGCCAAACCG AGCACGCCAT CAATCGCTTC
TGGGCGCCGG TGTTGATTAG CGCGCTCAAT GATGATCTCG ACCAGGTTTC CGTTCGGTAC
GCCGCGATGG TTTTTCGCGA GTCGTTCCTG AAGTCGGCGG AGGCAGGGAA GATGGGCGTT
CCTGCGGCGC CACTGAGCGA CATCTACGGG CGTGCCGGCG AATACATTGA GAAGCGTGGT
GGCGAAGTCG TGTTGCGAGC CTCGGTAGAC CAACTCACTT TGCAGGATTC GCGCGTTCTG
TTGCGCGTGA ACGGCGAGCA GATCGAAAGC GATTATGTCG TTCTCGCAGC GCCCTTTTTC
GAATCCGTAA AACTGCTCCC AGAAGCCGAC AGTGAGGGGC TCCGCTCACA GATTGGCGAA
CTCAAGACCG TGCCGATCAC CGGCATTCAC TTTTGGTTCG ACCGCGAAGT CACGCCGCTG
GAGCACGCTG TGCTTCTCGA TCGAACCATC CAGTGGATGT TCCAAAAGTC GAAGCTCCTC
CGCGGCCAAC GTGACGAAGG CGCGCCTCTC GCTGCCGGTA GCCACATTGA ACTCGTAGTG
AGCTCTTCCA AATCGCTGCT GACCATGGGC CGGAATGAGA TTCTCGATCT CGCGTTGAAA
GAGTTCTACG AGTTCTTTCC GCAGGCTAAA GAAGCGCGCG TCCTGAAGTC GGCTGTGATC
AAAGAGGTGC ACGCCACGTT TTCACCCGCG CCGCAGGGAG ATCGCTACCG CCCGCTCCCA
ATCACGCCGT GGCCGCGTAT TTTTCTAAGT GGCGATTGGA CTGCCACTGG CTGGCCTGCC
ACCATGGAAG GTGCAGTGCG CGGCGGATAC CTTACGGCAG AAGCGTTGAG TTTCGCCACG
GGAAATCAAC GCAAGTTCCT GGTGCCTGAT CTCGGCGCCA AGGGCCTTAT GAAACTGTTC
CCTTAG
 
Protein sequence
MSATLTATQL KAEQPTGKKA RVAIVGGGLA GLAAGCALAD AGFSVKLFER KPFLGGRASS 
YQHPATGEVV DNCQHVLLGC CTNLLDFYKR LGVEDQIRWF EQLTFMLPNG KAGTIEPSGL
PAPLHASPAF LKFKVLSLGD KLSIARAMLA LMRGLPKESG DNFLSWLKRH GQTEHAINRF
WAPVLISALN DDLDQVSVRY AAMVFRESFL KSAEAGKMGV PAAPLSDIYG RAGEYIEKRG
GEVVLRASVD QLTLQDSRVL LRVNGEQIES DYVVLAAPFF ESVKLLPEAD SEGLRSQIGE
LKTVPITGIH FWFDREVTPL EHAVLLDRTI QWMFQKSKLL RGQRDEGAPL AAGSHIELVV
SSSKSLLTMG RNEILDLALK EFYEFFPQAK EARVLKSAVI KEVHATFSPA PQGDRYRPLP
ITPWPRIFLS GDWTATGWPA TMEGAVRGGY LTAEALSFAT GNQRKFLVPD LGAKGLMKLF
P
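
The sequences in this section are printed in blocks of 10 residues separated by spaces and line breaks, so they need light cleaning before any computation. The sketch below shows one way to do that and to compute a GC fraction or check for start and stop codons. The function names are hypothetical helpers, not part of any database tool, and the start check accepts only ATG for simplicity, although translation table 11 also permits alternative start codons.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def has_start_and_stop(dna: str) -> bool:
    """Check for a whole number of codons, an ATG start, and a standard stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna[-3:] in {"TAA", "TAG", "TGA"}
    )


# Usage on a short illustrative fragment; paste the full gene sequence
# from above in place of this string to analyse the whole gene.
fragment = clean_sequence("ATGAGCGCCA CTCTCACCGC\nCCTTAG")
print(len(fragment), round(gc_fraction(fragment), 3), has_start_and_stop(fragment))
```

Applied to the full gene sequence, the same helpers give a value that can be compared with the GC content and gene length reported in the Gene Information entries.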