Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_1122 |
Symbol | |
ID | 6355764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 1224512 |
End bp | 1225570 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642668739 |
Product | phytase |
Protein accession | YP_001943170 |
Protein GI | 189346641 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4247] 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGATAG ATTTTATAAG GTGTATTGCC GTTGCGGTCT TTGTGGCGTT ATCAGCATGT TCGGGAATGA ACCGGGATCT GCCCGAAAAT GCCGTCAACC CGGTTGTCGT TTCCGAAAAA GTGCCGCATG ATGCGGATGA TCCCGCTATC TGGGTTAACC ATGCCGATCA TGACGGGAGC ATGATACTCG GGACTGACAA GCATGAGAAT GGGGCGGTTT ACGTTTTCGA TTTACAGGGA CGCATTATCG CAAATAAATG CGTTCACGGG CTTCAGCGTC CAAACAATAT CGATGTCGAA TACGGACTTC TTCTGAACGG TAAGCCGGTC GATATTGCTG TAGTAACAGA ACGTATGAGC GGGAAGCTTC GGGTTTTTAC CCTGCCGGAT ATGAAGGCTG TCGATAAGGG GGGGATTCCC GTATTTACCG GTGAACGGGA TAATGCGCCA ATGGGTGTTG CATTATATAA ACGGAAGCAT GATGGAGCGA TCTATGCAGT TGTAAGCCGC AAACAGGGGC CGGTTGATGG AACGTATCTC TGGCAGTACA GGCTTGAAGA TAGCGGAAAT GGTTTTGTGC GGGCCTCTCT TGTAAGAAAA TTCGGCATAT GGAGTGGTAA AAAGGAGATC GAGGCTGTAG CGGTCGATGA CCGTTCAGGT TTTGTCTATT ATTCCGATGA GGGGGTTGGT GTCAGAAAAT ATCATGCCGA TCCTGATATG AAGGGAGGAG AGAAAGAGCT TGCCCTGTTT GCCACCGATG GTTTTACAAA AGATCATGAA GGTATTGCGG TCTTTTCGAC GACAGATGGC AGCGTTGTCA TCATTTCAGA TCAGGGGGCC GGTCAGCTTC ATCTGTTCAG GGAGTCCGGG TCTGCATCAG ATGGCAGTAA GGGAGTCCGA CGGATAGGGA TTGTAAAAAC AGCGGCAGTT GACACAGACG GGATCGAGGC CTCTTCGGTA CTTTCTACAG CCGGTTTTCC TGCCGGTATT CTTGTGGCTA TGTCGGATGA CCGTACGTTC CAGTATTATT CGCTCAAAGA TCTGGGTATT CTACCCTGA
|
Protein sequence | MRIDFIRCIA VAVFVALSAC SGMNRDLPEN AVNPVVVSEK VPHDADDPAI WVNHADHDGS MILGTDKHEN GAVYVFDLQG RIIANKCVHG LQRPNNIDVE YGLLLNGKPV DIAVVTERMS GKLRVFTLPD MKAVDKGGIP VFTGERDNAP MGVALYKRKH DGAIYAVVSR KQGPVDGTYL WQYRLEDSGN GFVRASLVRK FGIWSGKKEI EAVAVDDRSG FVYYSDEGVG VRKYHADPDM KGGEKELALF ATDGFTKDHE GIAVFSTTDG SVVIISDQGA GQLHLFRESG SASDGSKGVR RIGIVKTAAV DTDGIEASSV LSTAGFPAGI LVAMSDDRTF QYYSLKDLGI LP
|
| |