Gene Information |
Locus tag | Cphamn1_1536 |
Symbol | |
ID | 6375214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1660434 |
End bp | 1661504 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642684029 |
Product | phytase |
Protein accession | YP_001959943 |
Protein GI | 189500473 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4247] 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) |
TIGRFAM ID | |
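The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(1660434, 1661504)   # Start bp, End bp from the record
    print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields.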
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.4705 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG TTTTACCTAT CCTTGCATTG ATTGGAGTCG GCGGAATACT CGCATCCTGT CATAACAACA TCGAACAAAC GATCGAACCT TTGGCCGTCA CTGATTCCGT TCGTCATGAC AGTGAAGATC CGGCCATATG GATAAACAAA GAAAATCCAT CAAAAAGCCT TGTCCTTGCA ACAGACAAGC ATAAAGACGG TGCGTTGTAT GTTTTCGACC TTGAAGGCAA AGCAATACAC AAAAAAACCA TCAAAGGCCT TGCAAGACCC AACAATGTTG ATGTGGGCTA CGGCTTCCCT CTGAACGGGA ACAACGTCGA CATAGCGGTC GTCACAGAAA GGCTTGAAAA CCGGATACGT ATATTCCGGT TACCCGATAT GACCGCTATC GACAATGGAG GCGTCCCGGT CTTTCAAGGG GAAGAGTACA ACGCCCCGAT GGGTATAGCT TTTTACAAAA GACCTTCGGA CGGAAAAATG TATGTTATTG TCAGCCGGAA ACAGGGGCCA ACCGACGGCA CATACCTCTG GCAGTATCTC CTCGAAGACA GTGGTAATGG ATATATTACC GCTCATAAAG CAAGGGCGTT CGGACAATGG AGCGGCCAAC AGGAAATCGA GGCTGTTGCA GTAGACAATG AGCTGGGTTA TGTCTATTAT TCCGATGAAT GTGTGGGAGT CAGGAAATAT CATGCCGATC CTGAAACCCC GGATGCGAAC CGGGAACTAT CCCTGTTTGC CACCAAAGGC TTTGCCGAAG ACCATGAAGG CGTCGCCATA TGGAAAACCG GTGAAACCGA CGGCTATATT ATCGTTTCGG ATCAGGCAGC AGGAAAATTA CGGCTTTATC CCAGAAACGG TAAAGACCTT CATGAACCTC ACAAGCACGA GCTGGTCGGT ATTGTGCAAA CCGGCGCAAA AGAAACCGAC GGAATCGAAG CTGCTGCAGA GCTCGTAACA GAGGAATACC CATCAGGCCT TCTGGTGGCC ATGTCCGACG ACAAAACCTA TCACTACTAC TCCCTTAAGG ATATTCCCGA TAAACAAGAT AAGCCCCGTC AATCGCACTA A
|
Protein sequence | MKKVLPILAL IGVGGILASC HNNIEQTIEP LAVTDSVRHD SEDPAIWINK ENPSKSLVLA TDKHKDGALY VFDLEGKAIH KKTIKGLARP NNVDVGYGFP LNGNNVDIAV VTERLENRIR IFRLPDMTAI DNGGVPVFQG EEYNAPMGIA FYKRPSDGKM YVIVSRKQGP TDGTYLWQYL LEDSGNGYIT AHKARAFGQW SGQQEIEAVA VDNELGYVYY SDECVGVRKY HADPETPDAN RELSLFATKG FAEDHEGVAI WKTGETDGYI IVSDQAAGKL RLYPRNGKDL HEPHKHELVG IVQTGAKETD GIEAAAELVT EEYPSGLLVA MSDDKTYHYY SLKDIPDKQD KPRQSH
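The sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch of how such a field might be processed, the code below strips the spacing, computes the GC fraction, and translates codons with the amino-acid assignments of NCBI translation table 11 (which are the same as those of the standard code; table 11 differs only in its permitted start codons). The helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the Gene sequence field.
    sample = "ATGAAAAAAG TTTTACCTAT CCTTGCATTG"
    print(translate(sample))      # first ten residues of the protein
    print(gc_fraction(sample))    # GC fraction of the sample only, not the whole gene
```

The first ten codons translate to MKKVLPILAL, which matches the first block of the Protein sequence field. The GC fraction of a 30-base sample is not expected to match the 48% reported for the whole gene.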