Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0511 |
Symbol | |
ID | 3747844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 596824 |
End bp | 598278 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637773045 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_378827 |
Protein GI | 78188489 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGCCG AACACCTCCT TTTCCACTCC GATGGTGGCT TTATTCGCAT TCCTGTGTGG GGGCACATTC CCCTCAGCAA GCCGCTTAAA AGCATTCTCT CACACCCTCT TTTTTTGCGC TTAAAGGGTA TTCGCCAGCT CTCGTTTTCG CAGCAGGTCT ATCCGGGCGC AACCCACACT CGCTTTGAGC ACTCGGTTGG CGTTTATCAT TTAATGAAGC TCATTTTGCA GCGAATGGTT ACCAGCTCGT TAGCACAAAA GCTGCAAACG GAGCACTTCC GTTTTGATGA TGCAAGTTGT CGTTTGTTGC TTGCTTCAGC GCTGTTGCAC GATATTGGGC ACTTTCCCCA TGCTCATATT ATTGAGGAGC AAATTCCTCG TGTTGGCAAC GAAGTGGTTT TTTCGCATCA CGAGGAGCTG TGTCGTTATT TTTTGGAGGA AGAGCACCCC AACCATCCCT CGTTGGCAAC GTTGCTGATG GAAGAGTGGC GGGTTGATCC GAACGATGTG GTGGCGCTGA TTAGTGGCAA GCATCGTTTG AGCAAGCTTA TTAGCGGCAC GCTTGACCCC GATAAAATGG ATTACCTCAT GCGCGATGCT CACCATTGCA ACATCCCGTA TGGCAGCATT GATATTGAGC GACTTATTGA ATCCTTTGTG CCCGACCCTG AGCGCCAACG TTTTGCCATT ACTGAAAAAG GGATTGCCCC GCTTGAGAGT TTGCTCTTTG CGAAGTACAT GATGATGCGC AACGTCTATT GGCATCACAC CAGTCGGGCG CTCTCGGCAA TGTTGCGGCG CTTGCTGCAA GATATTGCTG AAGCTGAGTT GCTCCCTGCG GCAACCTTGC GCGAACTCTT TTACCGCAAT GCCGACGACC GTGTGCTCTA TGAATTAAAG CTTCTGCTGC CCGAAGCAAC CCATCCGCTT GTGGCACTTT TGGAGGATGT GCTGATGCGC CGTGTCTATA AACGTGCTAT TACGGTTCAA CCTTATCTGC AAAGTTCGGG CAAAGAGGAT GAGCGCTGGT TCCTTTATAG CAACAACAGC GCTTTGCGCC GCTCAATGGA GGTAGAAATT TGCGAACTGC TCAACAAACG CTATCAGCTT AATCTGCACG GTTATGAAGT GCTGATTGAT TCCCCATCGC GCAAAGATAT TTTCGATTAT GCCGATTTAC AGGAATTGCG CGTCTATCCA ACTCGATCGG AGCACATCCA CTACGCCATG CACTGTGCAT CCGAATATGT TCGATTTGAT GAGCTCAATG AATCAGTCTT TCAATCAAAC TTCATTCTCT CCTTTGAACG CTACACCAAA AAATTCCGTC TGCTCTGCCG TCCCGACCTT GTTGCGCACA TTGTGGAGTT ACGCCACGAT ATTATGAGCC TGTTAGCGCA CGATTATCCG CTGTTTCACT CCACCGTCTC ATCCTCAGCA ACCGAACATT CATAA
|
Protein sequence | MIAEHLLFHS DGGFIRIPVW GHIPLSKPLK SILSHPLFLR LKGIRQLSFS QQVYPGATHT RFEHSVGVYH LMKLILQRMV TSSLAQKLQT EHFRFDDASC RLLLASALLH DIGHFPHAHI IEEQIPRVGN EVVFSHHEEL CRYFLEEEHP NHPSLATLLM EEWRVDPNDV VALISGKHRL SKLISGTLDP DKMDYLMRDA HHCNIPYGSI DIERLIESFV PDPERQRFAI TEKGIAPLES LLFAKYMMMR NVYWHHTSRA LSAMLRRLLQ DIAEAELLPA ATLRELFYRN ADDRVLYELK LLLPEATHPL VALLEDVLMR RVYKRAITVQ PYLQSSGKED ERWFLYSNNS ALRRSMEVEI CELLNKRYQL NLHGYEVLID SPSRKDIFDY ADLQELRVYP TRSEHIHYAM HCASEYVRFD ELNESVFQSN FILSFERYTK KFRLLCRPDL VAHIVELRHD IMSLLAHDYP LFHSTVSSSA TEHS
|
| |