Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1573 |
Symbol | |
ID | 4069011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1921816 |
End bp | 1922826 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637983582 |
Product | hypothetical protein |
Protein accession | YP_590649 |
Protein GI | 94968601 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.209685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0518858 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGCA CCGGGATTGT CGCTGCGCTC ATCCTGCTCG CTGGAATGGC GGGCACGGTT GCTTCTTCGC GGCAACTCGA CAAGATGACC GACCAGATCG CGGTGCAGGA GGTGTTGTAT CTACCCTCGG CGAACACGGT GAAAGCAATC AGCCTTGGCT ACGACGGTTT GATGGCCGAC ATCTATTGGA CGCGCGTAGT GCAGTACTTC GGCCGGAAGC ACCTGGAAGA GGCGCAGGCC TATAAGCTGC TGCCGGGACT TCTGGATATC ACGACGACGC TCGATCCACA TCTCGTGGTG GCTTATCAGT TCGGGGCGTT CTTCCTTTCG CAGAAGCCGC CACAGGGCGC TGGTTCTCCC GATGCAGCCA TCGCGCTGGT GAAGAAAGGG ATTGAAAACA ATCCTGAATA TTGGCGCCTG TATTACGACC TGGGATTCAT CTACTGGCTG GAGAAAAAGG ATCCTAAATC GGCGGCTGAT GCATTTGAAG CGGGCTCGAA GGTTCCCGGA GCTCAGCCGT GGATGCGTAC TATGGCAGCT TCCATGGCCA CTGGGGCGAA CGATCTTGCG ACTGCCCGCA CACTGTGGAC GGAGATTCTG AACGATACGA AAGATCAGGA TATCCGCGCG AATGCAATAA AGCGGTTGAT GTGTGTGGAT TCCGATGAGG TTGTGATCCA GATCCAGAAA TATGTGGATA TGTTCAAAGA GCGTTCCGGA CACAGTGCGT CCAGCATGCG AGAACTGGTC GATGCCGGTA TATTCAATCG AGTTCCGGTC GACCCTACGG GCAGGCCGTA TGAGATCGAT TCTTACGGCC GGGTTGTGGT CAAGGATCCC AAAGCCCTTC CATTCATCAC CCAAGGGTTG CCGCACGGGC AGGAAGTGAA CTACATCTTC GACATCTACG CAATGCAGGA GCGCGGTAAG CGGCTTGAGG AAGAGAAGCG GAAAAAAGAA GCGGAAGAGA AATCCTCGGG CTCCGCATCG CGTCCAAATA ATCAGCAATA G
|
Protein sequence | MNRTGIVAAL ILLAGMAGTV ASSRQLDKMT DQIAVQEVLY LPSANTVKAI SLGYDGLMAD IYWTRVVQYF GRKHLEEAQA YKLLPGLLDI TTTLDPHLVV AYQFGAFFLS QKPPQGAGSP DAAIALVKKG IENNPEYWRL YYDLGFIYWL EKKDPKSAAD AFEAGSKVPG AQPWMRTMAA SMATGANDLA TARTLWTEIL NDTKDQDIRA NAIKRLMCVD SDEVVIQIQK YVDMFKERSG HSASSMRELV DAGIFNRVPV DPTGRPYEID SYGRVVVKDP KALPFITQGL PHGQEVNYIF DIYAMQERGK RLEEEKRKKE AEEKSSGSAS RPNNQQ
|
| |