Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0833 |
Symbol | |
ID | 4072359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1033911 |
End bp | 1035656 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637982842 |
Product | hypothetical protein |
Protein accession | YP_589912 |
Protein GI | 94967864 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTACCATC TGAGAAAGCT GGTGCAGCGC GCCAGACGAG GCGTGGCCTG CGCGCTATTG GCTTTGGCGG CTTGTACTAC CGTTAGTGTG GCAGCTTTCC CGCAAAGTCA GCCGACCATC GGTTCGACGT TTGTACCGCT CGATAGTTGG GTTTATCCGG CGTTAGATCG CCTGCGTGCC CTTGGCTACA CGCACACGCA GTTCATGGGT CTCAGGCCTT GGACACGCCT GATGTGCGCG CGACTCGTTC ACGAAGCTGC ACAGTCTTTG CGGCCCGGTG ACCTCAACGC GACCAAGATG TATAACCAGC TCGCAGATGA GTTCCAGCCG GAACTAGGGT ATCTAGGTGG GGACCCGAGC CAGACGGTCA CAATCGATTC GGTCTACGGT CGAGTTATGG GCATCGGTGG TGACGAGCCG TTGCGCGACA GTTGGCACTT CGGCCAAACC ATCGCGAATG ATTTTGGCAG GCCATTCGGT CAAGGCGTAA ACGCGGTCGT TGGGATGAGC GCGCGCGCAC AGCGGGGAAG ATTCTTCATC GCGTTTCGCG GTGAATATCA GCATGCTCCA GCTTTGCCGG GCTTCGATTC GAATGTACAA AGCGTTATCG CGAAAATTGA TCAAACGAGC AGTGCTCCTC TCCTGTTTCC GCAAGATCAG CGAGACCGTT TCAATCTGCT GGACACATAC GCCGGTGTGG CGTTCGGAGC CTTTGAGCTA ACTTTCGGCA AGCAAAGCCT TTGGTACGGT CCCGGAACCA GCGGGGCATT GCTGTTCAGC AACAATATTG ACCCTCCGTA TATGTTGAGG CTTGATCAAG TGAATCCGGT TCGGCTGCCC TCGTTTCTTA AGTATTTGGG GAATATCCGC ACCGAGTTTT TCTTTGGAAA GTTGTCGGGA CATTCTTTCC CGGCACGACC GTTCATGCAT GGAGAAAAGG TCACTCTCAA GCCGACTGAC AATCTTGAGG TCGGCTTTAC GCGCATGACC GTATTCCTGG GCGAAGGCAA TGGGTTTACT CTCGGGCGCA TCATCCATAG TTATTTCAGT GTCGGAGACA ATCTCGGAAG CAATCGTTCG AACAGCGATC CGGGCGACCG TAAAGGCGGA CTCGATGCAA GCTACCGCGT GCCGGGTTTG CGCGATTGGG TCACGATTTA TACAGATTCC TTTACGGATG ATGACCCTTT ACCGCTTTCT GCTCCGCACC GCGCGGCCTG GAACCCCGGT ATTTACATGC CGAAGCTTCC AGGATTGCCG AGTCTGGATC TCCGTGTGGA AGGGGTAACC ACGGATATCC ACTCCGAAGC GACGGTTGGT CACTTCGTTT ACTACAACGG CATATACAAG GACGGATATA CCCAAAACGG CTTTATCATT GGAAATACAA TCGGACGAGG CGGGCGTGCC ATACAGGCGA CCAGTACCTA CTGGTTTAAC GCGCGCAACG ACATCCAGGT GGGCTTCAAG ACGGGAACGG TGGATTACAG GTATATCCCG GGCGGCGGCG GCCAGAAGGA TTACAACGTT CGCGCCGACT GGTTAGTGAA GAAAAACATC GCCCTGTCTG GATTTGTTCA GTACGAGCAC TGGAGTTTCC CGCTGCTGGC GGCGACCCCG CAAAACAACG TAGCAGCGTG GTTGTCCATC ACCATCGATC CGAAATTGGA ATGGGGTCAC GCCCGCACTG CGCTCCATCG TGATTCCACG AGTCGTCCGT CTACTCAGGA TTTAAAGCAG GAGTAA
|
Protein sequence | MYHLRKLVQR ARRGVACALL ALAACTTVSV AAFPQSQPTI GSTFVPLDSW VYPALDRLRA LGYTHTQFMG LRPWTRLMCA RLVHEAAQSL RPGDLNATKM YNQLADEFQP ELGYLGGDPS QTVTIDSVYG RVMGIGGDEP LRDSWHFGQT IANDFGRPFG QGVNAVVGMS ARAQRGRFFI AFRGEYQHAP ALPGFDSNVQ SVIAKIDQTS SAPLLFPQDQ RDRFNLLDTY AGVAFGAFEL TFGKQSLWYG PGTSGALLFS NNIDPPYMLR LDQVNPVRLP SFLKYLGNIR TEFFFGKLSG HSFPARPFMH GEKVTLKPTD NLEVGFTRMT VFLGEGNGFT LGRIIHSYFS VGDNLGSNRS NSDPGDRKGG LDASYRVPGL RDWVTIYTDS FTDDDPLPLS APHRAAWNPG IYMPKLPGLP SLDLRVEGVT TDIHSEATVG HFVYYNGIYK DGYTQNGFII GNTIGRGGRA IQATSTYWFN ARNDIQVGFK TGTVDYRYIP GGGGQKDYNV RADWLVKKNI ALSGFVQYEH WSFPLLAATP QNNVAAWLSI TIDPKLEWGH ARTALHRDST SRPSTQDLKQ E
|
| |