Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0691 |
Symbol | |
ID | 4071336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 849651 |
End bp | 852590 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637982697 |
Product | hypothetical protein |
Protein accession | YP_589770 |
Protein GI | 94967722 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACTCATC GTCTCTTGCG TACCGCGCTC GTCTGCGTTT CGTTCGTCTC GATTTTCGGA GTTCTCACCT CTGCCCAGGA ATTTCGCGGC GGACTAGCTG GTAGGGTCCA GGACGCCTCT GGCGCCCGCG TCGCCGCAGC CCACGTCCAG GTGCAGGCTC CAGAGTCTTC CGTGCAACGC GAGACCGCCA CAGACGCAAG CGGCAACTTC CGCTTCAGCG ATCTTCCCGT CGGACGCTAC CAGGTGACCG TCAACGCGCA GGGTTTCGGT GTCGCGACCT CCGAAGTCGC GGTGCTGGTC GGCTCAACCC GCGATGTCTT CGTGACCCTC CATCCGCCTT CGGTGAAGGA GAGTGTCGCG GTCTCCGGTG AAGCATCCTC CATCACCATG CAGCCCCTCG ATACCTCAAG CCCCGTCCAC CAGGCGGTGG TCTCGGCACA TGATCTCGAG GAACTCCCGC TCGCCAACCG CAGCTTCGCC AACATGGCGT ATCTCGCACC CGGCACCCAG CCCATCGAAC CCAGCGACCC GACCAAGGCA CGCATCACCG CCGTCGGCAC CGGTGGCAGT TCCGGGTTGA ATAACGAGAA CTCCGTGGAC GGAGGCGACA ACTCCGACGA TTACATCGGC GGCTTCCTCC AGAACTTCTC CACCGACAGC ATCCAGCAGT TCGCCTTCCG CGTCGCGCAG GAAGACGCCG ACACTGGCCG CACGACCGGC GGCTCCGTCG TCATTACAAC CAAGCGCGGC ACCAACGACT GGCACGGCCT ATTCGGCTTC TACGACCGCA CCTCTGGCCT CACCGCTCGC TACCCCATCG ACAATCCTGA GCCCAACCCG AAGCAGCCTT TCTCGCGCCA GAACTACATC TTCAACGGCG GCGGGCCGAT TAAGAAAGAC AAGCTCTGGG GTTTCGGATC GTTGGAATAC GTCCACGAGC GCGCCAGCAT CGCGTACAGC AACGACAGCC TCGCACAGTT CAACGCGCTG GCCTCGCTCG CGCAGGCCGG CTATATTCCC GGCGCTCCCG ATATCGCCGT TCCGCCCTAT GTCATCACTC CCTTTAACGA CTACATCGGC GATGCCCGTC TCGATTGGGC GCAATCCGAT CATTCGCAGT GGTTCTTGCG CGGCGCCACC GACCGTTACA CCACCGAGAA CGACATGGTG CAGCAGGGCA CGCTCCCGTC GGTCGGCGCC ACGACGCGTT CGCTCTACTG GAATTTCGCG CTTAACAACC AGTACCAGTT CTCGAACACC TGGCTCGGTT CCTTCACTTT CGACGCTTCC ATCCTGCATC GCACCGTCAA TCGCAACCAG TACTACGGCT TCTCGCTCGA CTTCCCATTC ACCACCACGC CCAGCGTCAT CACCGGCGCC GACACCTTCG GCGACAACTC CTTCGTAACG CCGATCACTG CCTTCCCAGT CCTGCGTAAC CAGCAGAAGT ATCAGTTCCG CTACGATCTT TCCCACTCGA CGCCGAAGCA CACCATGAAG TTCGGTGTGA ACTTGATCCA CGAGCCGGTC ATTGGGGGAG TATTGGCCTC ACAAGCCGAA ACCGTCATCG CTTACTCGCA GAATCCCGTG GACTACGCGG CCAACCCTGC CAGTTTCGCG TTTACGTCGG CGTATCTCAC CAATCCTGAT ACCTGCAACG AAAACGCTCT CGATCCCGAC ACGGTTTGTA CCGCGACGCC TGCGGGCGAT GGGAGTTTTT GGCAGAACGT GCAGCGCCTG GGCATCTACG CCGAGGACAT CTGGCGCGTG ACCCCGCACC TCACGCTGAA CTACGGCCTG CGTTGGGACA CAACCTTCGG CCTCTTCGAC GTGGGTGGCC GCAGCCAGAA CGCGAACGTC GCCCTGCAAA CCATTGCCTA CCCGCAGTAC AACGGCGTCC CGCTCGACAA CCGCAAGCAG TTTGGTCCGC GCGTCGGCGC CATTTATTCA CCGGGAGATA GCGGCAAACT CGTTCTTCGC GCTGGCTTTG GCATGTTCTA CAACGATCTC GCGCAAAACG GCTGGGTGGA TGCGTTGATG GCCGTTAACC CCGGCAATGC CAACGTCAAC AGCACCGGCG CGATCATCGA CCCGCACTAC CACACGCCCT ACGCCATCGA CGCTAGCGCC GGCGTGGAAT ACGCCTTCGA TCAAGACTGG ATGGGTGCCG TCGAATTCAC GCACCAGACC GGCATGCACG GCTATCGTCG CTACGACTAT CCCGATGTCT CAGTCTTCCG CAGCGACAAT CGCTCCGCCT ACGACGGCCT CGTCCTGCGC GTACAGGGCA ACGTCTCCAA GCACTACAGC CTGACTGCCC ACTACACCTT TGCCAAGGCC CAGACCTGGG GCTGCCAACT CGGCGAACTC TTTGACTACG TGAACGGCGT CTGCGACCCC TTTAACGCCT TTGGCCCGGG GGACTACGGT CCAGCCGGTG AAGACGTACG CCACCGCTTC GTGCTCGCAG GCACATGGCA CGCCCCGCTC GGCATCGAGC TTTCCACCAT GACGCAAGCC GAGAGCGGTC GCCCCTTTAC CATTACGAAT CCCGACGGCT CCGGCCGCGC CGTCATTAAC GGCGTCACCA CTACCATGGA CCAGTTCCGC GGACGCCCAT ACTTCCAGGT GGACCTCCGC GTCTCCCGTC CGTTCCACAT CCAGGAACGC TGGCAAGTCA CCCCGTTCTT CGAGATGTTT AACCTCTTCA ACCGCAACAA TCCCGGCGCG TTCTACCGCG CCAACATGGC GGATTTGCCG GTCAACGATC CTGACAATGC CACAGCCATC TGTTTGAACG CTGATTGCAG CCAGACCAAA CCCATCACCA GCCCGAACCA ACTCCGCATT CCAGCCGGCG CTTTCGGCGA CTTCTTCGGC CCCGGTACTA CCGTCGGCAT TCCATTCAGC GGGCAATTCG GTGTTAGAGT GAGCTTCTGA
|
Protein sequence | MTHRLLRTAL VCVSFVSIFG VLTSAQEFRG GLAGRVQDAS GARVAAAHVQ VQAPESSVQR ETATDASGNF RFSDLPVGRY QVTVNAQGFG VATSEVAVLV GSTRDVFVTL HPPSVKESVA VSGEASSITM QPLDTSSPVH QAVVSAHDLE ELPLANRSFA NMAYLAPGTQ PIEPSDPTKA RITAVGTGGS SGLNNENSVD GGDNSDDYIG GFLQNFSTDS IQQFAFRVAQ EDADTGRTTG GSVVITTKRG TNDWHGLFGF YDRTSGLTAR YPIDNPEPNP KQPFSRQNYI FNGGGPIKKD KLWGFGSLEY VHERASIAYS NDSLAQFNAL ASLAQAGYIP GAPDIAVPPY VITPFNDYIG DARLDWAQSD HSQWFLRGAT DRYTTENDMV QQGTLPSVGA TTRSLYWNFA LNNQYQFSNT WLGSFTFDAS ILHRTVNRNQ YYGFSLDFPF TTTPSVITGA DTFGDNSFVT PITAFPVLRN QQKYQFRYDL SHSTPKHTMK FGVNLIHEPV IGGVLASQAE TVIAYSQNPV DYAANPASFA FTSAYLTNPD TCNENALDPD TVCTATPAGD GSFWQNVQRL GIYAEDIWRV TPHLTLNYGL RWDTTFGLFD VGGRSQNANV ALQTIAYPQY NGVPLDNRKQ FGPRVGAIYS PGDSGKLVLR AGFGMFYNDL AQNGWVDALM AVNPGNANVN STGAIIDPHY HTPYAIDASA GVEYAFDQDW MGAVEFTHQT GMHGYRRYDY PDVSVFRSDN RSAYDGLVLR VQGNVSKHYS LTAHYTFAKA QTWGCQLGEL FDYVNGVCDP FNAFGPGDYG PAGEDVRHRF VLAGTWHAPL GIELSTMTQA ESGRPFTITN PDGSGRAVIN GVTTTMDQFR GRPYFQVDLR VSRPFHIQER WQVTPFFEMF NLFNRNNPGA FYRANMADLP VNDPDNATAI CLNADCSQTK PITSPNQLRI PAGAFGDFFG PGTTVGIPFS GQFGVRVSF
|
| |