Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2307 |
Symbol | |
ID | 4071461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2731912 |
End bp | 2733381 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984323 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_591382 |
Protein GI | 94969334 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.35999 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGGAG AATCCAGAGT TATGAGCAGC TCCAAAGGCA ACGGAAACGG CACCCCCACT GTCCCTCAGG CCCGCGCAGA ATGGGTCAAG AAGCGCAAAG AAGAAGCTGC CCGCACCGGA GACGACAACG TCTCCCAGAT GCATTTTGCG CGCAAGGGCC TCGTCACCGA GGAAATGCTC TTTGTCGCTG AACGCGAGCG CGTGAGCCCC GAAATCATCC GCGCCGAAGT TGCCGCCGGA CGCATGATCA TTCCGGCCAA CATTAATCAC CCAGAGCTTG AGCCGATGGC CATCGGCATT GAGTCGCGTT GCAAGATCAA CGCCAATATC GGCAACTCCG CCGTTACGTC CGACGTCGAA ACCGAGCTCA AGAAGCTCCA CACCTCTGTG CATCATGGCG CAGACACGGT GATGGACCTC TCTACCGGCG GCGACATCCA CGACATCCGC GAAGCCATCA TCCGTCATTC GCCTGTGCCC ATCGGTACCG TGCCGATTTA CGAAGCGGTC TCGCGGGTGA AGCGGATTGA AGATCTGACC GCCGATCTCA TGCTCGAAGT GATCGAGGAG CAAGCACAGC AGGGCGTGGA TTACATGACC ATCCACGCCG GCGTGCTCAT TCAGTACTTG CCGCTGGTCG CCAAGCGAAT CACCGGCATC GTCAGCCGCG GCGGCGCGAT CCTGGCGCAA TGGATGGCCC ACCATCACAA GCAGAACTTC CTCTATGAAC GCTTCGACGA CATCACCAAG ATTCTGAAGA AGTATGACGT CTCGTATTCG TTGGGCGACG GCCTGCGTCC GGGCTGCATT GCCGACGCTA GCGACGAAGC GCAGTTCGCC GAGCTTAAAA CTCTAGGCGA ACTAACGACC AAAGCCTGGG AGTACGACGT ACAGACGATG ATTGAAGGTC CGGGCCACGT GCCGCTCGAC AAGATCAAAG AGCAGGTGGA GAAAGAAGTG GAGTGGTGCC ACGGCGCGCC GTTCTACACT CTCGGCCCGC TTGTCATCGA TATTGCTCCG GGCTACGACC ACATCACCAG CGCGATCGGC GCGGCGATCA TCGGCTGGCA CGGCGCATCT ATGCTCTGCT ACGTGACGCC GAAAGAACAC CTCGGTCTGC CGAATGAAAA AGATGTGAAG GACGGCATCA TCGCTTACAA GATCGCCGCC CACGCTGCCG ACGTCGCCCG TCATCGTCCG GGCGCTCAGG ACCGCGATAA CGCCCTCAGC CACGCCCGCT ACACCTTCGA CTGGGAGTCG CAGTTCAATC TTTCACTCGA TCCCGAGACC GCCCGCTCGA TGCACGACGA GACGCTTCCG GATGCGTATT ACAAAGAAGC GGCGTTCTGC TCGATGTGCG GCCCGAAATT CTGCTCGATG AATTATTCGT CGAAGGTCGA TGAATACAAC AAGAAGGTCC ACGGCATCGA TAAGGCCGAG TTCATTTCGC AGCTAACAGT TTTGAAATAA
|
Protein sequence | MQGESRVMSS SKGNGNGTPT VPQARAEWVK KRKEEAARTG DDNVSQMHFA RKGLVTEEML FVAERERVSP EIIRAEVAAG RMIIPANINH PELEPMAIGI ESRCKINANI GNSAVTSDVE TELKKLHTSV HHGADTVMDL STGGDIHDIR EAIIRHSPVP IGTVPIYEAV SRVKRIEDLT ADLMLEVIEE QAQQGVDYMT IHAGVLIQYL PLVAKRITGI VSRGGAILAQ WMAHHHKQNF LYERFDDITK ILKKYDVSYS LGDGLRPGCI ADASDEAQFA ELKTLGELTT KAWEYDVQTM IEGPGHVPLD KIKEQVEKEV EWCHGAPFYT LGPLVIDIAP GYDHITSAIG AAIIGWHGAS MLCYVTPKEH LGLPNEKDVK DGIIAYKIAA HAADVARHRP GAQDRDNALS HARYTFDWES QFNLSLDPET ARSMHDETLP DAYYKEAAFC SMCGPKFCSM NYSSKVDEYN KKVHGIDKAE FISQLTVLK
|
| |