Gene Acid345_2307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2307 
Symbol 
ID4071461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2731912 
End bp2733381 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content58% 
IMG OID637984323 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_591382 
Protein GI94969334 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.35999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGGAG AATCCAGAGT TATGAGCAGC TCCAAAGGCA ACGGAAACGG CACCCCCACT 
GTCCCTCAGG CCCGCGCAGA ATGGGTCAAG AAGCGCAAAG AAGAAGCTGC CCGCACCGGA
GACGACAACG TCTCCCAGAT GCATTTTGCG CGCAAGGGCC TCGTCACCGA GGAAATGCTC
TTTGTCGCTG AACGCGAGCG CGTGAGCCCC GAAATCATCC GCGCCGAAGT TGCCGCCGGA
CGCATGATCA TTCCGGCCAA CATTAATCAC CCAGAGCTTG AGCCGATGGC CATCGGCATT
GAGTCGCGTT GCAAGATCAA CGCCAATATC GGCAACTCCG CCGTTACGTC CGACGTCGAA
ACCGAGCTCA AGAAGCTCCA CACCTCTGTG CATCATGGCG CAGACACGGT GATGGACCTC
TCTACCGGCG GCGACATCCA CGACATCCGC GAAGCCATCA TCCGTCATTC GCCTGTGCCC
ATCGGTACCG TGCCGATTTA CGAAGCGGTC TCGCGGGTGA AGCGGATTGA AGATCTGACC
GCCGATCTCA TGCTCGAAGT GATCGAGGAG CAAGCACAGC AGGGCGTGGA TTACATGACC
ATCCACGCCG GCGTGCTCAT TCAGTACTTG CCGCTGGTCG CCAAGCGAAT CACCGGCATC
GTCAGCCGCG GCGGCGCGAT CCTGGCGCAA TGGATGGCCC ACCATCACAA GCAGAACTTC
CTCTATGAAC GCTTCGACGA CATCACCAAG ATTCTGAAGA AGTATGACGT CTCGTATTCG
TTGGGCGACG GCCTGCGTCC GGGCTGCATT GCCGACGCTA GCGACGAAGC GCAGTTCGCC
GAGCTTAAAA CTCTAGGCGA ACTAACGACC AAAGCCTGGG AGTACGACGT ACAGACGATG
ATTGAAGGTC CGGGCCACGT GCCGCTCGAC AAGATCAAAG AGCAGGTGGA GAAAGAAGTG
GAGTGGTGCC ACGGCGCGCC GTTCTACACT CTCGGCCCGC TTGTCATCGA TATTGCTCCG
GGCTACGACC ACATCACCAG CGCGATCGGC GCGGCGATCA TCGGCTGGCA CGGCGCATCT
ATGCTCTGCT ACGTGACGCC GAAAGAACAC CTCGGTCTGC CGAATGAAAA AGATGTGAAG
GACGGCATCA TCGCTTACAA GATCGCCGCC CACGCTGCCG ACGTCGCCCG TCATCGTCCG
GGCGCTCAGG ACCGCGATAA CGCCCTCAGC CACGCCCGCT ACACCTTCGA CTGGGAGTCG
CAGTTCAATC TTTCACTCGA TCCCGAGACC GCCCGCTCGA TGCACGACGA GACGCTTCCG
GATGCGTATT ACAAAGAAGC GGCGTTCTGC TCGATGTGCG GCCCGAAATT CTGCTCGATG
AATTATTCGT CGAAGGTCGA TGAATACAAC AAGAAGGTCC ACGGCATCGA TAAGGCCGAG
TTCATTTCGC AGCTAACAGT TTTGAAATAA
 
Protein sequence
MQGESRVMSS SKGNGNGTPT VPQARAEWVK KRKEEAARTG DDNVSQMHFA RKGLVTEEML 
FVAERERVSP EIIRAEVAAG RMIIPANINH PELEPMAIGI ESRCKINANI GNSAVTSDVE
TELKKLHTSV HHGADTVMDL STGGDIHDIR EAIIRHSPVP IGTVPIYEAV SRVKRIEDLT
ADLMLEVIEE QAQQGVDYMT IHAGVLIQYL PLVAKRITGI VSRGGAILAQ WMAHHHKQNF
LYERFDDITK ILKKYDVSYS LGDGLRPGCI ADASDEAQFA ELKTLGELTT KAWEYDVQTM
IEGPGHVPLD KIKEQVEKEV EWCHGAPFYT LGPLVIDIAP GYDHITSAIG AAIIGWHGAS
MLCYVTPKEH LGLPNEKDVK DGIIAYKIAA HAADVARHRP GAQDRDNALS HARYTFDWES
QFNLSLDPET ARSMHDETLP DAYYKEAAFC SMCGPKFCSM NYSSKVDEYN KKVHGIDKAE
FISQLTVLK