Gene Acid345_2843 details


Gene Information

Locus tag: Acid345_2843
Symbol:
ID: 4070362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3380777
End bp: 3382558
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 59%
IMG OID: 637984861
Product: thiamine pyrophosphate enzyme-like TPP bindin
Protein accession: YP_591918
Protein GI: 94969870
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
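
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 3380777
end_bp = 3382558
gene_length_bp = 1782
protein_length_aa = 593

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final stop codon encodes no residue.
codons, remainder = divmod(gene_length_bp, 3)
residues = codons - 1

print(span, remainder, residues)  # 1782 0 593
```

Under these assumptions the span matches the listed gene length, the length is a whole number of codons, and the residue count matches the listed protein length.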


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.89479
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCCAGC CATACGTAAG CGAAACCCGC AACATTGCAA GCTATGCCGT AGATCTGGTG 
GCCGCACTCG GCTCCGACAC GGTCTTCAGC CTCACTGGCG GCATGGCCAT GTTCATGAAC
CGCGCCGTTG CCACCCATAA ACGCCTGAAG CCGGTGTACT GCCAGCACGA GCAAGCTTGC
GTCGCTGCGG CTGAGGGCTA CACCAAGGCC GCGGACTTTC GTCGCGCTGG CTTTGCCCTG
ATCACAGCTG GCCCTGGCGT CTCGAACTCT GTGACCTCCC TGCTTTCTGC CTACGGCGAC
TCTGCGCCGG TGATCGTTCT GGCCGGACAG ATCAAAACCG ACGACATTGA TCGCTTCGGC
ACCCGCGCGC ACGGAATTCA GGAAGTGCCC TCGCAGGCAT TGATCACCCC ATGTGTGAAG
AAGTTCGCTC GCGTGGATCC GCTGAACTAT CGAAAGCAAC TCGTAGAAAC GCTGGCGGAA
GCATTCGCAG GACGCCCCGG CCCGGTCTTC ATCGAGATTC CGCTGGATGT GCAAGGTGCA
CCGATCGAAT ACAGTGTCGA GACAATCGTC GCTGATCAAG CCGAGATAGA AAAACGAATT
ATCGCTTCGC GCGACGCACA ACAGAGTCTG GCGCGAATCT CTGATGCACT CGGCGAACTT
CTCAAGGCCA AGCGTCCGCT GCTCTATGTT GGAAATGGCT GCCGCATCGC GGGAGTAGAA
GAAGCCGCCC GCACGCTGAT TTCCCGCTAC GATCTGCCCG CGGTTTTCTC CTGGCTCTCG
TTCGATATCC TGGCCAGTCA AGATAAACAC TGGTTTGGCT GCCCGGGCGG ACTTGCGCCG
ATCTATTCCA ACGAAGTGCT GGCGCGCGCC GACGTAATTC TCTTTCTCGG AGCGCGGCTT
GATCTCGGCA CTACCGCTTT CCAACGCCAC GCTTTTGGGG ACCAGGCCCG GCGCCTGTTC
ATCGACATAG ATCCCGCCGA GTTGGCGAAG TTCGCAGGTT TCCCGAATAC CAAGTGCATC
GAAGCGGATC TGCATGCACT CCCAATCGCC GTCGAACAAC ACGCGACGAC GAACAGCGCA
GCCGGAGAAG GCTGGCTGCA ATGGTGCATC GCTCGCAGAG ACCAATATCT TCCTGAAGAA
CGCGAGCGCC TGCAGTCCAC GGAAATGACG GTGTTCGGCG TCGCTGAGCT TCTCTCGCGA
TGGTCTGACG GCAAAGTGTT CGTACCCGCC AGTTCCGGCT ACGCGGAAGA AACTTTCTCG
CGGTTCTTCG CGCCGGGTCA AGGCACGCGG TTCTTCAACG GGGCGTCGCT TGGATCTATG
GGTTTGGGAT TGGCACACTC CATCGGCGCT TCGTTCGGCT CGCCGCGACG CGTGATCGGA
CTCGAAGCCG ATGGCGGCCT GATGCTCAAC GTCCAAGAAC TCGCGACGTT GTCTCACTAC
GCTCCGAAGG GCCACGTTCT CTTCGTGTTG AACAACGGCG GCTATGAATC CATTCGCGCT
TCGCAGAGCC GCTATTTTGG CGCGGTGAGT GGCGTTGATG GCGAAACGGG GCTGTTCATT
CCTGACCTCG CGAAGATCGC CGAAGCCTTC CAACTCCGCT ATTTGCGCGT AGATTCCCTC
GCTGCACTCG ACGAGTTGCT TCCGAAGCTC GACCCGAATG ATCCGCCCAT ACTGGTTGAC
CTCTGCGTTG CGCGCTTCGA AAATCGTGGG CCTTCGGTAA AGACCAAGAT CGGCGAGGAC
GGGAAGCCCT ACACCACGCC GTTAGCGGAG CTATCGTGGT AA
 
Protein sequence
MSQPYVSETR NIASYAVDLV AALGSDTVFS LTGGMAMFMN RAVATHKRLK PVYCQHEQAC 
VAAAEGYTKA ADFRRAGFAL ITAGPGVSNS VTSLLSAYGD SAPVIVLAGQ IKTDDIDRFG
TRAHGIQEVP SQALITPCVK KFARVDPLNY RKQLVETLAE AFAGRPGPVF IEIPLDVQGA
PIEYSVETIV ADQAEIEKRI IASRDAQQSL ARISDALGEL LKAKRPLLYV GNGCRIAGVE
EAARTLISRY DLPAVFSWLS FDILASQDKH WFGCPGGLAP IYSNEVLARA DVILFLGARL
DLGTTAFQRH AFGDQARRLF IDIDPAELAK FAGFPNTKCI EADLHALPIA VEQHATTNSA
AGEGWLQWCI ARRDQYLPEE RERLQSTEMT VFGVAELLSR WSDGKVFVPA SSGYAEETFS
RFFAPGQGTR FFNGASLGSM GLGLAHSIGA SFGSPRRVIG LEADGGLMLN VQELATLSHY
APKGHVLFVL NNGGYESIRA SQSRYFGAVS GVDGETGLFI PDLAKIAEAF QLRYLRVDSL
AALDELLPKL DPNDPPILVD LCVARFENRG PSVKTKIGED GKPYTTPLAE LSW
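
As a rough illustration of how the protein sequence relates to the gene sequence, the following sketch translates a nucleotide string with the standard codon assignments, which translation table 11 shares, and reads an alternative start codon such as TTG as methionine. It also includes a small GC-percentage helper. All names here (`CODON_TABLE`, `START_CODONS`, `clean`, `gc_percent`, `translate`) are hypothetical and not part of the source database, and the start-codon set is an assumption about table 11.

```python
# Standard codon assignments in NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: start codons accepted under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used in the listing and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a CDS, reading the first codon as M if it is a start codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "TTGTCCCAGC CATACGTAAG CGAAACCCGC"
print(translate(first_block))  # MSQPYVSETR
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and GC content; those results are not asserted here.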