Gene Caci_7374 details


Gene Information

Locus tag: Caci_7374
Symbol:
ID: 8338744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 8558952
End bp: 8560682
Gene Length: 1731 bp
Protein Length: 576 aa
Translation table: 11
GC content: 66%
IMG OID: 644960455
Product: thiamine pyrophosphate protein domain protein TPP-binding
Protein accession: YP_003118042
Protein GI: 256396478
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
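
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
START_BP = 8558952
END_BP = 8560682
REPORTED_GENE_LENGTH = 1731      # bp
REPORTED_PROTEIN_LENGTH = 576    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, length == REPORTED_GENE_LENGTH)
    print(protein_length(length), protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the computed values agree with the reported 1731 bp and 576 aa.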


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.693349
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTACTG TCGCGGACGT GCTGTGGAAG ATGCTCGCCG ACGCCGGGGT GCGGCGGTGC 
TACGGCATCG TCGGCGATGC TCTGAACCCG GCGATCGATG CGCTGCGGCG AGCCGGCGAC
ATCGAGTTCG TGCATGTCCG GCACGAGGAG TGGGGCGTGT TCGCGGCGGT CGCGGAGGCG
AAGATGTCGG GGCGGCCGGT AGCGGTGTGC GGAACCGCCG GACCCGGTGT GAGCCACCTG
ATCAACGGAC TGCTCGACGC CCGCAAGGAG GGCGCCCCGG TGATCGCGAT CGCCGGCGAC
GTCGAGACCG CCATCATGGA CACAGACGGT CTGGAAGAGC TGAACCCTTA TACGTTCTTC
AGCGTGGCGT CTCTCTATAC AGGGCGCCTA GTAGACCCCC AGCAGCTGCG TCCGATCGTT
ACCTCCGCGA TCACCACCGC TCTGACCGAA CTGGGTCCGA CCGTCATCTC GCTGCCCGGC
GATGTGGCGG CAGCGGACGC GCCAGACGTT CCGACGCATA TCGCGCTGCC GAGCACGACT
ACCGGTCCCG CGCCCGACAA GGACATCGCG CAGTTGGCGG ACATCATCAA CGCCGCCAAG
ACGGTCGCCA TCTTCGGCGG CGAAGGCTGT CAGAACGCGC GGGAACAGGT CCGTGCCCTC
TCAGACATAC TGAACGCGCC CGTCGGCTAC AGCTTCAAGG GCAAGCAGTG GCTGGAGTAC
GACAACCCGC ACGCGGTCGG CATGACCGGG CTGCTCGGCT ATGGCGGCTG CTGGGAAGCG
GTGAACCACG CCGACGTACT CCTGATGCTG GGTACGGACT TCCCCTTCCC GCAGTTCCTC
CCGCACAGCG GAGTGAAGGT GGTCCAGGTG GACCGCGACG GACGCCGCCT AGGCCGCCGC
GTCCCACTGG AGCACGGACT GGTCGGCGAT GTCGGCGCCA CCCTGGACCG ACTCCTCCCG
CAACTGTCGC CGAAGACGGA CGACGCCTTC CTGCGTAAGT GCTTGAAGAA GACCGAGGAG
TTCGACAAGC AGCTGCAGCA CTACGTCGAG CGCGGCCCGG CGCTTAAGCA GATCCGCCCC
GAGTACCTGA CCGCCACTTT GGACCGGCTC GCCCCCGAAG AAGCGGTCTT CACCGTAGAC
ACCGGCACAG CGTGCATCTG GGCTGCGCAC TATCTGCACC TCGGCCCTAA GCGCCACCTG
TTCGGCAGCC TCACATGGGC CTCGATGGCC AGCGCCTCCC CGAACGCGTT CGGCGCCAAG
ATGGCCTTCC CGGACCGCGC GGCGATCGCG CTGTGCGGCG ACGGCGGCTT CACGATGCTA
GGCCTCGGCG ACCTCCTCAC GGAGGTGCAG CACAAGGCCG AGATCGTCCA CGTGATCCTG
AACAACGGCA AGCTCGACTT CGTCTGGATC GAGATGCAGG AGGCCGGGCT CCAGCCGTGG
GGCGTCGACT TCCAGAACCC CGACTTCGCC AAGGTCGGCG AGGCGCTCGG TGCCAAGGGC
ATCCGGATCG AGCGCCCCGC GGACCTGGAA CAGGGACTCA AAGAAGCGTT GAACCACCGT
GGCGGTCCCG TAGTGGTCGA CGTAGTAGTC GATCCCTATG CGTTGGCGCT ACCCGCACAC
ACTCCGGCGG CCACCGTTAA GGGATTCACG CTCAGCGTCG CTAAGCAGGC GCTGAGCGGA
CACCTCGGAG ACGTCGTCAA GGAAGCGACG CACAACGCGC GCCTACTCTG A
 
Protein sequence
MPTVADVLWK MLADAGVRRC YGIVGDALNP AIDALRRAGD IEFVHVRHEE WGVFAAVAEA 
KMSGRPVAVC GTAGPGVSHL INGLLDARKE GAPVIAIAGD VETAIMDTDG LEELNPYTFF
SVASLYTGRL VDPQQLRPIV TSAITTALTE LGPTVISLPG DVAAADAPDV PTHIALPSTT
TGPAPDKDIA QLADIINAAK TVAIFGGEGC QNAREQVRAL SDILNAPVGY SFKGKQWLEY
DNPHAVGMTG LLGYGGCWEA VNHADVLLML GTDFPFPQFL PHSGVKVVQV DRDGRRLGRR
VPLEHGLVGD VGATLDRLLP QLSPKTDDAF LRKCLKKTEE FDKQLQHYVE RGPALKQIRP
EYLTATLDRL APEEAVFTVD TGTACIWAAH YLHLGPKRHL FGSLTWASMA SASPNAFGAK
MAFPDRAAIA LCGDGGFTML GLGDLLTEVQ HKAEIVHVIL NNGKLDFVWI EMQEAGLQPW
GVDFQNPDFA KVGEALGAKG IRIERPADLE QGLKEALNHR GGPVVVDVVV DPYALALPAH
TPAATVKGFT LSVAKQALSG HLGDVVKEAT HNARLL
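
The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how the blocked gene sequence could be joined, translated, and measured for GC content. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that case does not arise here). All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    first_block = "ATGCCTACTG TCGCGGACGT GCTGTGGAAG"
    print(translate(first_block))  # MPTVADVLWK
```

Translating the first three blocks of the gene sequence reproduces the first block of the protein sequence, MPTVADVLWK. Passing the full gene sequence to `gc_content` would give a value to compare against the reported 66%.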