Gene Caci_3372 details


Gene Information

Locus tag: Caci_3372
Symbol:
ID: 8334725
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3722951
End bp: 3724681
Gene Length: 1731 bp
Protein Length: 576 aa
Translation table: 11
GC content: 70%
IMG OID: 644956516
Product: thiamine pyrophosphate protein TPP binding domain protein
Protein accession: YP_003114119
Protein GI: 256392555
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
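
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3722951, 3724681)
print(length_bp, protein_length(length_bp))  # 1731 576
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of the record.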


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCG TCGCCGAACA GATCGTCACC GCCCTGGCCG ACCTGGGGGT CCGCACCGTC 
TGGGGAGTGG TCGGGGACGC GCTCAACCCC GTGACGGACG CGATCCGCCG CGAAGAGCGC
ATCGAGTGGA TCGGCACCCG GCACGAGGAG GCGGCGGCCT TCGCCGCGAG CGCGCAGGCC
CAGCTGAGCG GCACCATCGG CGTGTGCATG GGAACCGTCG GACCGGGTTC GCTGCATCTG
CTGAACGGCC TGTACGACGC CAAGAAGTCG CACGCCCCGG TGCTGGCGAT CTGCGGCCAG
GTCCCCTCGG CCGAGTTGGG CGCCGAATAC TTCCAGGAGG TCGACAACGA CGCGGTGTTC
CGCGACGTCG CCGCCTTCCG GCACACGGTG ACCAGCGCGA GCCAGATGCC CCGGGTCCTG
GAGCAGGCGG TGCAGACGGC CTACGCCACC CCGGGCGTCT CGGTGCTCAC GCTGCCCGGC
GACATCGGCT CCGCGGAGGT CGCCAAGGAC AGCGCCGTCC ACATCACGCG CGTCCCGGCA
CGTCTGACAC CCGACGACGA CGAGATCACC CGCGCCGTGC GGCTCCTGGA CGACGCCAAG
ACCGTGACGA TGCTCGTCGG CGCCGGAGCC CGGGAATCGC GTGCCTCAGT GCTGCAACTG
GCCGATCGCC TGGCCGCTCC GATGGTCCTG ACTCTCAAGG CGAAGGAAGG GCTCGAAGAC
GACAACCCCT TCCAGATCGG CCAGAGCGGC CTGATCGGCA ACCCGGCGAC CCGCGAGGCG
TTCGAGTCCG CCGGCGCGCT GCTGATGATC GGCACGGACT TCCCGTATCC CGACTGGCTG
CCCCGCTCGA CGCCGACCGT CCAGATCGAC ACGCGCGCCG GCCACATCGG GCGCCGTACG
CCGGTCGACG TCGGCGTCGT CGGCGACGCG GGGCTGAGCA TCGCCGCGCT CCTGAACCGG
GTGCGCAGCA AGGACGATCG CAGCCATCTG GAAAAGGCAC GCTCGAGCTA CGAGGACTGG
CAAGGTCACC AGCGCCGCCT CACCGACCCG GAGTTCGACC AGAGCCTGGT GGGCAAGGTG
CGGTCTTGGC TCGACAACAC CGAGGACAAG ATCCGCCCCG AGGCGCTGGC CACGCTCATC
GACACGCACG CCGCCGAGGA CACCGTGTTC ACCACCGACA CCGGCATGTC CACGGTCTGG
CTCGCGCGCT GCGTGACGAT GCGCGGCAGC CGCCGCCTGA TCGGGTCCTT CAACCTCGGT
TCGATGGCGA ACGCCCTGCC GCACGCCCTC GGCGCCGCCG CCCTGGACCG GCAGCGGCAG
GTCGTCGCCT TCTGCGGCGA CGGCGGTCTG ACGATGCTGC TCGGCGACGT GCTCACCGCC
GTCGCCTACG ACCTGCCGGT CAAGCTCATC GTCTTCGACA ACGGCCGCCT GGGCATGGTC
AAGCTCGAGC AAGAGCAAGG CGGGCTCCCG GAGTTCGGCA CCGAGTTGGC CAACCCCGAC
CTGGCCGCCG TCGCCACCGC GATGGGCATG CCGGCCGCCC GGGTCACCGA ACCCGAGGCG
CTGGAGGCCG CTGTCCAGGC CGCACTCGCC TCACCGGGTC CGTACCTGCT CGACGTGGTC
ACCAATCCCG AAGAGATCGC GCTGCCGCCG AAGACAAGTA TCGACCAGGC GTGGGGGTTC
GCGATCGCGA AGATGAAGGA AGGGATTGTG AGCCGGGGCG CCAAGTCCTG A
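
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any length or GC calculation. The following is a minimal sketch of that cleanup, shown on the first line of the sequence only; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCACCG TCGCCGAACA GATCGTCACC GCCCTGGCCG ACCTGGGGGT CCGCACCGTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full sequence, the same two functions would give values to compare against the Gene Length and GC content fields.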
 
Protein sequence
MTTVAEQIVT ALADLGVRTV WGVVGDALNP VTDAIRREER IEWIGTRHEE AAAFAASAQA 
QLSGTIGVCM GTVGPGSLHL LNGLYDAKKS HAPVLAICGQ VPSAELGAEY FQEVDNDAVF
RDVAAFRHTV TSASQMPRVL EQAVQTAYAT PGVSVLTLPG DIGSAEVAKD SAVHITRVPA
RLTPDDDEIT RAVRLLDDAK TVTMLVGAGA RESRASVLQL ADRLAAPMVL TLKAKEGLED
DNPFQIGQSG LIGNPATREA FESAGALLMI GTDFPYPDWL PRSTPTVQID TRAGHIGRRT
PVDVGVVGDA GLSIAALLNR VRSKDDRSHL EKARSSYEDW QGHQRRLTDP EFDQSLVGKV
RSWLDNTEDK IRPEALATLI DTHAAEDTVF TTDTGMSTVW LARCVTMRGS RRLIGSFNLG
SMANALPHAL GAAALDRQRQ VVAFCGDGGL TMLLGDVLTA VAYDLPVKLI VFDNGRLGMV
KLEQEQGGLP EFGTELANPD LAAVATAMGM PAARVTEPEA LEAAVQAALA SPGPYLLDVV
TNPEEIALPP KTSIDQAWGF AIAKMKEGIV SRGAKS
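
The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch, assuming the amino acid assignments of translation table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The `translate` function and the table construction are illustrative, not the database's own method.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence, copied from the record.
print(translate("ATGACCACCG TCGCCGAACA GATCGTCACC"))  # MTTVAEQIVT
print(translate("GCCAAGTCCTGA"))                      # AKS
```

Both fragments agree with the start (MTTVAEQIVT) and end (AKS) of the listed protein sequence.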