Gene Caci_4491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4491 
Symbol 
ID8335845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5117174 
End bp5119087 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content71% 
IMG OID644957593 
Productthiamine pyrophosphate protein central region 
Protein accessionYP_003115195 
Protein GI256393631 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3962] Acetolactate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0431429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG GAAACGCTGA CAGCGCCGGC AGCACCCGCC CCACCCGCCG CCTCACCGTC 
GCCCAAGCCC TGGTCCGCTT CCTCGCGGTC CAGCACACCG AGCGCGACGG CACCGAGCAC
CGCCTCATCG AAGGCGTCTT CGGCATCTTC GGTCACGGCA ACGTCGCAGG CCTCGGCGAA
GCCCTCCTCG GCCTGGAACT CCAGGACCCG GCCCTGCTGC CCTACCGCCA GGCCCGCAAC
GAGCAGGCCA TGGTCCACAC CGCCGCCGCC TTCGCCCGCA TGCGCGACCG CCTGTCCACC
TACGCCTGCA CCACCTCCGT CGGGCCCGGC GCCACCAACC TGGTCACCGG CGCCGCCCTG
GCGACCGTCA ACCGCCTCCC GGTTCTGCTG CTGCCCGGCG ACGTCTTCGC GACCCGCCCC
GCCAACCCGG TCCTGCAAGA ACTGGAAGAC CCGCGCAGCC TCGACATCTC CGTCAACGAC
ACCTTGCGCC CGGTCTCCCG CTACTTCGAC CGCGTCAACC GGCCCGAACA ACTCCCCGCC
GCGCTCATGG CGGCCATGCG GGTCCTGACC GACCCCGCCG AGACCGGCGC CGTGACCCTC
GCCCTGCCGC AGGACGTGCA AGCCGAAGCC TGGGACTGGC CCGAAGAGCT GTTCCAGCAC
CGCGTCTGGC ACGTCCCCCG CCCCCGCGCC GACCTGGCAG CAATAGAGCG CGCGGCGGCA
GCGATCAAGA ACTCCCGCCG TCCCCTCATC GTCGCCGGCG GTGGCGTCAC CTACTCGCAG
GCCACTGACA CGCTGCGCAC GTTCGCCCAC GCCGCCGGCA TCCCGGTCGC CGAAACCCAA
GCCGGCAAAG GCGCTCTGGA CGAATCCGAC CCGCTGTCCC TCGGCGCGAT CGGCGCGACC
GGCACCCAAG CCGCGAACCT CATCGCGGCG CGCGCCGACC TCGTGATCGG CGTCGGAACC
CGCTTCTCGG ACTTCACCAC CGCCTCGCAC ACCGCCTTCG CCGACCCGGA CGTCCGCTTC
GTCACCGTCA ACATCGCCGC CTTCGACGCC GCCAAGCACG CCGGCGTATC AGTGGTCGCA
GACGCCAGAT CAGCCCTCGA AGACCTCACC GCCGCCCTCG ACGGCTGGCG CACCGAAGCC
GCCTTCCACG CCGAGATCGC CGACGTCCGC GCCCGGTGGG CCACGACCGT CCAGGGCGCC
TACCGCTTCG CCAACGTCCC GCTGCCCTCG CAGATCGAGG TCATCGCCGC CGTCAACGAC
GCCGCCGAGC CCGGGGACGT GGTCGTCTGC GCGGCCGGTT CGATGCCCGG CGACCTGCAC
CGGCTCTGGC GCACCGGCGG CGACCCGAAG TCCTACCACG TCGAATACGG CTTCTCCTGC
ATGGGATACG AGATCGCCGG CGGCCTCGGC GTGAAGCTTG CCGCGCCCGA GCGCGAGGTC
TACGTCCTGG TCGGCGACGG CTCCTACCTG ATGCTCGCCC AGGAGATCGT TACGGCCGTG
GCCGAGCGCG TGAAACTGAT CGTGGTCCTG GTCGACAACG CCGGGTACGC CTCCATCGGC
GCCCTGTCCG AATCCCTCGG CGCCCAGCGC TTCGGCACCG CCTACCGCTA CCGCGGCGCG
TCAGGACGCC TGAACGGCGG CCCGCTTCCG GTCGATCTCG CCGCCAACGC CGCGAGTCTC
GGCGCGAACG TTGAGCGCGC CACCGACATC GAGGAGTTTG TCGCGGCGTT GGACCGGGCC
AAAAAAGCCG ACCGCATCAC AGTCGTGCAC GTCGCGACCG ATCCGATGGC CGGCGCGCCC
GACGGCGGCG CGTGGTGGGA CGTGCCCGTG GCACAGGTAT CAGACCTGGA ATCCACTCGC
GCGGCCAGAA CGGGGTATGA GGAGGCGAAG AAAGCTCAAC GCCCGTACCT GTAA
 
Protein sequence
MSTGNADSAG STRPTRRLTV AQALVRFLAV QHTERDGTEH RLIEGVFGIF GHGNVAGLGE 
ALLGLELQDP ALLPYRQARN EQAMVHTAAA FARMRDRLST YACTTSVGPG ATNLVTGAAL
ATVNRLPVLL LPGDVFATRP ANPVLQELED PRSLDISVND TLRPVSRYFD RVNRPEQLPA
ALMAAMRVLT DPAETGAVTL ALPQDVQAEA WDWPEELFQH RVWHVPRPRA DLAAIERAAA
AIKNSRRPLI VAGGGVTYSQ ATDTLRTFAH AAGIPVAETQ AGKGALDESD PLSLGAIGAT
GTQAANLIAA RADLVIGVGT RFSDFTTASH TAFADPDVRF VTVNIAAFDA AKHAGVSVVA
DARSALEDLT AALDGWRTEA AFHAEIADVR ARWATTVQGA YRFANVPLPS QIEVIAAVND
AAEPGDVVVC AAGSMPGDLH RLWRTGGDPK SYHVEYGFSC MGYEIAGGLG VKLAAPEREV
YVLVGDGSYL MLAQEIVTAV AERVKLIVVL VDNAGYASIG ALSESLGAQR FGTAYRYRGA
SGRLNGGPLP VDLAANAASL GANVERATDI EEFVAALDRA KKADRITVVH VATDPMAGAP
DGGAWWDVPV AQVSDLESTR AARTGYEEAK KAQRPYL