Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4491 |
Symbol | |
ID | 8335845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5117174 |
End bp | 5119087 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644957593 |
Product | thiamine pyrophosphate protein central region |
Protein accession | YP_003115195 |
Protein GI | 256393631 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3962] Acetolactate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0431429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCG GAAACGCTGA CAGCGCCGGC AGCACCCGCC CCACCCGCCG CCTCACCGTC GCCCAAGCCC TGGTCCGCTT CCTCGCGGTC CAGCACACCG AGCGCGACGG CACCGAGCAC CGCCTCATCG AAGGCGTCTT CGGCATCTTC GGTCACGGCA ACGTCGCAGG CCTCGGCGAA GCCCTCCTCG GCCTGGAACT CCAGGACCCG GCCCTGCTGC CCTACCGCCA GGCCCGCAAC GAGCAGGCCA TGGTCCACAC CGCCGCCGCC TTCGCCCGCA TGCGCGACCG CCTGTCCACC TACGCCTGCA CCACCTCCGT CGGGCCCGGC GCCACCAACC TGGTCACCGG CGCCGCCCTG GCGACCGTCA ACCGCCTCCC GGTTCTGCTG CTGCCCGGCG ACGTCTTCGC GACCCGCCCC GCCAACCCGG TCCTGCAAGA ACTGGAAGAC CCGCGCAGCC TCGACATCTC CGTCAACGAC ACCTTGCGCC CGGTCTCCCG CTACTTCGAC CGCGTCAACC GGCCCGAACA ACTCCCCGCC GCGCTCATGG CGGCCATGCG GGTCCTGACC GACCCCGCCG AGACCGGCGC CGTGACCCTC GCCCTGCCGC AGGACGTGCA AGCCGAAGCC TGGGACTGGC CCGAAGAGCT GTTCCAGCAC CGCGTCTGGC ACGTCCCCCG CCCCCGCGCC GACCTGGCAG CAATAGAGCG CGCGGCGGCA GCGATCAAGA ACTCCCGCCG TCCCCTCATC GTCGCCGGCG GTGGCGTCAC CTACTCGCAG GCCACTGACA CGCTGCGCAC GTTCGCCCAC GCCGCCGGCA TCCCGGTCGC CGAAACCCAA GCCGGCAAAG GCGCTCTGGA CGAATCCGAC CCGCTGTCCC TCGGCGCGAT CGGCGCGACC GGCACCCAAG CCGCGAACCT CATCGCGGCG CGCGCCGACC TCGTGATCGG CGTCGGAACC CGCTTCTCGG ACTTCACCAC CGCCTCGCAC ACCGCCTTCG CCGACCCGGA CGTCCGCTTC GTCACCGTCA ACATCGCCGC CTTCGACGCC GCCAAGCACG CCGGCGTATC AGTGGTCGCA GACGCCAGAT CAGCCCTCGA AGACCTCACC GCCGCCCTCG ACGGCTGGCG CACCGAAGCC GCCTTCCACG CCGAGATCGC CGACGTCCGC GCCCGGTGGG CCACGACCGT CCAGGGCGCC TACCGCTTCG CCAACGTCCC GCTGCCCTCG CAGATCGAGG TCATCGCCGC CGTCAACGAC GCCGCCGAGC CCGGGGACGT GGTCGTCTGC GCGGCCGGTT CGATGCCCGG CGACCTGCAC CGGCTCTGGC GCACCGGCGG CGACCCGAAG TCCTACCACG TCGAATACGG CTTCTCCTGC ATGGGATACG AGATCGCCGG CGGCCTCGGC GTGAAGCTTG CCGCGCCCGA GCGCGAGGTC TACGTCCTGG TCGGCGACGG CTCCTACCTG ATGCTCGCCC AGGAGATCGT TACGGCCGTG GCCGAGCGCG TGAAACTGAT CGTGGTCCTG GTCGACAACG CCGGGTACGC CTCCATCGGC GCCCTGTCCG AATCCCTCGG CGCCCAGCGC TTCGGCACCG CCTACCGCTA CCGCGGCGCG TCAGGACGCC TGAACGGCGG CCCGCTTCCG GTCGATCTCG CCGCCAACGC CGCGAGTCTC GGCGCGAACG TTGAGCGCGC CACCGACATC GAGGAGTTTG TCGCGGCGTT GGACCGGGCC AAAAAAGCCG ACCGCATCAC AGTCGTGCAC GTCGCGACCG ATCCGATGGC CGGCGCGCCC GACGGCGGCG CGTGGTGGGA CGTGCCCGTG GCACAGGTAT CAGACCTGGA ATCCACTCGC GCGGCCAGAA CGGGGTATGA GGAGGCGAAG AAAGCTCAAC GCCCGTACCT GTAA
|
Protein sequence | MSTGNADSAG STRPTRRLTV AQALVRFLAV QHTERDGTEH RLIEGVFGIF GHGNVAGLGE ALLGLELQDP ALLPYRQARN EQAMVHTAAA FARMRDRLST YACTTSVGPG ATNLVTGAAL ATVNRLPVLL LPGDVFATRP ANPVLQELED PRSLDISVND TLRPVSRYFD RVNRPEQLPA ALMAAMRVLT DPAETGAVTL ALPQDVQAEA WDWPEELFQH RVWHVPRPRA DLAAIERAAA AIKNSRRPLI VAGGGVTYSQ ATDTLRTFAH AAGIPVAETQ AGKGALDESD PLSLGAIGAT GTQAANLIAA RADLVIGVGT RFSDFTTASH TAFADPDVRF VTVNIAAFDA AKHAGVSVVA DARSALEDLT AALDGWRTEA AFHAEIADVR ARWATTVQGA YRFANVPLPS QIEVIAAVND AAEPGDVVVC AAGSMPGDLH RLWRTGGDPK SYHVEYGFSC MGYEIAGGLG VKLAAPEREV YVLVGDGSYL MLAQEIVTAV AERVKLIVVL VDNAGYASIG ALSESLGAQR FGTAYRYRGA SGRLNGGPLP VDLAANAASL GANVERATDI EEFVAALDRA KKADRITVVH VATDPMAGAP DGGAWWDVPV AQVSDLESTR AARTGYEEAK KAQRPYL
|
| |