Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_8210 |
Symbol | |
ID | 8339589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 9520012 |
End bp | 9523284 |
Gene Length | 3273 bp |
Protein Length | 1090 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 644961296 |
Product | hypothetical protein |
Protein accession | YP_003118874 |
Protein GI | 256397310 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCACG ATACGGACCG GCAATACCTC GCGCCGCAGG GACCGCCGCC GGGTATGGCG CAGCAGACTC CGGTCGCCTT GAAGCGGCAG CGGAGCGCAG CGGACGTCCT CACCGGTCTG GGGGCGCTGC TGGCGTTGCT GGCGCTGGTG ATCGGCGTGC CGTTGGCGTT GGCGTACTTC GTGGGCTGGC CGCTGCCGCA CCACATGCCC TCCGGCGGCG TGCTGAACTC GAAGATCGAC ACCAAGACCT TCACCAACGT GCTGGCGATC CTGGTCTGGC TGGCGTGGGC GCAGTTCTCG GCGTGCGTGC TCGTCGAGGC GCTGGCCGCC GCGCGCGGTA TCGGGATGCC CGGCCACGTG CCGCTGTCCG GCGGCAGCCA GGTCCTGGCC CGCCAGCTCG TCGCCGCCGT GCTGCTGATC ACCGCCTCGG CCGCCTCCTT CGCCCCCGGT CTGTCCTCCC TGGGCCGCAC CTCCGGCGAC GGCCCGCACC GCGCGCCGAT CGCCGCGACC CAGGTGCTGC AGCAGGGCAC GCGCGCCGAC ACCGCGATGC CCTCGGGACC GTCGCAGCGC GCTGCGACGT CCATCGACGC CCGCACCGCG ACGGACACGA AGTCCCCCGC GGCCAAGGGC GCGACGAAGT TCTACCGGGT GCAGCCGCCG GCCGGGCGCC ACCACGACTC GCTGTGGGAG ATCGCGCAGC GGCACCTCGG CGACGGCCGC CGGTACCAGG AGATCTACGA CCTGAACAAG GACCGGGTGC AGCCGGACGG CTCGATGCTG ACCAAGGCGT CGCTGATCCG CCCGGGCTGG ATCCTGGAGA TGCCGGCCGA CGCGGTCGGC GGGGACCTGG TGAACGATCC GTCGGCGCCC GCGCAAGCGT CGGGACCGAC GCACCCCGGT GCGCCTTCGC ACCAGGGCGG CGGTCCCGCG CACAACGTGC CGGGTCCCGG ACCCGGGAGC GGTGTCCAGC AAGGCGGTCC CGGCGCTCTG CCGGACCCGC ACGCCGTGGG CGGTCTGGGC GGCGTGCCCG GAAGCCCTTC TACCTCGTCG ACCCACGTCC CGCACGGCGC GAGCGCCGCC GCCCTGGACC GCATATCCGC CACCGAGCAG ACGGTGACCC TGCCCGCGGT GACCGACGCC GCCGCGACGG TCGGCCAGGC CGCCGCGGAC GCCGTGCACC ACCTCGGAGA CGGCTCCGGC GAGCGGTCCG ACGCGGTGAA GCTCGGCGCG ACCAACCTTT CGCAGAACCC TCAGTCGCCG CCCGCACCGC GCCAGAGCCC CGTTTCGCCG GCGCACGCGA CCATCGCGGC CGGCAGCCAC ACCCCGAGCA ACGCGGCGCA CCGGAACCAG GCTCCCGCCG GCGAGGAATC GCCGTACCGG CTGCCGCTGG AGCTCGCTTC CGCGCCGCTG CTGGCCGCCG GGCTGCTCGG CGCCCTCGGC CGCAACCGGC GGCGCCAGCT GTGGAACCGG ACCGTCGGCC GCCGGCTGGC CGGGCCCGGC GGCAGCGCGG CCGGCGCCGA GGAGGCGATC CGGCTCGGCG CCGGGCTGGC CGACGCGCGG TTCCTGAACC AGGCGCTGCG CGAACTGTCC GCCTCGCTGG CCACCGCCGG GCGGCCGCTG CCGCCGGTGC AGCTGGCCAA CCTGACCGAA TCAGGGCTCG AACTGCGTCT CGCGGAGCCG GGTCCGGCCG CCCCGCAGCC GTGGCACACC CGGCCGGACG GGCTCGCCTG GTGGGTCGCC CGCACCGACG TCGGCTCGGT CGCCAAGCGG GTCGCGGAGG CGGCTGTGGC GCCGTGTCCG GGACTGGTGA CGGTCGGCGC GATCGGGCCC GACGCCGCCA CCCGCGTGCT GCTGGACCTG GAGGCGTCCG GCGGCGTGAT CGCGGTCGGC GGCGACGACG CGATGCGGCG CGCGGTACTC GCCGCGATGG CCGTGGAGCT GCTGACGAAC ACGTGGTCGG ACAAGATGAC GGTGACGCTC GTGGGCTTCG CCGGCGACCT GTCGTCGCTG GCGCCGGGCC GCGTGCACCA GACCGCGTCG CTGGAAGAGG TCCTGCCGGG TCTGGAGACG GAACTGGCCG AACGCCGCCG CGGTCTGTCC GAGGCCGGTC TGGACTCGGT GCTCGGCGGC CGGCTCGGCA TGGTCGGCGG CGCGGGCTGG CCGCCGCACT TCATCATCAG CGCCGCCCCG GTCTCCGGGC AGACGGCGGC GCGCCTGGCG GCCGTCATCG GCGACCCGTC GCGCCTGGGC ATCGGCTACC TGATCGCCGG CGAGGTCCCG GGAGCGGCGT GGCAGGCCAC GGTCGACGCC GCCGGCCGGC TCCGCCTCCC GGCGCTGAGC CTGGAGGTGA CGGCGCAGCG CCTCCCCGAC GACCAGTACC AGTCGGTCCT GGCTCTGTTC GAAGCCACCC GCGACCTCGA CGGCGAGCCG ATCGTCCCGC TGACCGCCGA AGCCGCGGTC CTGGAAGCCC AGCTCCGCGT GACCCCGACC GTCTCGGTCC GCCTGCTGGG GAACCTGGAG GTCACCGGCG CCTACGGCGA CCTCGAGGAG GACCGGGTCG AGCAGGCCGC CGAGGCTTTG ACGTTCCTGA TGCTGCACCG CGACGGGGTC CACCCGCGCG TCCTCACCTC GGCCCTGTTC CCGCGCGGCG CCACCACCGA GATCGGCGAC CAGGTGCTGC ACCGCCTCGG GACCTGGCTC GGCGTCGCGC CGGACGGCAC CCCGAACCTG GTGACGCTGC CCGACGGGCG CCTGACCGTC TCCCAGAGCG TCCGCAGCGA CTGGGAGATG TTCAAGAACA TGCGCGCCCT GGCCGATCTC GACCCCCGGT ATCAGGACCC GAAGAACCGC GACCAGGTCC TCGGCCAAGC CCTCGGACTG GTCCGCGGAC CGCTGCTGGC GCAGCGCGAG ACGAGCCGCT ACGGCTGGCT GGCGTACGAA TCGGTGGAGA CCGAGGTCCC GGCGGTGATC GCGGACACGG CGATCGAGCT CTGCGAACTC CGCCTGTCCC TCGGCGACGC CGAAGGCGCC ATCGACGCGG TCCGCAGCGG CATGCGCGGG TCGCCGAACG ACGAGGAACT GTGCCGCTCC CTGGTGCGCG CGACGCACGC GAGCGGCGAC GAAGGCAGGC TCCGCGAAGC CATCACGGCG ATCGAGGAGC AGACCCGGGC GGTACACGGC GAACGCGGCC TGCACCCGAA AACCGAGGCC CTGGTGGACG AACTCCTCCC GGGCTGGCGC GAAGGACGCG AGGTACTGGC GGCGAGGGCG TAA
|
Protein sequence | MAHDTDRQYL APQGPPPGMA QQTPVALKRQ RSAADVLTGL GALLALLALV IGVPLALAYF VGWPLPHHMP SGGVLNSKID TKTFTNVLAI LVWLAWAQFS ACVLVEALAA ARGIGMPGHV PLSGGSQVLA RQLVAAVLLI TASAASFAPG LSSLGRTSGD GPHRAPIAAT QVLQQGTRAD TAMPSGPSQR AATSIDARTA TDTKSPAAKG ATKFYRVQPP AGRHHDSLWE IAQRHLGDGR RYQEIYDLNK DRVQPDGSML TKASLIRPGW ILEMPADAVG GDLVNDPSAP AQASGPTHPG APSHQGGGPA HNVPGPGPGS GVQQGGPGAL PDPHAVGGLG GVPGSPSTSS THVPHGASAA ALDRISATEQ TVTLPAVTDA AATVGQAAAD AVHHLGDGSG ERSDAVKLGA TNLSQNPQSP PAPRQSPVSP AHATIAAGSH TPSNAAHRNQ APAGEESPYR LPLELASAPL LAAGLLGALG RNRRRQLWNR TVGRRLAGPG GSAAGAEEAI RLGAGLADAR FLNQALRELS ASLATAGRPL PPVQLANLTE SGLELRLAEP GPAAPQPWHT RPDGLAWWVA RTDVGSVAKR VAEAAVAPCP GLVTVGAIGP DAATRVLLDL EASGGVIAVG GDDAMRRAVL AAMAVELLTN TWSDKMTVTL VGFAGDLSSL APGRVHQTAS LEEVLPGLET ELAERRRGLS EAGLDSVLGG RLGMVGGAGW PPHFIISAAP VSGQTAARLA AVIGDPSRLG IGYLIAGEVP GAAWQATVDA AGRLRLPALS LEVTAQRLPD DQYQSVLALF EATRDLDGEP IVPLTAEAAV LEAQLRVTPT VSVRLLGNLE VTGAYGDLEE DRVEQAAEAL TFLMLHRDGV HPRVLTSALF PRGATTEIGD QVLHRLGTWL GVAPDGTPNL VTLPDGRLTV SQSVRSDWEM FKNMRALADL DPRYQDPKNR DQVLGQALGL VRGPLLAQRE TSRYGWLAYE SVETEVPAVI ADTAIELCEL RLSLGDAEGA IDAVRSGMRG SPNDEELCRS LVRATHASGD EGRLREAITA IEEQTRAVHG ERGLHPKTEA LVDELLPGWR EGREVLAARA
|
| |