Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5029 |
Symbol | |
ID | 8336383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5760980 |
End bp | 5763826 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644958128 |
Product | hypothetical protein |
Protein accession | YP_003115730 |
Protein GI | 256394166 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0984885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACACA GAACACGGCT GCGGCGCACT GCCGCCGCCG GCGCTGTCGC AGCCCTCGTG GGAACGATGC TCACCACGAC CCTGACCGGT CCGGCGAGGG CCGACTCGGT CACCGCGAAC CAGGCCTGGC GCATCGCCCA GCAGTACACC GGCGTCTGGA CGAGCCCGCC GTCCGCCCTG ACCAACGGCG AGACGGTGGA CGCCCCGATG CTGGGCAACG GCGACATCGG CGTGGCGATC GGCGGGTCGA TCGCGAATCA GACCATGTAC CTCGGCAAGA ACGACTTCTT CTCCGGGTCC GCCCACGCGA TCAAACCGCT GGGACGGATC GTGGTCACCG CGGCCGGGCT GAACGGCTCG TCCTACCACG TCGTCCAGGA CATCGCGCAC GCCGAAGTGC GCGGCACGTA CACCCTGGGC AGCCAGACGC TGAGCACCAC GAGCTGGGTC GACGCGAACT CCGGCATGTA CGTCACCTCC TTCGCCCTGA CCGGCGGCAG CGCGCAGAGC ATCGGCATCG CGCTGCAGAA CGGGAGCGGC GGCACCCCGA GCGTCAGCAC CAGCGGCAAC GACCTGGACG CCGACGTCGC CGCGGACACC GGAACCGGCA GCGACCCGCA CGCCCGGATC GCCGCGCGCA CGATCGGGCA GACCCAGTCG ATCTCCGGCA ACAAGATCAC CCTGACCATC CAGCCGGGGA CCACGTCCAC TCTCGTGGCC GGGATCGTCT CCAGCATCGA CAGCTCTTCG TGGCAGTCCG GCGCCGATGC GCTGGTCGGC TCGCTGGCTC AGGCAGACGT CGCCAACCAC AACGCCGCGC ACCGTTCCTG GTGGCAGAAC TACTGGCAGC AGTCCTACGT CGAGATCCCC GACAAGACGG TGGAGAAGAG CTGGTACGGC TCGCTCTACC TGCTCGGCTC CGTCTCGCGC GCCGGGAAGT ACGCTCCCGG GCTGTGGGGC AACTGGATCA CCGGCGCGAT GAACTGGAAC GGTGACTACC ACACCAACTA CAACTACGAG GCGCCGTTCT ACGCCGCCTT GTCCACCAAC CACATCGCGC AGATGGCCGC CTATGACCAG CCGGTGCTGG ACTGGCAGTC CGGCGGCCAA TCGCTGGCGT CGCAGAACGG TTTCTCCGGC GTGCTGTACC CGGTCGGCTT GTCGCCCAAG GGCACCAGCG CCGACATGAA CCTGCACAAC CAGAAGTCCA ACGCCGCGAA CCTCGCCAGC GACATGGTGA TGCGCTTCGA GCACACCGGC GACACGTCGT ACGCGACCAC CGTCTACCCG TGGCTGAAGC AGGTCGGGCT GTTCTGGCAG AACTACCTGA CCTGGGACGC GGCGAACAAC CGGTATGTCA TCACCAACGA CGCCCCGCAC GAGGACCAGT CCTACCCGCA GACCAACAGC GGGCTGTCGC TCGGGCTGGT GCACCTGCTG TTCCAAGGCC TGATCGACAT GAGCACGGCG CTGAATCAGG ATGCTTCGAC CCGCGCCACC TGGCAGAACA TCGAGTCTCA TCTCAGTGCC CTGCCCACGA TGTCGCTGAA CGGGCAGACC ATCCTGCGCG AGACCGAGGT CGGCAGCGAT TTCATCAACG ACGGCAACGA CATCGACTCC CAGGCGATCT ACCCCGGCAG CTTGATCGGC CTGGACAGCG ACGCGGCCTC GCAGCAGAAC GCCCGCAACA CCATCGGCGC GCTGACCAAC GCCTGGCACG GCGGCAACGC GCCGGCCACG TTCTACGCCG CGGCGGCGCG CGTGGGCTAC AACCCGAGCA CGATCCTGTC CAACCTGGAC TCCGAAGCCG CGAACAACGC CTATCCCAAC ATGGCGATCC ACCACAACGG CGGCGGCATC GAGAACATCA ACGTCACCAC CTCCGGGCTG GACGAGATGC TGCTGCAGTC CTTCCAGAAG GACGTCAAGG TGTTCGCCGA CTGGCCGGCG AACACCAACG CGAAGTTCGG CGACCTGCTC GCGTACGGCG ACTTCCTGAT CTCCTCCAGC AAGTCCGGCA ACGCCGTCCA GTACATCCGG GCCGTCAGCC AGAAGGGCGG AAGCCTGACC GTCACCAACC CCTGGTCCGG CAGCGTCGAG GTCTACCGCA ACGGCACCGA CACCGGCGCC GTGTCCGGGG CGAAGCTCAC GATCGCGACC TCGGCCGGCG ACACGATCGA CCTCGCCCCG GCCGGTACCT CGCTGGCGAC CATCCAGTCC GAGCTGTCCC AGCCGCTGCA GACCACCTCC AGCGGCAGCT TCAGCTCCGG ATTCGAGAGC AGCGACCCGG CGGTGAGCTG GAGCGACACG GTCGACAGCA GCGGCGGCGG CAGCACGGGC GTCACGGGGA TCTGCTGCGG CGCACCCGGC CCGGAAGCCG GAGTCCGCAC CGGTGAGACT TCGCACACCG GGTCCAGCTC GTTGATGTAC TCCGGATCCG CGCAAGGCGG CACCAACGAC TATGCGTACC TGAAGGTCTA TGACCTCAGC GGCAGTCCGC TGGCGATCGG ATCCGGGAAG ACCCTCGGCT ACTGGATCTA TCCCCAGAGC AACGCCACCA GCACATGGGT CCCAGCCGGT TCCACGAACA GCAGTTGCGT CGCCGTCGAC ATGGTCTTCA CCGACGGCAG CACCCTGAGA GACTCCGGCG CCGTGGATCA GAGCGGCACC AAGATCCATC CGGCGAACCA GTGCGGGCAT CTGACGCTGG ACGCCTGGAA CCATGTCACG GTCAATCTGG GGACGAACAA CGCCAACAAA CAGATCAGCC GGATTCTGGT CGGCTACGAC CATCCGAACT CCACCGGCGG TTACCGCGGC TACGTCGACG ATCTGACCGT CAGCTGA
|
Protein sequence | MRHRTRLRRT AAAGAVAALV GTMLTTTLTG PARADSVTAN QAWRIAQQYT GVWTSPPSAL TNGETVDAPM LGNGDIGVAI GGSIANQTMY LGKNDFFSGS AHAIKPLGRI VVTAAGLNGS SYHVVQDIAH AEVRGTYTLG SQTLSTTSWV DANSGMYVTS FALTGGSAQS IGIALQNGSG GTPSVSTSGN DLDADVAADT GTGSDPHARI AARTIGQTQS ISGNKITLTI QPGTTSTLVA GIVSSIDSSS WQSGADALVG SLAQADVANH NAAHRSWWQN YWQQSYVEIP DKTVEKSWYG SLYLLGSVSR AGKYAPGLWG NWITGAMNWN GDYHTNYNYE APFYAALSTN HIAQMAAYDQ PVLDWQSGGQ SLASQNGFSG VLYPVGLSPK GTSADMNLHN QKSNAANLAS DMVMRFEHTG DTSYATTVYP WLKQVGLFWQ NYLTWDAANN RYVITNDAPH EDQSYPQTNS GLSLGLVHLL FQGLIDMSTA LNQDASTRAT WQNIESHLSA LPTMSLNGQT ILRETEVGSD FINDGNDIDS QAIYPGSLIG LDSDAASQQN ARNTIGALTN AWHGGNAPAT FYAAAARVGY NPSTILSNLD SEAANNAYPN MAIHHNGGGI ENINVTTSGL DEMLLQSFQK DVKVFADWPA NTNAKFGDLL AYGDFLISSS KSGNAVQYIR AVSQKGGSLT VTNPWSGSVE VYRNGTDTGA VSGAKLTIAT SAGDTIDLAP AGTSLATIQS ELSQPLQTTS SGSFSSGFES SDPAVSWSDT VDSSGGGSTG VTGICCGAPG PEAGVRTGET SHTGSSSLMY SGSAQGGTND YAYLKVYDLS GSPLAIGSGK TLGYWIYPQS NATSTWVPAG STNSSCVAVD MVFTDGSTLR DSGAVDQSGT KIHPANQCGH LTLDAWNHVT VNLGTNNANK QISRILVGYD HPNSTGGYRG YVDDLTVS
|
| |