Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_1764 |
Symbol | |
ID | 8333107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 1996531 |
End bp | 1999488 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644954914 |
Product | hypothetical protein |
Protein accession | YP_003112526 |
Protein GI | 256390962 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCGG GCGGGGGGCC GGCGGCGCCG AGCACGGTGC TGCCCGCCGG GCGCGCCGGC GTCTGCCGCG CGGTGGTGGG CTGGCTGACC GCGCCGGAGG CACCCGCCTC CTCGGCTTCC GCCTCCGCCC CCGCCCCCGC GAGCACGCCG CGCCTGCTGC TGGTCACCGG CCCGCGCGGC GCCGGCAAGA CCTTCACCCT GGCCTGGGCC GCCTCGGAAC TCGCGGTCCT GGCCGAAGCC GCCGACGACG AGCGCTTCGC CTGGGTCTCC GCCCGCGGGC TGACCCCGCG AGTGCTCGCC TGGCAGCTGG CCGGCCAGCT CGGCGTGACC GTCGGCGACG CGCGAGGGCT CGCGACGGCG ATCGCCGCCC GGCTGCGCGG AAGCGAGGCG AAGGCCGGCG CGGAGTCGGG GAGTTCGGGC GGCGGGCTCG GCGGGTCCGG CGGGTCCGGC GGGTCCGGGT TCGGGCCGGG GGTCGCCGAC GGGGTCGCGG CTGACGGGCT GAGGGGGATC GCCGACGCGG ATCAGGGCTC GAGTGCAGCG CGCGGGGACC GCGGTTCGCG GGACGCGGGG GGCGCCAGAT CTGATGCCGC AGGCTCGGAT TCTGCCGGAT CAGATAGCCG CAGCGACGAT CCCGATAGCC GCAGCGACCA TCCCGGCGCC GGAGCTCCTC GATCAGACAG TCACAGCGAT CATCCCGGAG CTCGCGTCAG AGTTCCCGGA AGCCGATCGG CGACGTGGGA TCTGTCGGCG AGCGTCACCG ACCCGGTTGG CGAAGCCGGC CTCTGGTCGC GCGTCACGAT CCTGATCTCC GAACTCGACC TCGCCGGCCA CCTCCGCGAC GGCTCGTCGC GCCAGGCGGT GGTGGTCGAA CTGCTGGCGC CGCTGCTGGC GGTCTCGGGC GTCCGGCTGA TCGTGGAGTC GGCGCACACG GACGTGGCGG ACGCCGCCTC CCGCGCCGAC ACCTCGGCGA CCGTCATAGA CCTCGCCGAA CCCCGCTGGA CCGACCGCGC CGAATTCGCC GCGTGGCTCG ACGACCTGGT CCCCTCCGGC GGCGCGGCAG CCCCCCAGGC GTACCCGAAC CCCACCAAGG CCCGCGAGAT CCTCAGCCTG GCCTCCACCC GCCACGCAGG CCTCCTCGCC GCCAGCGACG AACGCCCCTG GAAGCTGGCC GCCGTCCCCT TCGACGTGGC ATTCCGCGAA GGCATCGCCG ACCAGATCGT CCTGGAGCCG GCCAACCTGG TCCTGGTCCA GCAACCGGCC CTCTCCACCG CCATCGACGC CCTCGGCACC GCCGTAGCCC CCGGCACCCG AGCCGCCTGG CGAGTAGCCG GCCAAGCCCT CACCAACTCC ACCACCGTCC CCGAACGCGC CGCCATCCTC CACGGCGCCG CCCTAGCAGT AGGCGACCAC ACCCTCGCCG AAGCCGTAGC CCCCTACACC CGAGGCACCC GCCCCTCCGA CGGCGCCCCA GTAGGCGACC CCATCCCCTG GCTCACCCGC TGGGCCGGCT GGCGCCCCCT CCCACCCGAC ACCCCCCGCC CCACACCCTG GCCAGGACCA GTGGCCGCAC TGGCGGTGGC GGCAGCAGGC CCGGCGACGC AGGCGATCGT TCCAGGAGCA CCGGTCGCCG CAGTGCCGGG GACAGCCGGC GCCGCCGACA TGGCGATCGC CGCGAACATG GCGGGCGCTG CTGGAGCGCA GGCTGGTGCC GGCTCGCCGA GCGCTGCCGG GCACGCTGGT GCTGGGTCGC AGGACGCGGC TGGTGATGCT GGTTCGCAGA TTGCCGCCGC GCAAGCTGGT GCTGCCGGCT CGCCGGTCGC CGCTGGGCTG GCTGGCGGTG AGCGTGCTGG TGCTGGGTCG CAGGGCGCTG CTGGCTCGCC GATGGCCGGT GCGCATGCTG GCGCTGGCTC GCCGATTGCT GCCGGTAATG CTGCTAGTGC GCAGGCTGCC ACCGTGCCGA CTTCTGCTGT TGCCGGCTCG CAGGATGGCG GTGCCGCCTC GGCTTCCGCT GCTTCCCCGT CTTCCGCCGT TGAGCCGACG ATAGCCGTGC CGCTTCCGGG TGGGGGTGGG TTGGCTGCTG AGCTGGCGGC GGCTGCTGCT GAGTTGGCTG AGGCGGTGCT TGAGTCCGCT GCGCAGGGTC GGGGTGGGCG TGCTGGTTCT GGGGTTCTGG TGGCGCCGGT CGCGGGTTCG GCGGCTGGGG CTTGGTTGCC TGATCCGGAG CCGACGCCGG CGGAGGTGTT GGTGGCCATT GATGATTTGG CGCGGGCTTA TCGGTTGGAT CCTCGGAGTG GGCGGATTGT GGGGCGGCCT GCTTCGCCGG TGATGGTGAA GGCTCGGGCT GCGGCTGTGG TGGATGCTTC GCCGAGTTTG GTGCTGACTG ATTCGAGTCG TCGGTTGGCT GCGCTGGGTC CGACTGCGCC GCGGGGGGCG GTGGCGGCGG TGCGGGATGT GCTGCCCGAC ACCGTGGTGA CCGCGGTTGC GGTGGCGCGG GATGCGTTGG TTTTCGGTGC CGGGGATGGT CGGGTGCATG TCTGGGACGT CGATACCGAG GAGCTGATCG CCGATCCGGA GGACCAGCAT TCCGGTGTGG TGACTGCGGT TGCGGCCATC TGGATGCCCG ATATGGACGT GCTGTTCGCG CTCTCCGGTG GTCAGGACGG CACGTTCCGG CTCTGGGCCG TGCCCGCCGA CGCCAGCCTG CTGCCGGTGG AGGATCGCGG CGTGCCCGTC ACGGCCGTCA CCGCCGCGAT GACGGAAGTG GGTCCGGTCG CCGCCGTCGC GTGGGCCGAC GGCTGCGTGA CCGTGTGGGA TCTCGGCGAG GCACGGACCG GGCGGCCCAT TCCGCTCGGC TGGGCGCCGA AGGCGCTGGC ACTCGCGGGT GACGGGCTGC TCGTCGCGGC CGGTGACGAT GGGCTCATCG CGATCGACCT GCACCTCGCC GAGTTCTTCG AGGACTGA
|
Protein sequence | MAAGGGPAAP STVLPAGRAG VCRAVVGWLT APEAPASSAS ASAPAPASTP RLLLVTGPRG AGKTFTLAWA ASELAVLAEA ADDERFAWVS ARGLTPRVLA WQLAGQLGVT VGDARGLATA IAARLRGSEA KAGAESGSSG GGLGGSGGSG GSGFGPGVAD GVAADGLRGI ADADQGSSAA RGDRGSRDAG GARSDAAGSD SAGSDSRSDD PDSRSDHPGA GAPRSDSHSD HPGARVRVPG SRSATWDLSA SVTDPVGEAG LWSRVTILIS ELDLAGHLRD GSSRQAVVVE LLAPLLAVSG VRLIVESAHT DVADAASRAD TSATVIDLAE PRWTDRAEFA AWLDDLVPSG GAAAPQAYPN PTKAREILSL ASTRHAGLLA ASDERPWKLA AVPFDVAFRE GIADQIVLEP ANLVLVQQPA LSTAIDALGT AVAPGTRAAW RVAGQALTNS TTVPERAAIL HGAALAVGDH TLAEAVAPYT RGTRPSDGAP VGDPIPWLTR WAGWRPLPPD TPRPTPWPGP VAALAVAAAG PATQAIVPGA PVAAVPGTAG AADMAIAANM AGAAGAQAGA GSPSAAGHAG AGSQDAAGDA GSQIAAAQAG AAGSPVAAGL AGGERAGAGS QGAAGSPMAG AHAGAGSPIA AGNAASAQAA TVPTSAVAGS QDGGAASASA ASPSSAVEPT IAVPLPGGGG LAAELAAAAA ELAEAVLESA AQGRGGRAGS GVLVAPVAGS AAGAWLPDPE PTPAEVLVAI DDLARAYRLD PRSGRIVGRP ASPVMVKARA AAVVDASPSL VLTDSSRRLA ALGPTAPRGA VAAVRDVLPD TVVTAVAVAR DALVFGAGDG RVHVWDVDTE ELIADPEDQH SGVVTAVAAI WMPDMDVLFA LSGGQDGTFR LWAVPADASL LPVEDRGVPV TAVTAAMTEV GPVAAVAWAD GCVTVWDLGE ARTGRPIPLG WAPKALALAG DGLLVAAGDD GLIAIDLHLA EFFED
|
| |