Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2697 |
Symbol | |
ID | 8334046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 3090188 |
End bp | 3093898 |
Gene Length | 3711 bp |
Protein Length | 1236 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644955847 |
Product | legume lectin beta domain protein |
Protein accession | YP_003113453 |
Protein GI | 256391889 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0132334 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTCG CCCGGGCGGT TCGGCCGAAA AGGCACCGCC GACCCAGATC GCGCCTGATG ACGTGGCTGA CCGCCGTCGC GATCGGTGTG TCCGGCCTGA CAGGGCCCTT CCCCGCGCAC GCGCACGCCG ACGTGGTCAC GGTGTCCAAC GATCCGGGCC GCACCGGCTG GGACCGCAGC GAGCCGCACC TCTCACCGGG AGCCGTGAAC GGCAGCGACT TCGGGCAGCT GTTCTCCACC GCGCTCACCG GCCAGATCTA CGCGCAGCCG CTGGTGGTCG GCTCGACCGT GGTCGTGGCG ACCGAGGAGA ATCACGTCTA CGGCCTCGAC AGCGAGACCG GCGCGATCAA GTGGTCGACC TACCTCGGCC CGTCCTGGCC GGCCTCGACG GTCGGCTGCG GTGACCTGAC CCCCGACATC GGGGTCACCT CGACACCGGT CTACGACCCG GCCTCCGGCG ACCTCTATGT GACGGCGAAG GTCGACGACG GCGCGAACGC GACTGTCCCG CACTACTACC TCTTCGCCCT GAACGCCGGC ACCGGTGCGG TGGTCCCGGG GTGGCCGATG AGCGTCCAGG GCGCGCCGTC GAACGACCCG ACGAACACCT TCGACGCCGA GGACCAGCTC CAGCGCACGG GACTGCTGTT GCAGAACGGC TCGGTGTACA TGGCTTTCGG GAGCCTGTGC GACGTCCTGC CGTTCCGGGG CTACGTGGCC GGGGTGAACA CGGCCACCCG GGCGCTGCAC ATGTGGACCG CCGAGGCCGG TCCGAACGCC TCGGGCGCGG GGATCTGGCA GGCCGGCGGC GGGATCGTCT CCGACGGCGG CGACGGCATG TTCATCGCCA CCGGCAACGG CGTCACCCCG CCGGCCGGTC CCGGCACCTC GCCGCCGGCC ACGCTCTCGG AGTCGGTGGC GCATCTCGGG GTGGCCGCCG ACGGCACGAT CTCGGCCACC GACTTCTTCG CCCCGACCGA CGCCGGAACG CTCGACGCGA ACGATCAGGA CCTTGCCTCC GGCGGTCCGG TGGCGCTGCC CGACGCCGAG TTCGGGACCA GCGCCGATCC GCACCTGATG GTCGAGATCG GCAAGGCGGG CGAGCTCTAC CTCCTCAACC GCGACGCCCT CGGCGGTCGC GGCCAGGGCA GCGGCGGCGG CGATCAGGTG CTCGGCGAGA CAGCGCTGAC CGGGGTGTGG GGCCACCCGG CGGTGTGGGG CGGGGACGGC GGCTACGTCT ACCTCACCGA ATCGCAGGGC TATCTCACAG CACTGGGTTA CGGACAGACG AGCAGCGGCA CCCCCGCGCT GCACCTGGCC GGCACGAGCG CCGCGACGTT CGGCTACTCG TCCGGCTCGC CGGTGGTGAC CTCCGACGGC ACTTCCTCGG GGTCGGCTCT GGTGTGGGTC GTCCAGTCCT CGGGAGCCGA CGGCGGCGGC GGGCAGCTCA TGGTCTACAA CGCCGTGCCG TCCCAGCGGG TGCTCACGCT GCTGCGCGCG TGGCCGCTGG GCACCGTCTC CAAGTTCACC GTGCCGGCGA CCAACGGCGG GCGCGTCTAC GTCGGCACGC GCGACGGCAA CCTGCTCGCC TTCGGCGCTC CGATCAACCA GGCCTTGGAA AGCGGGCCGG TGCACTTCGG TTCCACGGCC GTGGGTTCCA CCGGCTCCGC GACGGCCACT TTGACCGCGA CCAGGACCGT CACAGTCTCG GCGATCACCG CCCCTGCCCC CTTCGGCGTC GCGCCGCCCG CGCTGCCGGT CACGCTGACC GCCGGCCAGA CCCTGACCGT TCCCTCTACG TTCTCGCCCA CCACCTGGGG CGCGGTCACC GGGCAGATCC AGCTGACGAC CGATCAGGGC ACGATCTACG TCGGACTGGA CGCCACCGGC ACGCAACCAG GCCTGGCCGC CACGCCACCG TCGCTGGATT ACGGGGACAG GGCGGTCGGC AGCGGCGAGA CGCTGGCACT GAGCGTGGAG AACACCGGCA CCACCACCGA GACGTTCACC TCCCTGACCG CACCGGCCGT GCCGTTCACC GCGAACGGAC TGCCCAAGGC CGGCGACACG CTCGCGCCGG GCGCCGCCGA CACCTTCACC GTGGCCTACG CGCCGACCGC CGTCGGCGAC GACCGCTCCT CGGTCGTGCT GACCAGCGAT CAGGGTGCGG TCACGATCCC GCTGACCGGC GTCGCGCAGA TCGCCGACGA TCAGGTCACG ATCAGCCCGA CGACCCTGTC CCTGGGGAAG GTGGACGTCA GCCAGACCTC GGCTCCGCAA TCGTTCACCG TCACCAACAC CGGCAACCTG ACGCTGACCG TCACCAAGGC GGCGCCGCCG ACCGCACCGT TCACGGTGCT CAACCCGATC CCCGAGGGTC AACAACTCGC CCCGGGCGAG AGTTACACGG TCTCGATGTT GTTCACCCCG TATCAGCTGG CGAACTTCAC CGGCACGTAT GCGATCAGCA CCGACACCGG CCAGGGCGAG ATGGACGTGA CCGTGACGGG CACCGGTACG CCGACGCCCG CGATCCCCGT GCCTCCGCCG GGCGGCGACT GGACGGTCAA CGGCTCGGCC CGCATGGCGG GGACGAACCT GGAGCTCACC ACGCCGCACA ACACGCACAG CGCAGGCTCT GCGGTCTACG GCACGCCGGT GCCCAGCGCG GGTCTGAAGG CCGACTTCAC CGCGCGGCTG AACGGCGGCA GCGGGGGCGA CGGGCTGACG TTCAGCCTGC TCGACGCGGC GAAGTCGAAG CCCACTTCAC TGGGCACCGC CGACGGCGGT ATGGGCTTCT CAGGCTTGAA GGGGGTCGCG GTCGTGCTGG GCACCCACCG GGAAAAGGGC GCGCCGTCGG ACAACTTCGT CGGGATCGCG ACCGGGAACC GCGACGGCTC GCTCACCTAT CTCGCGACCT CGACCCATGT CCCCGCCTTG CGCAAAGGCA CGCACGCCGT GACCGTGCAG GCGACCGGCG GCCGGCTCGC CGTGTCGGTG GACGGCAAGG TGGTGCTCTC ACCCGCGGTC GCCCTGCCGG CCCAGGTGCT GCCCGCGTTC ACCGCAGCCA CAGGCGCGCT CACCGACGAT CACATGGTCA GCGCGGCCGT GATCTCATCG GGCGGCGCGA ACCTCCCGGC GCCCGGCGGC GGCTGGTCCT ACAACGGCAC TAGCGCGCTG AGCGGTCCGG ACACCCTCCT GACCCGCGCT GGCCACGACC AGCCCGGCAC CGTGACCTCT CCGACCCCGC TCGGGATCCA CGCCATGGAG CTCCAGTTCA CGTCGGTCAT CGACGGCGCA GCGGCGGGCC GCGGCTTGAC GTTCAGCCTC TTCGACGCCG CATTGGTGAC CCCCGGCGCG GTGGGGGAGC GCGGTGACGG ACTGGGTGCG AGCGGCCTGC CGGGGGTGTA TCTGATCCTG AACACCTACA CCGGCGGCAA GGCGGCTTCG AGCGTCGCGA TCGGAACCAA CCCGGCATCG GGCACCGCAC CGACGCTGCT CTACAGCACC GCGAATCTGC CGAGTCTGCT GGGGACCCAC ACAGTGTGGG TGATGTACTT CGGCGGCTTG CTCAGCATCG CCATCGACGG AACGCTCGTG CTGCAGGGCA CTGTCGCACT GCCTGCCACC GCGCTGCCGT CGTTCACGGC GGGTACCGGG CAAGGCGGCG ACGCGCACCT CGTGCGCGAT GTGTCGCTCC AGTACTTCTA A
|
Protein sequence | MKVARAVRPK RHRRPRSRLM TWLTAVAIGV SGLTGPFPAH AHADVVTVSN DPGRTGWDRS EPHLSPGAVN GSDFGQLFST ALTGQIYAQP LVVGSTVVVA TEENHVYGLD SETGAIKWST YLGPSWPAST VGCGDLTPDI GVTSTPVYDP ASGDLYVTAK VDDGANATVP HYYLFALNAG TGAVVPGWPM SVQGAPSNDP TNTFDAEDQL QRTGLLLQNG SVYMAFGSLC DVLPFRGYVA GVNTATRALH MWTAEAGPNA SGAGIWQAGG GIVSDGGDGM FIATGNGVTP PAGPGTSPPA TLSESVAHLG VAADGTISAT DFFAPTDAGT LDANDQDLAS GGPVALPDAE FGTSADPHLM VEIGKAGELY LLNRDALGGR GQGSGGGDQV LGETALTGVW GHPAVWGGDG GYVYLTESQG YLTALGYGQT SSGTPALHLA GTSAATFGYS SGSPVVTSDG TSSGSALVWV VQSSGADGGG GQLMVYNAVP SQRVLTLLRA WPLGTVSKFT VPATNGGRVY VGTRDGNLLA FGAPINQALE SGPVHFGSTA VGSTGSATAT LTATRTVTVS AITAPAPFGV APPALPVTLT AGQTLTVPST FSPTTWGAVT GQIQLTTDQG TIYVGLDATG TQPGLAATPP SLDYGDRAVG SGETLALSVE NTGTTTETFT SLTAPAVPFT ANGLPKAGDT LAPGAADTFT VAYAPTAVGD DRSSVVLTSD QGAVTIPLTG VAQIADDQVT ISPTTLSLGK VDVSQTSAPQ SFTVTNTGNL TLTVTKAAPP TAPFTVLNPI PEGQQLAPGE SYTVSMLFTP YQLANFTGTY AISTDTGQGE MDVTVTGTGT PTPAIPVPPP GGDWTVNGSA RMAGTNLELT TPHNTHSAGS AVYGTPVPSA GLKADFTARL NGGSGGDGLT FSLLDAAKSK PTSLGTADGG MGFSGLKGVA VVLGTHREKG APSDNFVGIA TGNRDGSLTY LATSTHVPAL RKGTHAVTVQ ATGGRLAVSV DGKVVLSPAV ALPAQVLPAF TAATGALTDD HMVSAAVISS GGANLPAPGG GWSYNGTSAL SGPDTLLTRA GHDQPGTVTS PTPLGIHAME LQFTSVIDGA AAGRGLTFSL FDAALVTPGA VGERGDGLGA SGLPGVYLIL NTYTGGKAAS SVAIGTNPAS GTAPTLLYST ANLPSLLGTH TVWVMYFGGL LSIAIDGTLV LQGTVALPAT ALPSFTAGTG QGGDAHLVRD VSLQYF
|
| |