Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3829 |
Symbol | |
ID | 8335182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4332034 |
End bp | 4333992 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644956966 |
Product | WD-40 repeat protein |
Protein accession | YP_003114569 |
Protein GI | 256393005 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0243668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0465096 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAGC TGCTCAAGAG CGATCTGTGC GGCTGGACCG ACGACCGGAT GCTCATCCTG CACGACGAGA AGAACAAACA TGATCTGCTC GCCCGTATCA CCGACGTCCT GCAAGCCAGA ATGGAGCAGA TCACCGATAC TGTGCTGTTC TACTACGTCG GCCACGGCCT GCTGATCGAC GGAGGCGAAA AGCTCGGGCT GGCCATGAGC GACACGAAAG ACGATCCCCT GCGCAAAAAG CAGAGCTCTT TACTCCTCAA CGAAGTGCGT GAGCAGCTGA GACATCTGAC GAACGCGACG CAGGTGGAGA TACTGGACTG CTGCTACGCC GGAACCGCCT CGAACACGCA GGGCGCCGCC AACGCATTGC AGGACAAGGT ATCGCGCGCG GTGCAAGACA AAGTATCGCG CGCAACGCAG GAATATCGTG GCAGCCGCTA CACGGTGACC GCCTCACGGC GCAACGAGGA AGCCCTCTAC CAGCTCGAAG ATGGCGGGCT GACCTACTTC ACCGGCTACC TCGCCGAGAT CGTCACCCAC GGCATCGAAG GCGCGCCGGC CCACCTCGCA GCGATGCGCA TCTTCCCCGA GCTGCAGAAA CGCTTCCGAA GGCTCGCGCT CGCCCCGATC GGCCAGCCGG TTCCGATGCC GACCCAGTTG GTCGTCAACG AGGGCGGAGA GTTCCCGTTC GCCCGCAACG CCGCCTATCA GGAGCACTCT CAGGAGTCGC ACCCGCCTGT CGTCGCCGAC CTCGACCCGG AGCATCGCCG CCTGCCCAGC CGTCGCGCCA TGCTGGCTGC GGGGGTGGCG GCAGCGACCG GCTTGGGAAT CGTCGCGGCG CTGTGGCCGG ACTCGGGTCC GGGAGGGAGT CAGGAGAAGA TCGGTAGCCC GGTGTCACCC GGCGCAGATA GATCTTCGCT CCCCCCGACC CCAACGATTC CGACGACACC GACACCGACA CCAACCCCGA CGATCCCCAC GACCACGCCA ACACAGGTCA CCGACGCCAC CGCACTGGGC GCACCCCTCA AGGGCTTCGA AGCCACTGTC GAGGCGGTGG CGTTCGCCTC GAGCGGAACG CTTCTGGCGG GCGGCAGCTA CGACCACACG ATCCACCTGT GGGACGTCGC CAACCCCGCC ACGCCCCTTC AAGTCGGCCC GTATCTGACC GGCGACACGG ACAGCGTCAA CGCCGTGGCG TTCAGCCCGG ACGGCCGGAC CCTCGTCGGC GCGAGCTGGG ACAAGACGAT CCGGCTGTGG AACGTGGCAA GCCCGCTCCA CGCCGTGGCG ATCGGCCGGC CGGTCATCGG CACCGACAAG GTCAACACCG TCGCGTTCAG TCCGAACGGA AAGACATTGG CCAGCGGCGG CGATGACAGG ACAGTGCGGA TGTGGAACAT TGCGGACCCG CCCGCCCTCG CACCCGCGTG CCCACCTCTG ACCGACCACA CGAACGCCGT CAACGCCGTG GCGTTCAGCC CGGACGGAAC CATCCTGGCC AGCGCCGGCT GGGACTTCAC GATCCGGCTG TGGGAGGTCA GCGACCCGGC CCGGACTCGC GCCGCCGGCC GACCCCTCAG CGGCCACACC TACTCGGTCG CCGCAGTGGC GTTCAGCCCG GACGGAAAGA CCCTGGCCAG CGCCGGCTGG GACAACACCG TCCGGCTCTG GGACGTGAGC AACCCGGCCC AGGCCACTGA GATCGGCCTT CCGCTGACCG GCCATACCGA CCACGTGCAG GGGATCGCCT ACAGCCCCGA CGGCCGGACG GTGGCCAGCG GCAGCTGGGA CAAGACGATC CGCCTGTGGG ACGTCAGCGA CCCGACCCGG GCCAAGCCCA TCGGCTCACC GCTCATCGGC CACACCGGCC AAGTCGCCTC GGTGGCGTTC CACCCGATGG GGAAGATCCT GGCCAGCGGC AGCACCGACT CGACCATCCG GCTGTGGCAG ATCGAGTGA
|
Protein sequence | MEQLLKSDLC GWTDDRMLIL HDEKNKHDLL ARITDVLQAR MEQITDTVLF YYVGHGLLID GGEKLGLAMS DTKDDPLRKK QSSLLLNEVR EQLRHLTNAT QVEILDCCYA GTASNTQGAA NALQDKVSRA VQDKVSRATQ EYRGSRYTVT ASRRNEEALY QLEDGGLTYF TGYLAEIVTH GIEGAPAHLA AMRIFPELQK RFRRLALAPI GQPVPMPTQL VVNEGGEFPF ARNAAYQEHS QESHPPVVAD LDPEHRRLPS RRAMLAAGVA AATGLGIVAA LWPDSGPGGS QEKIGSPVSP GADRSSLPPT PTIPTTPTPT PTPTIPTTTP TQVTDATALG APLKGFEATV EAVAFASSGT LLAGGSYDHT IHLWDVANPA TPLQVGPYLT GDTDSVNAVA FSPDGRTLVG ASWDKTIRLW NVASPLHAVA IGRPVIGTDK VNTVAFSPNG KTLASGGDDR TVRMWNIADP PALAPACPPL TDHTNAVNAV AFSPDGTILA SAGWDFTIRL WEVSDPARTR AAGRPLSGHT YSVAAVAFSP DGKTLASAGW DNTVRLWDVS NPAQATEIGL PLTGHTDHVQ GIAYSPDGRT VASGSWDKTI RLWDVSDPTR AKPIGSPLIG HTGQVASVAF HPMGKILASG STDSTIRLWQ IE
|
| |