Gene Information |
Locus tag | Caci_8624 |
Symbol | |
ID | 8340006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 9996041 |
End bp | 10000627 |
Gene Length | 4587 bp |
Protein Length | 1528 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644961706 |
Product | YD repeat protein |
Protein accession | YP_003119281 |
Protein GI | 256397717 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.165231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.776649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGCGTC CGTCTGGTTG GGACATTCTT GGTTTGGATG GGGATCCGAC GCCTGGTGTG GTGGAGTCGG TGCAGGCGTT GGCGAAGGAG TTCGGGGACT TCGCCCACGA CGTGGAGGCG GCGTATCGGA GCCTGAACAG CTTCGGTTCG GACACTGCGG CGCTGGAATG GGTGGGTCAG ACCGCTGAGG CGTTCAAGGG CAAATATGGT CCCCTGCCCG GTCGCCTTCA GAAGCTGTAC ACGTCCTACA GCGAGGCCTC CGATGCGCTT TCGGCGTATT GGCCGTTGTT GCAGGCAGCG CAGACGAAAG CCGACACGGC GCTGCGTCAG GCTCAGGATG CGCATGCCGA TCTGCAGCGT GCCACCACCA GCGCCACCAA CGCGGCGACG GATCTGAAGA CCGCGCAGCA GAACCACGCC GCGACACCGA ACCCGCAAGC GGTTACCGAC GCGCAGACCG CGCATGACAC CGCACAAACG AACCTGAACA ACGCCAAGGC GCAGATGGCA GCACTGACCG AGCAGGCCAA CGACGCCTAC AACGACCGCA TCAACGCCGC CAAAACCTGT GGCAGCGCCC TGCATCACGC CCAATCCGAC GGCATCCACA ACAAGTCCTG GTGGCAGCAC GTCGGCGAGG ACCTGTCCAC ATGGGGCGGC GAGATCGGCA AAATCGCCGG CGAGCTCGCC CCCGTCCTGG ACATCATCGC CCTCGCGACC TCCTGGATCC CCGGCGTCGA CGTCGTCACC GCAGCCCTAG CCGAAGCCGA CAACATCGTC GCCCTAGCCG GCACCGCCAT GGCCACCATC GGCGACGCCA TGCAAGGCCA CTGGGGCGAC GCCCTCCTCG GCGCCGGCAT GCTCGCCCTC ACCTTCGTCG GCGCCCGGGC CTTCGGCTCG GAAGCGGAAG CGGTCGAAGG CGAGGCCGGA GGTCTTGAGG GGGAGGCTGG TGCGCTTGAA GGCGAAGAAG GCGCCGCAGC CCGTACCGCT GAGGGCAACG AACCCGCGAA CATCGCAGAC GACCCTGTCG ACGTGGTATC GGGCTGGATG CTCACGGATG CCTGCGACCT GGAACTGCCG GGTGTACTAC CCGTTGTGTT GCGGCGCGCG TACGCCTCCG GGTACACCAC CGGCCGCCTG TTCGGCCCCG GCTGGTCGTC GACTTTGGAT CAGCGTCTGT CGATCAACGA GGCGGGAATT CACTTCGCCG GCGACGATGC CCAACGCCTT GACTACTCGA TCCCCAACGG CGATGAGGAG CTGCTGCCCG AGCGCGGGAG CCGCTGGCCC TTGATATGGG ACCGTGAGCT GGACGAGATC CGCATCACCG ACCCGTGGTC CGGGCAGACC CGCCACTTCT CGACGGTACA TTTCAATAGC GAACTCGGGC AGATCCGCGA CCTGACCGCG ATATCCGACC GCAACGGCAA CCGAATCACC GTCCTGCGTG ACGAGCACGG CACCCCGACC GGGCTCGAAC ACCCAGCCTA CCGCGTCGCT GTCGACACGA TCGCGAGCGC GGCCGGCCCA CGAGTCAGCG GCCTGCGATT GCTGGGCGCT CGTGAGACCG GCTCGGACAT GGTCATGAAG CGCTTCTCCT ATGACGGTCA AGGCCGCTTA ACTGGTGTGA TCAATTCGTC CGACGTGCCG TTCAGTTACG AGTGGAACGA CACGGACCGC ATCTCGGCTT GGACCGACCA GGCCGGTTAC CGCTATGGCT ACCACTACGA CGCGATAGGC CGGGTCGTCC GCGGCCAGGG CCAGTTCCTC GCTGGATCGT TCACCTACGA CCCAGCGCAC 
CGCACGACTG TCCATACCAA TTCATTGGGT CACGTGACCA CGTTCGTCTA CGACGAAAAC GGCCACGTAT GCTCCGAGAC TGATGCGCTG GGTCACACCA CACTTACCGA GACCGACCGC TACGGCATGA TCCTGTCCCG GACCGACGCC CTCGGTACCC GCACCACGCT GGTCAGGGAC GAAGCGGGCA ACGTCCGAAG GCTCGTCGCG ACTGATGGCG CGACTGCCGA GCTCGAGTAC AACCACCTCC ACCAGGTCGT CGCAGCCACC GGGCGGGGCG GCGCGACTTG GCGACGTGCC CATGACGACC GGGGCAACCT GGTGTCCGCG ACCGACCCGG CCGGGGGTGT CACCGAGTTC GAATACACCG CCGAGGGCGT GCTAGCCGCG ACCACGGACC CGTTGGGCGC CGTAACCCGG TTCACAACGG ACGCCGCTGG CTTGCCCGTA ACGGTTCTGG ACGCCGTCGG CAGTCTGACG CGCGTCGAGC GAGACTTCGC CGGCCGGGTG GTGCGATTGA CCGACCCGAT GGGTGCTGTG ACCGCCTTCG AGTGGAACGC CGAGGGCCTG CCAGTATCCA GGACGCTGCC GGACGGCGCC CGAACCACCT GGCGGCACGA CGATGGCGGT CGTCTGCTGG AAGTCGTTGA CGCCATCGGC GCGACTACAA GGTTCGAACC GGGCCCCATG GAGACGTTGA CCGCTCGCAC CGGTCCGGAC GGCGTGCGGC ACAACTTCAC CTACGACAGC GAACTGAAGC TCGTCGCGGT CACCAATCCT CATGGTGCGG CCTGGACGTA CACCTACGAC GCAGCCGGGC GCCTGACCGG CGAGAGCGAC TTCATCGGCC GCCGTCTCGG TTACGAATAT GACGTCGCGG GACGCCTGTG CGCGCGGATC ACTGGAACCG GTCAACGTCT GACCCTCGAT CGCGACGCCA CAGGCCGAAT CGTGGCGCGC CACACGCCGG AAGGCGACTA CGGCTACACC TACGACAGCG CCGGACGCCT CGTGGTTGCC ACCGGCCCGA ACTCCGAGCT GTCCTACGAA TACGACACAC TGGGCCGACT CCTCGCCACC ACCACTGACG ACCGCACAAT GCGGTACAGC TACGACCTGG CTGGCCGACT CATCCGCCGC ACGACTGCCT CAGGCGTCGA GTCGACATGG ACATACGACT CGGCTGGCCG TGCGGTCGGG CTCGATACCG GCGGCGACCG GCTGGAGTTC GGATTCGATG CGTCGGGCCG GGAGATCGAG CGGCTCATCG GCTCCGAGGT CTGGCTCACT CGGAGCTATG ACGTGGTCGG TCGGCCGATC GCCCAGAGCG TCGGCCGCGG CAGACGCCAC CACGACCAGG CGGCCCAGAC GGACCTCATA ATCGCAGGCA GCAGAACTGG GAAAGCTGTA CAGGAAGTTC TCCAGCGCAG CTGGGTCTGG CGGCAGGACG GCGTACCCGA AGAAATCAGG GACAGCCTGC GCGGCACCAG TCGCATCGCT TCCGATGCCG CCGGGCGCGT CACCGCAATC AGCGCCCACA GTTGGAGCGA GTCCTACGCG TATGACGCGT TCGGCAACCT CACCGTCCAG GACGATGCCG CGGGCCCACT AAACCTCCCT GCTACCGCTG CGGGAGAGGG AGCCGGTTTC GCAACCAGAA CCCTTATTCG CCGCTCGGCC CGAGCCCGGT ACGAGCACGA CCAGGCAGGC CGGCTGACCA GGAGCGTACG TCGGACGCTT GATGGGCGCA GCAAGGTAAC CCAGTACGTC TGGGACTCCG AAGACCGGCT GGTTCACGTG ATCACCCCGG 
AAAACGGCAC CTGGCACTAC TCCTACGATC CGCTCGGGCG ACGCACCGCG AAGACCAGGT TCGCCGACGA TGGAACGGTG GTCGATCGGG TCACCTTCTT GTGGGACGGC CCGCGCCTGG CTGAGCAGAG CGTACAAGGC CCCGAAAGTT TCACCGTGAC CCTCACCTGG GACTACGACC CGGGTACGTT CCGCCCTGCC ACACAACGCC GCCGCAACCG CCTGGGCGAC GCCGACCAGA CCGTCATCGA CGAGGCCTTC CACGCCATCG TCGCCGACCT GGTCGGCTCG CCCACCGAGC TCGTCACCCC CGACGGGCGT GTCGTCTGGC ACACCACCAC GTCGTTGTGG GGGCGCACTA TAGGCACTTC CGCCGAATCC GGTGTGGACT GCCCGCTGCG CTTTCCCGGT CAATATCACG ACGACGAATC CGGCCTGCAC TACAACCTGA ACCGCTACTA CGACTCGGAA ACCGCGGCAT ACCTGACCCC GGACCCGCTG GGGCTGGTAC CCGCTCCCAA CGACCACGCC TATGTGCCGA ATCCTCTGAC CGTCTCCGAC CCGCTTGGGC TCAGCTACGA AGGTCCCAAC GGGCAGACGA TGTATCCGAA CAACATGCCC GGCACCTTGG ACACCGAACT TGCCCAGGCT GACCGTCTCG GAGTCGTGGT ATCTTCGCCC GGTACGTCCG AATTCGACTC CGCCATCGCT TCAGGAACAG TGAAATGGGC CGTGAAGGAC GACGGAAGCA TCGTCGTGAT GCCGAAGTTC GTAAACGGGC AGGAGATATC CCACTCGGTC CTGACCCGCG GGGCGTCTGT TCAAGCCGCT GGAGAAGCCG AGATAGCCGG TTCAGGAGCC GATGGCTACT TCGGCCTCGA TATCAACGAC CACAGCGGAC ACTTCTTCGA ACCTGGATGG AATGTGGCCA GTATCGGTAA AGACGCGTTC TCGGGAGCGG GGGTCCTGTT CCCATGA
Protein sequence | MARPSGWDIL GLDGDPTPGV VESVQALAKE FGDFAHDVEA AYRSLNSFGS DTAALEWVGQ TAEAFKGKYG PLPGRLQKLY TSYSEASDAL SAYWPLLQAA QTKADTALRQ AQDAHADLQR ATTSATNAAT DLKTAQQNHA ATPNPQAVTD AQTAHDTAQT NLNNAKAQMA ALTEQANDAY NDRINAAKTC GSALHHAQSD GIHNKSWWQH VGEDLSTWGG EIGKIAGELA PVLDIIALAT SWIPGVDVVT AALAEADNIV ALAGTAMATI GDAMQGHWGD ALLGAGMLAL TFVGARAFGS EAEAVEGEAG GLEGEAGALE GEEGAAARTA EGNEPANIAD DPVDVVSGWM LTDACDLELP GVLPVVLRRA YASGYTTGRL FGPGWSSTLD QRLSINEAGI HFAGDDAQRL DYSIPNGDEE LLPERGSRWP LIWDRELDEI RITDPWSGQT RHFSTVHFNS ELGQIRDLTA ISDRNGNRIT VLRDEHGTPT GLEHPAYRVA VDTIASAAGP RVSGLRLLGA RETGSDMVMK RFSYDGQGRL TGVINSSDVP FSYEWNDTDR ISAWTDQAGY RYGYHYDAIG RVVRGQGQFL AGSFTYDPAH RTTVHTNSLG HVTTFVYDEN GHVCSETDAL GHTTLTETDR YGMILSRTDA LGTRTTLVRD EAGNVRRLVA TDGATAELEY NHLHQVVAAT GRGGATWRRA HDDRGNLVSA TDPAGGVTEF EYTAEGVLAA TTDPLGAVTR FTTDAAGLPV TVLDAVGSLT RVERDFAGRV VRLTDPMGAV TAFEWNAEGL PVSRTLPDGA RTTWRHDDGG RLLEVVDAIG ATTRFEPGPM ETLTARTGPD GVRHNFTYDS ELKLVAVTNP HGAAWTYTYD AAGRLTGESD FIGRRLGYEY DVAGRLCARI TGTGQRLTLD RDATGRIVAR HTPEGDYGYT YDSAGRLVVA TGPNSELSYE YDTLGRLLAT TTDDRTMRYS YDLAGRLIRR TTASGVESTW TYDSAGRAVG LDTGGDRLEF GFDASGREIE RLIGSEVWLT RSYDVVGRPI AQSVGRGRRH HDQAAQTDLI IAGSRTGKAV QEVLQRSWVW RQDGVPEEIR DSLRGTSRIA SDAAGRVTAI SAHSWSESYA YDAFGNLTVQ DDAAGPLNLP ATAAGEGAGF ATRTLIRRSA RARYEHDQAG RLTRSVRRTL DGRSKVTQYV WDSEDRLVHV ITPENGTWHY SYDPLGRRTA KTRFADDGTV VDRVTFLWDG PRLAEQSVQG PESFTVTLTW DYDPGTFRPA TQRRRNRLGD ADQTVIDEAF HAIVADLVGS PTELVTPDGR VVWHTTTSLW GRTIGTSAES GVDCPLRFPG QYHDDESGLH YNLNRYYDSE TAAYLTPDPL GLVPAPNDHA YVPNPLTVSD PLGLSYEGPN GQTMYPNNMP GTLDTELAQA DRLGVVVSSP GTSEFDSAIA SGTVKWAVKD DGSIVVMPKF VNGQEISHSV LTRGASVQAA GEAEIAGSGA DGYFGLDIND HSGHFFEPGW NVASIGKDAF SGAGVLFP
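
The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is only an illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon (the gene sequence listed above ends in TGA). The function names are hypothetical.

```python
# Values copied from the Gene Information table of this record.
START_BP = 9996041
END_BP = 10000627
GENE_LENGTH_BP = 4587
PROTEIN_LENGTH_AA = 1528


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] in 1-based coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


def record_is_consistent() -> bool:
    """True when coordinates, gene length, and protein length agree with each other."""
    return (
        span_length(START_BP, END_BP) == GENE_LENGTH_BP
        and expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    )


if __name__ == "__main__":
    print("record consistent:", record_is_consistent())
```

Under these assumptions the reported gene length follows from End bp minus Start bp plus one, and the protein length is the number of codons minus the stop codon.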