Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3868 |
Symbol | |
ID | 8335221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4377171 |
End bp | 4382381 |
Gene Length | 5211 bp |
Protein Length | 1736 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644956999 |
Product | Tetratricopeptide TPR_4 |
Protein accession | YP_003114602 |
Protein GI | 256393038 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0314468 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATGG GCGCGGGTCT GGGTGACAGC CATTCGGCGG TGCTGCAGAT CGACGGCGGG CTGGTGCGTT TCAAGGCCGG CGCGGGGTCC GTCGCGCAAG AGCACAGTGG CCTGACGTAC CGCGCGCGTG AAGCGCTTCG GGAGCTGCGG GAGCTCCGCC AAGGCACCGA TGCGGTGTGG TGGGAGCTCG GGCTCGGGCT CGGGGAATCC TTTCTGGACG GGCCGGTCGG GGTTGCGCTG GCGGCCGAGC TCGGGCGCGC TGCGCAGACC GAAGTACCGT TGCGGCTCGC GCTTGACATC GCCGAGCCAG CATTGGTGAA CGTGCCGTGG GAAGCAATGG TGCTCCCGGA GAGGTGTGAT CGGTCGCGGC GACAGATCCC GCTGGCGCAT GACAGTGCGG TGCAGGTTTT CCGGCTCGTG GAACAGGAGT CTGCGACGGG GACGGCGGTC AGAGCGACAC CCGGAGTGGT AGGCAGGACC GCAGGCGGCG TTGCCGGCGG GCCGGCAAGG CTGCCGCGGG ATGCCGGATC AAACGGAGCG CCGCTGGAGG TGTTGGTGGC AATCGCGTTC CCGGAGCGCA GCGGGCTGGA CCGGCTGGAC TACGAACGGG AGCTTGTGCT GGTCCAGCGG GCGATGCGGC CGGCGATTGA GACTGGCATG GTGTCGCTGC GGCTACTCAC ATGGGGGACT GCGGACTCGA TCGGCGCCGC GCTGCGTGAG CGGCCTGCAG ACATCCTGCA CATCTCGTGC CACGCCGCGC CGGGCGTCCT CGCTCTGGAG ACGGTCGACG GTGAAGAGGA CTTCGTGGAC GCCGGGACGC TGGTCTCGCG AGTGTTCCCG CAGGACCGAC CAGTCCCGAT GGTTGTGCTG GCGGGCTGTT CCAGCGCGCT GGGCACGGTG GACGACGTAC AGGGGCCACT ACCTGGCCTG GCTCACGCGC TCGTGGCTCG AGGAGTCGAC GCGGTGGTGG CGATGACCGC GGACATCTCG GACGACTGTG CCGTCGAGTT CACTGCGGGA CTATATGCGA CGCTGGCGCA GGATCCAACG TTGGCGCCGC TAGACGTCGT GACCCAGACC CGGGCTGCAT TGGCACCTTT GCCACCGCGC GAGCCGGGGA TGACGGAGCC GCAACAAGAA CCGGCCCGAG AACGGGGGCG AGAGCAGGCT GACAGAAACG GCCGAGAGGC GAGGACGACG CTGATTCCAG CCTTGTTCCT CAACTCGGCG GGGGTCGGGC TGAGGAATGT GCAGGCTGGG AGTGCTTCCG TGCCCGGGGC GGCGTCAGAG CCGGGGCTAG GATCTGGGGT TGGGTTCGGC GCAGGACTGC GGGCCAGCTC GCGCTCGGAA GTCTTGGGTG GGTGGAATGA GGGCTCTGGC TTTGTCGGGC GGCGCACGGA GCTGCGCGGT CTTCTGCGTA TCTTGCGGTC GGATCTGCCG CGTGTGATCT TGTTCGGGAT GACCGGGATC GGCAAGAGCG CGCTGGCCGC TGAGCTCATT ACGCTCCTCG ACCCGGGGGC GTGGGTGGTC GTGCCCCTGG ATGCAGGGGC GGCTGCGGAC AGTGTGGTCA GCACTCTGCT TGGGGTGTTA AACAGGGTTG GCGCGTTGGG CAATCCGATG CTCGAGTCCT GGCTTTCCGA TCCGGATACG CCATGGATCA GGCGTCTGGA TCTCGTAAGT GAGAACGTGC TGCCGCACAC CCGCGTGCTG CTGATGGTTG ATGGGGTCTC CTGTTCAGGC GATCCGGCGA CGGGAGCGTA CGCAGCAGAG CCTGCGGAGC TGGGTTGGTT TCTGGACGCT TGGTCGCGAC TCGGGCCGAA CGCGGGGCTA CTCGTCACGT GTCGTCATCC GATTTCATTC GAGACGCGTG GTAGGAAGCT GGTGCAGCAT TGTCTACTCG GGCCGTTGTC GGCGTCCGAC ACGCAGACTT TTCTCTATCA GCATGCGCCG CAGCGGAGTG CTCTGTTCGC GAAGCTGCCC GATGCTGGGT TACTCGGCGG CCATCCTGCA GCGCTTGGGC TCGTCGCGAA TGCCGTGCGC AACGCCCAGG ACCCGACCGC GCTGAGCCCA GAGGTGATCC ATCATGCGCT GAACGAGACA GCGCCGGCTC TCGACGAGGT CTTCGAGCTG TTCGCGGGCC TCGATGCCGG GCATCCAGTG CGCCAGCTGC TTGTTGGCGC GTCGGTATAC CGGGGGCCGG TACGGCGTGA GGCCTTGGAG CAGCAGCTTG CCTTGACGGA TCAAGCTGCT ACGAATCCGG AGCGCACGAC GCGGCTGACC GTAGCCTTGG AGAATGTTCT GCGCGAGGCC CAGACCGAGG ACGTATCCGA GCTGTGGTGG GAGGATGGAG AGCCGTTACA ATTCTCTGAA ACGCTGCTCG CGGACCTCGA GGATGCACAG CGACCAGTCG TAGATCCGTC ATTCCCTGAG GTGCTACGTA GCGCGCTGGG CACCGGACTC ATCTTCGGGT CTTATGACAA CCGGTCCGGA GTGGAGAATC AGAACGGAGC GGACGACGGA AACGAGGCGA ACCGCGAGCC GTTCAAGGTT CATCCATGGG TGGCCGCGCA GATTAGTGCG CTGGCCGACC AGGCTGAGGT GGACGCCGCT CGCCGCCGGG CTGCGTCCTA CTGGCGCCGC CGACTGAACA TGGACCTGGC GCGCGGGCAC CTGCCGTCGC TAAGCGAAGA CGCTGAGCGT CTGTTGGAGC AGTTCGACGC TCTAGGCGAC ATGGCCGCAG CCAAGGAGCC GCTGCTGAGA TCGGTGAGCC TGGCGTACCT GCGCGGGACC GGGGGCTACG AGCAGTTGCG AGAGCGTTGC GAGCTGGGCC TCGCGCACCT GCCGCTCGAA CCGAAAGAGA CCGTGGTGTT GCTGCTCCTG CACAGTGTCG CCTGCTACGC GATGGGTGAC GAGGCCACAG GGGCGTGCAG CAGCGAACGA GCTGTCGAGG CAGCACGGAG CCTGCCCATG AACTCGCCCA CTCGAGTGCA GACCGTCATC ATGCATACAC GCAACCTACT GAGACTGCCG CAACGGGCCG GCGAAGTCGC GACACTCATC GATGAGGCTG CGGAGGGCGC ACGGGCCTGC GGATACGACC TCCTGGATGG AGCCGTTGAC CATCTGCGGG CGATGCTGGC AGTCAGGCGC CAACAATTCG ACGAATGCGA CCAGTTCGCT CGCGCGGCCT TGCACACGGT CGCCGGGGTC GCACCTATCT GGTACACCCG GGCGGTGCAG TGGGAGGACG TCGTGGACCT CTCCGAGGCG TTGAGGTTGG GCGAGTTGGC CGGGGCGGCA AGTGATCAGG CCATCCTGGC GGAAGGCCTC GGGATCGCCG CCGACACAGC CGAACTGGGC TGCCTCGGCC TGTGCGCCGC CATAGCGCTG GAGCGCGGCC GGCTCGATGA AGCGGCCGCG TTCATGGACA GGGCACGGCA AGTGGCCGAC CGCACTCCTC TACCCGGCCA GCTTCCGCCG ATTTACGCCC TAGACTCGAC CATCGCCCTG CTCCAAGGCG ATATCTCTCG GGCCGGAGAG CTGTGCCGAG CCGGTGTGAC CTCGGCTCGG GCGTTCGATG ACCGAGTGAG CGAGACCCGG TGCCTTAACC TGCTGGGCCA GATCGCCATG CACGGCGGCC AGGCGAACAC CGCGCTGCGC GAATTCGGCG CGGCGTACGC CTGCGGAATC GGCACCGTGG CGGACGACCA GGCCAGGGTC TCAGCAGTGT TCTGCGCCTT CGTCTACACG ACGATGGAGG ATCCGCGTGC GCGGGACTGG CTGGACAAGG TCGGCGACAC GCTGCCCACT GATCCGGGGG CAAACTCCCT CGCACAGTAC ATGATCGGCC AAACCGCCCT GGTCGAGGGC GACTTTGCAA CAGCCACGCG GCACGCGCAG GACATGCTCC GCGTCATCGA AGAGGGCGAC TATCCTCAGA TCTCTTGCAT GGGGAAGCTG CTACTAGCGC AATGCGCCGC CGGGCGCGAC GACTGGGACG CCGCTGGAGA ACTGTTCAGC GAGGCGATCG ACGCCTCCGA GGTATGCGAC CAGCCTTTGC TAAGGTTCGA CGTGCTGCTC AGCGGAATCG AGATGGTCCT CACGATCGAC GAGCCGGTGA GTGTTGGCGA CCTCGACGCG CTCGATGAGA TGGCGCAGGA GGCAGAGGCG ATTGCGCAGG GGGCGGATGC GCCAGGACTG ATCGCCCGGG TCTTGCAGGC CCGGCAACTG ATCAGGGGCC GCAGACGCGG CCCGCTAGGG TCCGGCTTTG CCGGTCCGCT GCTCAGGGCG AGCCGTGATG TCGGGGACGC GGAGGCCGTC GAGAACGCAA TCGTGGCGCA GTTCCACGAG GCGCTCAACG CTGAGCGGTT TGAGGAGGCC TCGGAGCACC TGGCGTGGCT CAAGTCGGTC GTCGAGGCCG GCAACGAGGG CAAAAGGCTC TTGCTCGGGT TGATGGAGGC CTCGCTTGAC CTCGAACAAG GGCGTCTCAC CGAAGCCGAG GAGCGTATGC TGCAGGTCTA CGCCGCAGCC GACGCGGGCG GACCGGACAT AGCCGAACTC GCCGCGATGG CTGCGGGCCT GCTGGCCGCG ATGGCCGCAG AGACCTCGGA TCTCCACAAG GCAGAATGCT GGCAGGCGAC TGCGTTAAGG CATCTAATCA CGGGCACCGC AGAGTGGTAC CAGGAACTCG ACGGGTTGAA AGATATGGCC GAGGAACGCG CGGCAGCCGA CCCCGCCGCC TGGTGGCAGA ACCTGGAAGC GATCGCCGAC GACGCTGTGG CCCGTCACCC CGCGCTTCCC GACCTCGCCG CGGCGGTCCA CCAGATCTTG GCCAAGATCG CGGATGACCA CGGGCCCGCA GCACAGCGGG AACGCTGGCT GCGGCAGGCG TTAAGTCTGT GGCCGACGGA CGTGCCGGAG CACTCCGATT ACGGGCCCCG GGCGACCAGG CACGCTTTGG CCCGGCTGCT CGCGCGCGAC GAGGAGACCC TCGACGAGGC GACCTCGCTA TTGATCTACA ACCTGTGCCG TTCCGAGGAG GACGGAAGCG GCTACCACGA CCCATTCGAC GAACTTCTTC TCGGCGTGCT GCGGCAGTAC TGGGGCGAAC CGGTATTTGC TGCGAAGCTT GCCACGCAGG CAGACGCCGA GCAGCAGGCT CGGATTCTCG CCGCCACGAG TGACTGGGGA AGTGCTTCCG ACCAGATGTA A
|
Protein sequence | MDMGAGLGDS HSAVLQIDGG LVRFKAGAGS VAQEHSGLTY RAREALRELR ELRQGTDAVW WELGLGLGES FLDGPVGVAL AAELGRAAQT EVPLRLALDI AEPALVNVPW EAMVLPERCD RSRRQIPLAH DSAVQVFRLV EQESATGTAV RATPGVVGRT AGGVAGGPAR LPRDAGSNGA PLEVLVAIAF PERSGLDRLD YERELVLVQR AMRPAIETGM VSLRLLTWGT ADSIGAALRE RPADILHISC HAAPGVLALE TVDGEEDFVD AGTLVSRVFP QDRPVPMVVL AGCSSALGTV DDVQGPLPGL AHALVARGVD AVVAMTADIS DDCAVEFTAG LYATLAQDPT LAPLDVVTQT RAALAPLPPR EPGMTEPQQE PARERGREQA DRNGREARTT LIPALFLNSA GVGLRNVQAG SASVPGAASE PGLGSGVGFG AGLRASSRSE VLGGWNEGSG FVGRRTELRG LLRILRSDLP RVILFGMTGI GKSALAAELI TLLDPGAWVV VPLDAGAAAD SVVSTLLGVL NRVGALGNPM LESWLSDPDT PWIRRLDLVS ENVLPHTRVL LMVDGVSCSG DPATGAYAAE PAELGWFLDA WSRLGPNAGL LVTCRHPISF ETRGRKLVQH CLLGPLSASD TQTFLYQHAP QRSALFAKLP DAGLLGGHPA ALGLVANAVR NAQDPTALSP EVIHHALNET APALDEVFEL FAGLDAGHPV RQLLVGASVY RGPVRREALE QQLALTDQAA TNPERTTRLT VALENVLREA QTEDVSELWW EDGEPLQFSE TLLADLEDAQ RPVVDPSFPE VLRSALGTGL IFGSYDNRSG VENQNGADDG NEANREPFKV HPWVAAQISA LADQAEVDAA RRRAASYWRR RLNMDLARGH LPSLSEDAER LLEQFDALGD MAAAKEPLLR SVSLAYLRGT GGYEQLRERC ELGLAHLPLE PKETVVLLLL HSVACYAMGD EATGACSSER AVEAARSLPM NSPTRVQTVI MHTRNLLRLP QRAGEVATLI DEAAEGARAC GYDLLDGAVD HLRAMLAVRR QQFDECDQFA RAALHTVAGV APIWYTRAVQ WEDVVDLSEA LRLGELAGAA SDQAILAEGL GIAADTAELG CLGLCAAIAL ERGRLDEAAA FMDRARQVAD RTPLPGQLPP IYALDSTIAL LQGDISRAGE LCRAGVTSAR AFDDRVSETR CLNLLGQIAM HGGQANTALR EFGAAYACGI GTVADDQARV SAVFCAFVYT TMEDPRARDW LDKVGDTLPT DPGANSLAQY MIGQTALVEG DFATATRHAQ DMLRVIEEGD YPQISCMGKL LLAQCAAGRD DWDAAGELFS EAIDASEVCD QPLLRFDVLL SGIEMVLTID EPVSVGDLDA LDEMAQEAEA IAQGADAPGL IARVLQARQL IRGRRRGPLG SGFAGPLLRA SRDVGDAEAV ENAIVAQFHE ALNAERFEEA SEHLAWLKSV VEAGNEGKRL LLGLMEASLD LEQGRLTEAE ERMLQVYAAA DAGGPDIAEL AAMAAGLLAA MAAETSDLHK AECWQATALR HLITGTAEWY QELDGLKDMA EERAAADPAA WWQNLEAIAD DAVARHPALP DLAAAVHQIL AKIADDHGPA AQRERWLRQA LSLWPTDVPE HSDYGPRATR HALARLLARD EETLDEATSL LIYNLCRSEE DGSGYHDPFD ELLLGVLRQY WGEPVFAAKL ATQADAEQQA RILAATSDWG SASDQM
|
| |