Gene Information |
Locus tag | Caci_0892 |
Symbol | |
ID | 8332223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 1039198 |
End bp | 1041201 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644954043 |
Product | Tetratricopeptide TPR_4 |
Protein accession | YP_003111666 |
Protein GI | 256390102 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
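The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1039198
end_bp = 1041201

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not
# contribute an amino acid to the protein.
protein_length = gene_length // 3 - 1

print(gene_length)      # 2004, matching "Gene Length | 2004 bp"
print(protein_length)   # 667, matching "Protein Length | 667 aa"
```

The gene lies on the minus strand, so the listed gene sequence is presumably the reverse complement of the replicon between these coordinates; the length arithmetic is the same on either strand.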
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.224705 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.102157 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCGTCC TGCGCCTGGC GTCCAGGGTC CTCCCGGTGG CCGCGCTCGG TCCGGAGAAC CCGCTCCCGC CGCTGCGCAA GCTCGCGGAT CCGCGCGGTG GCCTGGACGT CTCCGAGGCC GACGCCGAAA TGGTCGCCAA CCTCGGCTAC GGCCACGTCG AATCCCTGCT CCCCTACACC TTCCAGGACG GCTACACCCG CGAGACCCCC GACCGCGAGC TGGTCACGGC GGTGTTGGAG AACGACGTGC TGCGCGCCGA GTTCCTCCTC GGCTACGGCG GACGCCTGAT GTCCCTGCGC CACAAGCCCG CCGACCGCGA GCTGCTGCAC TTCCCGCCGA AGCTCCAGCT CGCCAACCTG GGCCTGCGCA ACGCCTGGTT CGCCGGCGGC GTCGAATGGA ACCTGGGCAC CTTCGGCCAC ACCGCCCTCA GCTGCTCCCC CCTGCACGCC GTCCGCCTCA CCCGCGAAGA CGGCACCCCG GTCCTGCGCA TGTACGAGTA CGAGCGCCAG CGCCGCCTGG TCTACCAGCT CGACTGCTAC CTGCCCGACG GCTCCCCGGT CCTGCTGATC CACGTCCGCG TCCACAACCC CTTCCCCGAG GACGCGCCCG TCTACTGGTG GAGCAGCGCC GCCGTACCGC AGACCCCTGA CACCCGCGCG CTGATGCCCG CGACCTCGGC GTACCACTTC TCCTACACCG GCAAGATGCG CCGCATCCCC TTCCCGCGCT GGCGCGAGGC CGACACCTCC TACCCCGGCC GCATCGACTA CGGCGCCGAC TACTTCGCCG AGATCCCCGA CGGCGCACGC CGCTGGATCG CCGCGGTCGA CGCGGGCGGA TCAGGGCTCT TCCAAACCTC GACCGACCGG ATGCGCGGCC GCAAACTCTT CCACTGGGGC ACCGGATCCG GCGGCCGCAA CTGGCAGGAC TGGCTGTCCG GCCCGCGCCA CGAGTACTTC GAGATTCAGG CCGGTCTGGC CCGAACCCAG CTGGAACACC TGCGGCTTCC GGGGCGCCGG TCGTGGTCCT GGGTCGAGGC TTACGGCCTG CTCCAGACCG ACCCCGCCGA CGTCCACGGC ACGAAATGGC CCGACGCCAC CCAGGCCGCG GACGCCGCGA TCGCCGAGCT GATCCCCGCG CACCGCCTGG ACGAAGAACT CGCGACCATG ACCGCGCTCG CCGACAAACC ACCGGAAGAA GTCCTGCACG CCGGAACCGG CTGGGGCGCC CTGGAACGCC GCGCCCTCGG CCGCAACCGC GCGCTGAGCC TGCCCGGCAC GCCCTTCCCC GACGCCACGA TCGGCCGCGA CCAACAGTCC TGGCTCTCGC TGGTCCGCAA AGGCCAGATG CCGACGCCGG CGCCGGCCCT GCCGCCGTCC TCCTACGCCG TCGGCCCGGT TTGGCGCACG CTGCTGGAGA AGGCGACCGC TGACCCCGCA CGCACAACCC ACGCGACCTG GTTCACCTGG CTCCATCTGG GCGTGAACCG CTACCACGCC GGCGACCTGG CCGGAGCGCG CGAGGCGTGG GATCAGTCGA TGGCGCAGGC CGAAACCGCG TGGGCGCACC GGAATCTCGC CATCCTCGAC GCCTCCGAAG ACCGCCTCGG CGACGCCGCC GACCGCTACC TCCGTGCCTG GCAGCTCTCC CCGCGCCTGC GTCCCCTGAC CATCGAGACC CTGCGCGCCC TGATCACCGC CAAACGCCCC GGCGAAGCGC TGGACATCAT CGACCGGCTC AAGGAGCCCG ACCGCTTCGC CGGACGCATC CTGATGCTGG AGTGCCGCGC CGCCCTGGAC 
GCCGGGGATC TGGTGCGGGC ACGCCGGATC CTGGAGACCG GTCTGATCGT CGAGGACGTC CGCGAGGGCG AGGACCCGCT GTCGGACATG TGGTGGGAGT TCCACTCCCG GCAGGCCGGC GGGCTCGGTC CGGTCGTGTC ACGGCGATTG CGCGAAGAGC ACCCCCTGCC GTGCAGCTAC GACTACAGCG TCCGCCAGCT GTAG
|
Protein sequence | MSVLRLASRV LPVAALGPEN PLPPLRKLAD PRGGLDVSEA DAEMVANLGY GHVESLLPYT FQDGYTRETP DRELVTAVLE NDVLRAEFLL GYGGRLMSLR HKPADRELLH FPPKLQLANL GLRNAWFAGG VEWNLGTFGH TALSCSPLHA VRLTREDGTP VLRMYEYERQ RRLVYQLDCY LPDGSPVLLI HVRVHNPFPE DAPVYWWSSA AVPQTPDTRA LMPATSAYHF SYTGKMRRIP FPRWREADTS YPGRIDYGAD YFAEIPDGAR RWIAAVDAGG SGLFQTSTDR MRGRKLFHWG TGSGGRNWQD WLSGPRHEYF EIQAGLARTQ LEHLRLPGRR SWSWVEAYGL LQTDPADVHG TKWPDATQAA DAAIAELIPA HRLDEELATM TALADKPPEE VLHAGTGWGA LERRALGRNR ALSLPGTPFP DATIGRDQQS WLSLVRKGQM PTPAPALPPS SYAVGPVWRT LLEKATADPA RTTHATWFTW LHLGVNRYHA GDLAGAREAW DQSMAQAETA WAHRNLAILD ASEDRLGDAA DRYLRAWQLS PRLRPLTIET LRALITAKRP GEALDIIDRL KEPDRFAGRI LMLECRAALD AGDLVRARRI LETGLIVEDV REGEDPLSDM WWEFHSRQAG GLGPVVSRRL REEHPLPCSY DYSVRQL
|
| |
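The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation in plain Python, applied only to the first 30 bases of the gene sequence above; the function name `translate_table11` and the constant names are hypothetical, not part of any database tooling.

```python
from itertools import product

# Codon order follows the usual NCBI layout: bases T, C, A, G.
BASES = "TCAG"
# Amino acids for the 64 codons in that order ('*' marks a stop).
# Table 11 assigns the same amino acids as the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(seq: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    # Drop the spaces that separate the 10-base groups in the record.
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
prefix = "TTGAGCGTCC TGCGCCTGGC GTCCAGGGTC"
print(translate_table11(prefix))  # MSVLRLASRV
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence should work the same way, since the function ignores whitespace and stops at the terminal TAG.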