Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0598 |
Symbol | |
ID | 8331926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 690813 |
End bp | 694190 |
Gene Length | 3378 bp |
Protein Length | 1125 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644953750 |
Product | WGR domain-containing protein |
Protein accession | YP_003111376 |
Protein GI | 256389812 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGCCG GCAGGGCGCT CACGTCCACG GATCAGGACT CTGAGGATGT CTTCGAGCTG CCTGATTCCT ATCAGAGTGC GACGCCCTGG GTCCGTGAAC GGAAGAGGCC GCCCGAGGCG CTGGTGTTCG ACGAGCGCGC GGCCCTGCTC TTGGAACGCC GCCGTCGGGA TCGCGCGGAC TGTCAGGAAG TCGTCGACGA CGCCCTGCGC ACTGACGCGC ACCGCATCGC CGATCCGGTG GGCGTGATGC AGGGTGTTGA TTGCACCGAC CTGGGCTCAC AGACCCCGCG GCAAGCCGCC CTGGTCGCAT CGATACTGAA CCCCTTGCAC TCGCGGGAAC TGTCTCTCAT CGATCTGTGG TGCGAAACCC ATGGGCCGGC CTTCGCCCTC GCGGCTTTCG TCGAGTACTG CCATCTGACG AGTCTGCGAG AGCGCGACCT CATCGGGCGG CTTCGCTGGG AAGTCCTGCG GCTGCTCGAA TTCGTGCGGT CGGTCCCGGA TGAGGACCGC GACGAGGCCG TGGCTGCCGC GCAGCGCTTG CGGGAGGACG AGCCCACCGC GGTCACCCGC ATGCTGACCA CGCTCCTGTT TCCCGAGCGA CCCGCATGGT TCTGGGCCGA CCTGGCCGCC GCGGCCGACG GGAAGCTCGA CCCCTGCGCG TTGTTGCCGT CGGCCACCAC GGTCGACCAG GCGTCCGCGC TCGCCGAGCA GCTGCTGGCG CAGCCGGGGT GGTCCTGGCA CGACGATCAC GCGATCGAGC GGACCTTCCT CGCCGTCGCC GGATCCACGG CGCTGCCGTT CTTGGTGGCC TGGAGCGAAA CGGGCCATGA GTGCCCGTAT CGCGACTCCG GAAATCTGCG ACAGTGCCGG ATCCACCGGA CCTGTTTGGA CCTGATCTCC CGGATCCCGA CCGAGGCGGC GATGTCCGTG CTCATCCGGT GGCTCGGGCG GGACGGCGTC GTCAGCGAGT ATCTGCAGGC TGCGATCAAG CGCTTCCCGC GCCGGGCGTT GCGGCTGCTG GCCGAGACCG AAGCGACCGA GGAGATCACC CGCCTGCTGA CCATGGTCGT CCTCGCCGAT CCCGGCTTCG CGCGAACCGA GGCTCCGCGC CTGAGTCCGG AAGCCCGCGA CCGGATCGAG GCGCTGCTCG GCGGCGGCAC GCCGGATGCT GCGCCGGTGG ATCTGCCCGA GATCCTGGCT GCTCCGCCTT GGCGCGACAA GAAGCGCAAG CGCGGGAAGC CGGTCGTGGT CGACGGGGTC GAGGGTCTGA CGCCGCCGGC CGAGGTCACC GTCGCCTGGC TGCCCGGCGA GCGTGAAGCA TGGCTCGCCA AGCCCGGCGC GGGGTGGGTC CCGGAGCAGG GGTGGAACTC GATCCTGGCC GATATCGCCG AGGAAGAGGG CAGGGAAAGC CTCGCGACCT ACTTCGCCGC GTATGCTCCC GACGATCTGG TGCGTGCGCA GCTCGCCGCC GGCTGGCGCC CGCCGATGCG GCAGGTCACC GACCGGGTCC GGACGTTCGT CGCGAAGCAC GAGCTGTTCG TGCTCCCGAT GGTGCTGTCC CTGAACCGGT TCCCCGCGAT GCGTGCCCAG CTGCTTCAGC CGTTCGCGAG TGCCGAGATC GCCGCGATCA TGGCCGACGG ACTGGTGCGG CTGCGCACCG TGCACGCCTC TGCCACGGCC TGGCTGCGGC GGCATCCCGA CGACGCGGTG CGCGGTCTCG TGCCGGCTGC GGTCGGCAAG CCCGGCAAGG CACGACAGAA CGCTGTCACG GCCCTCCGGT GGCTCGACGC CGACGGCATC GACGTCGTGG CGCTCGCGCA GCAGGAGTAC GGCGAGGACG TCGCCACCGC CACCAAGGAG GTGCTGGCGG ACCGCGGGTT CGAGACCTAC CCTCGCACCG TTCCGGAAAC GCCGCTGTGG GCCCGCGCGG TGCTGCTGTC CGACATTCGC TTGAAGGACG GCCAAACCGC GCTCCCGCAC AGCGCCGTGC AGGCGGTCGT GGAGATGCTG ATGTTCTCCA AGCCGGACGC GCCCTATGCG GGCTTGGAGG TAGTCGCAGA GATCTGCGAC GCCTCATCGC TCGCGGAGTT CGTCTGGTCC CTGCATGGTT TGTGGGCGCA GTCGGGCAGT CACGAGAGCG AGCGCTGGGT GGTCGGCGCG CTCGGAATCT TCGGCGACCA GACCACGGTC GCGCAGCTGG AACAGCTGAT CCGGGACTGG TATCGGGATC CGAAGCTGGA CCTCGTGGTC GCCGGCTTCG ACGCGCTCGC CGCGATCGGC GGCGACCGGG CGCTCACCGC GCTGAACGGC TTCGCCAGGC GTTCGTGGAC CGGCATGCTC AAACGCAAGG CGCAGGCGAC GTTCGAGGAC GTCGCCGCGT CCATGGACCT GACGCCGGAC CGGCTCGCCG ACCGCTTGGT CCCGACGTCG GGGCTGACGG CCGAGGCGAC GTTCCAGCTG GACTACGGCA GGCGGCGTTT CGACGTGTCC TTCGACGAGC TGCTGCGGCC GGTCGTCCGT GATGAGACCG GTACGGTCCT GCGGACGCTT CCGAAGCCCG GCAAGCGGGA CGACGCGGAG CTGGCAGCCG CGTCCTTCAA GCGCTTCGTC ACGCTCAACA GGCAGGTCCA GGACGGGACG CAGGAACAGA TCGCCCGGAT GGAGCGGGCG ATGCTCGAGC GGCGCCGCTG GACGCCGGAG GACTTCGCGA CGTATCTGGT GCGCCATCTC GTGCTCGGCC GCCTCGTGCG GCGGCTGGTC TGGGGCGTCT ATGCCGAAGA CCGCCGGCTG CTCGGCTCGT TCCGGGTCGC CGAGGACCTG AGTTTCGCAG ACGTTGATGA TGCGCGTTAC GAGATTCCGG ACGGTGCCTC GATCGGCGTC GCGCATCCCG TGGACCTCGG GCTCGCCGTG ACGCGGTGGT CGGAAGTGTT CTCCGACTAC GAGATCCTGC AGCCTTTCGA CCAACTAGGC CGGCCGCCGC TGGCGCTGAC CGCGGAGGAG CTGAGCGGCA CGTGCCTGGT GCGGCCGTAC ATCACCCAGA AGGTCCCCGG CGGCGTGCGT ACCGCGGGCG AGATCCGGTT CGGCGGCGTG AAGTTCGCCG GGTTGATGTC CCGGGGATGG CAGCCGGGTC CGATCGGCGC CGGGCAGGTC TGGTCGCGGC TGCTGCGGCC GGTCGGCGAG GGCCGGTATA TGGTCGTGGA CCTCGATCCG GGACTGCCGG CCGGTGCCGT GGTGATGGCG AATCCGCAGG CCGTCGAGGC GGTCTGGTTG AGTGCGACCG GCGACGAGCC CGACTTCGCG CGCCACGCGC TCCCGCTGTC CGACCTCGAC GCGGTGACCG CGTCGGTGAT CCTGCGCGAT CTGGAAGAAG GGACCTGA
|
Protein sequence | MRAGRALTST DQDSEDVFEL PDSYQSATPW VRERKRPPEA LVFDERAALL LERRRRDRAD CQEVVDDALR TDAHRIADPV GVMQGVDCTD LGSQTPRQAA LVASILNPLH SRELSLIDLW CETHGPAFAL AAFVEYCHLT SLRERDLIGR LRWEVLRLLE FVRSVPDEDR DEAVAAAQRL REDEPTAVTR MLTTLLFPER PAWFWADLAA AADGKLDPCA LLPSATTVDQ ASALAEQLLA QPGWSWHDDH AIERTFLAVA GSTALPFLVA WSETGHECPY RDSGNLRQCR IHRTCLDLIS RIPTEAAMSV LIRWLGRDGV VSEYLQAAIK RFPRRALRLL AETEATEEIT RLLTMVVLAD PGFARTEAPR LSPEARDRIE ALLGGGTPDA APVDLPEILA APPWRDKKRK RGKPVVVDGV EGLTPPAEVT VAWLPGEREA WLAKPGAGWV PEQGWNSILA DIAEEEGRES LATYFAAYAP DDLVRAQLAA GWRPPMRQVT DRVRTFVAKH ELFVLPMVLS LNRFPAMRAQ LLQPFASAEI AAIMADGLVR LRTVHASATA WLRRHPDDAV RGLVPAAVGK PGKARQNAVT ALRWLDADGI DVVALAQQEY GEDVATATKE VLADRGFETY PRTVPETPLW ARAVLLSDIR LKDGQTALPH SAVQAVVEML MFSKPDAPYA GLEVVAEICD ASSLAEFVWS LHGLWAQSGS HESERWVVGA LGIFGDQTTV AQLEQLIRDW YRDPKLDLVV AGFDALAAIG GDRALTALNG FARRSWTGML KRKAQATFED VAASMDLTPD RLADRLVPTS GLTAEATFQL DYGRRRFDVS FDELLRPVVR DETGTVLRTL PKPGKRDDAE LAAASFKRFV TLNRQVQDGT QEQIARMERA MLERRRWTPE DFATYLVRHL VLGRLVRRLV WGVYAEDRRL LGSFRVAEDL SFADVDDARY EIPDGASIGV AHPVDLGLAV TRWSEVFSDY EILQPFDQLG RPPLALTAEE LSGTCLVRPY ITQKVPGGVR TAGEIRFGGV KFAGLMSRGW QPGPIGAGQV WSRLLRPVGE GRYMVVDLDP GLPAGAVVMA NPQAVEAVWL SATGDEPDFA RHALPLSDLD AVTASVILRD LEEGT
|
| |