Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0389 |
Symbol | |
ID | 8331716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 431849 |
End bp | 435232 |
Gene Length | 3384 bp |
Protein Length | 1127 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644953555 |
Product | WGR domain-containing protein |
Protein accession | YP_003111182 |
Protein GI | 256389618 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.489109 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.333615 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCG CCACCAACGC ACCCACCGCA GTGTCGGAAG ACACGTGCCA GATCCCCGGG GCGATGCGCC GCCTCGTGAT TCCCCGCCGG GGCGATGTTC CGAGCGAACC GGTCGCCGTC GATCCGAAAG CCGCCGCGTA TTACGCCGCA TGGTTTGACG AGCAGAGCCC GATCGTCGCC GGGATCGTCG ACCACGGACA CAGCGAAGCC GATCTCGTCG AGGAGTACCG AGCTTCCAAG GGCGACCTCG TCATAGCCAC GCCGCTTGCC GCAGCGGTCG CGGCGGCGAT GACGATCTCG CAGAGCACGC CGGCGTACGA GAACCCCGCC GGCATCGTCG ATGCCTGGAT CGCCACGCGC GGTCTCGCAT TCGCCGCGGC CGCCGCTGTC GAGCTGTTCG GCGTGAAGAC CTCGGGCCGC ACCTCCAAGG GAGAGCAAGC TTCAGCGTGG CTCTGCCGCA AGGACGGCGC CTATGAAGAC ATACCTCATG GAATCGCCGT CCGCGTGCGC CAGCACCTGG CATCGGCGGG GGAGCAGGAC TACGCGGAGG TCGCCGATGT CCTTGCGCGC TACCGGCTAT CGCTGGCGTC CCCGACGACC TCCGGTCAAG TGCTGTCGCG CATGCTCACG TCCTTCCTGA CGCCGGAGAA CGTCGACTGG GTCGAGAGCG ACTTCGCGTT CGTCTCGCGG GCGCATTACT CCGCGTGGCT CGCGCTCGCC TCGATCACGA CCGAGGAGCA AGTAGCCGCC TTGCCGGCGC AGAAGATCTC GGCGTGGGGC ATCGCCCGAG AGCTCAGTGC GCTGTGGACC GCGCTCGACA ACGTCGGCGT CCATCTCCTC CCGGTCCTGA CCGGATGGCT CGCTCCCGAG TCGGACCCGG AAAGCGTTCA GCGCCTGCTG TCCATCGTCG CGGGCATCGC GTCCGCAGAC GCCTTCCGCT ACCTCGTCGA CAACATCGAC AAGAAGTACT TCACGGTTGC GACGCAGAAC GCTGCCACCA CCTTTCCCCG GCGCGCTCTC CAGATCCTCG GCGACGCCGC GCTCCGCGAC ACTCCGCGCG GGCTGGCTGC CGCCAACGTG CTGCGCTTGC ACATCGTCGC GCACGACGAA CTCGTGCAGG ACGAGCTGGC GAACCTCACT CCGGCGGTAC GGGATCACGT CGAGAAGATC CGCGCCGCCA ACGTCCCCGT GCCCGACGCC GAGACAGACA GCCTGCCGAC GCTGTTCGTC AGCCCGCCGT GGCTGACCGC CGTGAAGCCG GCCAGGCCGG TCGTCATCAC CGGTCTGGAG GCGCCGACCG AGACGGTCTT GGCGTGGCGC GACGGCGAAA AAGAGCAGTG GGCTTCCGCC GAGCTCATCA CGAATCCCCA GTGGAAACTC GAAAGCTGGA AGGACATCGC CGACCGCATC AGCACCGGTG GCTCGGCCGG CTGGTACGCG AACGGCCACT TCGCCGTCGA GGCTCCCGAG GATCTGGTCC GGTCGGTTCT CACCAGCTGG GAGCCCGACT CCTGGCAGTC CGACACCTGG ATCCCAACGC TGGTAATGCG TTTCGGCGCC GACGCGCTGC CGCCGATCCT GAGCTTGGCA CGCAGCGTAC CGGCGTCGGG TGCCGTTTTC CTCGCCCCGT TCGAGGCTCC CGAGGTCGCC CTCCTCGCGG CCGACTGGCT CAGCCGGCTG AAGACCGCGC GACCGTTCGC GCTGGCTTGG CTGACGCGGC ATCCCGGCTT CGCAGCCAGG ACCCTGATAC CGGCGGCGCT CGGCAAGGCG GGCAAGGCGC GCAGTGCCGC CGAGCTGACC ATCCGCACAC TGGCCGCACG CGGATTCACT GCCGAGATCA CCGAAGCCGC CGCAGGGTAC GGCGACGCCG TGGCGCAGGC GGTCGCCGCC ATCGTCGCCG ACGACGGCAC GCTCACGCTG CCCAAGACGA TGCCCGCCGT CCCGGACTGG GCCGAAACCC GGCTGCTGCC GCAGATCCTG CTCAAGGGAC GGCAGACCGC GTTGCCCGAG CAGTCTGTGA AGCACCTGCT CCTGATGCTC GCGGTGTCCA AAGGTACCGA GCCCTACGCC GGAATCCAGA TGGCGAAGGG CATCTGCGAC CCGGCCTCGC TCGCCACGTT CTCGTGGGCG TTGTTCGAGA ACTGGCGCGG TGTCGACTAC CCGGCGAAGG AGAGCTGGGC GTTCGACGCG CTGCGGTGGT TCGGCGACGA CGAGACGGTG CGGCGGCTGT CCCCGATGAT CCGCCTGTGG CCCGGCGAGA ACGGGCATCA GCGCGCGGTC GCCGGTCTGG ACGTACTGGC CGACATCGGC GGAACCGTGG CGCTGATGCA CCTCTACGGC ATCTCGCAGA AGGTCAAGTT CAAGGGGTTG AAGGAGCAGG CGACGCAGCG CGTCACGGAG ATCGCCGACG ACCTCGGGCT CACCGCCGAG CAGTTGGGGG ACCGTCTCGT CCCGGACCTC GGGTTGGCGT CCTCCGGGAC GCTGTCCCTG GACTACGGTC CACGGTCGTT CACTGTCGGA TTCGACGAAC AGCTGAAGCC GTATGTCGCC GATCAGAGCG GCAAGCGTCT CAAGGCTCTG CCGAAGCCGG GAGCGAAGGA CGATCAGGAG CTCGCACCCG CTGCCCACCA GCAGTTCTCC GCGCTGAAGA AGGACGTGCG GACGCTCGCC GCGAGCCAGA TCGCGCGCTT CGAGCTCGCC ATGGTGACGC AGCGCCGCTG GACGTCCCAG GAGTTCGGCG AGTACTTCGT CGGGCATCCC TTGCTGCGAC ACCTGGTCCG GCGCCTGGTC TGGGTGACCT TCGTCGATGA GAAGGTAGGC AGCGCCTTCC GGGTCGCCGA AGACCTGAGT CTCGCCGACA TCGCAGATGA CGAATTCACC CTCGCCGACG ACGCGGTGAT CGGCGTCGCG CATCCGCTGC ACATCGGCGG GGACGTCGCC GCGTGGTCGG ACGTGTTCGC CGACTACGAG ATCCTTCAGC CGTTCGCGCA ATTGGGACGC ACTGTGTTCG CATTCACCGA CGCGGAGAAG GCTTCGACGC GCCTGACCCG GTTCGGCGGC ATCGAGACTC CCGTCGGCAA AGTCCTGGGA CTCGAACGGC GCGGCTGGCG CCGGGGTGCT CCGCAGGACG CCGGCATCCA GGGCTGGATA TCACGCGCGC TCCTCGGCGG GGGTTCGGTC ACCGCGACCC TCGACCCCGG CATCACCGTC GACTACGTCG CCGAGTGGGG CGAGACGCAG CGTCTCCAGG AGGTCTTCAT CAGCCGCCAC GCCGACGGCG AGAGCTACTG GAACGCCTCG GCGAAGTTCC GCGAGCTCGG CAGCCTAGAC GAGATCACGG CCTCCGAACT CCTCCGCGAC CTGACCGAGG CGACGACCCA GTGA
|
Protein sequence | MSGATNAPTA VSEDTCQIPG AMRRLVIPRR GDVPSEPVAV DPKAAAYYAA WFDEQSPIVA GIVDHGHSEA DLVEEYRASK GDLVIATPLA AAVAAAMTIS QSTPAYENPA GIVDAWIATR GLAFAAAAAV ELFGVKTSGR TSKGEQASAW LCRKDGAYED IPHGIAVRVR QHLASAGEQD YAEVADVLAR YRLSLASPTT SGQVLSRMLT SFLTPENVDW VESDFAFVSR AHYSAWLALA SITTEEQVAA LPAQKISAWG IARELSALWT ALDNVGVHLL PVLTGWLAPE SDPESVQRLL SIVAGIASAD AFRYLVDNID KKYFTVATQN AATTFPRRAL QILGDAALRD TPRGLAAANV LRLHIVAHDE LVQDELANLT PAVRDHVEKI RAANVPVPDA ETDSLPTLFV SPPWLTAVKP ARPVVITGLE APTETVLAWR DGEKEQWASA ELITNPQWKL ESWKDIADRI STGGSAGWYA NGHFAVEAPE DLVRSVLTSW EPDSWQSDTW IPTLVMRFGA DALPPILSLA RSVPASGAVF LAPFEAPEVA LLAADWLSRL KTARPFALAW LTRHPGFAAR TLIPAALGKA GKARSAAELT IRTLAARGFT AEITEAAAGY GDAVAQAVAA IVADDGTLTL PKTMPAVPDW AETRLLPQIL LKGRQTALPE QSVKHLLLML AVSKGTEPYA GIQMAKGICD PASLATFSWA LFENWRGVDY PAKESWAFDA LRWFGDDETV RRLSPMIRLW PGENGHQRAV AGLDVLADIG GTVALMHLYG ISQKVKFKGL KEQATQRVTE IADDLGLTAE QLGDRLVPDL GLASSGTLSL DYGPRSFTVG FDEQLKPYVA DQSGKRLKAL PKPGAKDDQE LAPAAHQQFS ALKKDVRTLA ASQIARFELA MVTQRRWTSQ EFGEYFVGHP LLRHLVRRLV WVTFVDEKVG SAFRVAEDLS LADIADDEFT LADDAVIGVA HPLHIGGDVA AWSDVFADYE ILQPFAQLGR TVFAFTDAEK ASTRLTRFGG IETPVGKVLG LERRGWRRGA PQDAGIQGWI SRALLGGGSV TATLDPGITV DYVAEWGETQ RLQEVFISRH ADGESYWNAS AKFRELGSLD EITASELLRD LTEATTQ
|
| |