Gene Information |
Locus tag | Caci_0597 |
Symbol | |
ID | 8331925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 687556 |
End bp | 690816 |
Gene Length | 3261 bp |
Protein Length | 1086 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644953749 |
Product | WGR domain-containing protein |
Protein accession | YP_003111375 |
Protein GI | 256389811 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
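The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 687556
END_BP = 690816
GENE_LENGTH_BP = 3261
PROTEIN_LENGTH_AA = 1086


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 690816 - 687556 + 1 gives 3261 bp, and 3261 / 3 - 1 gives 1086 residues, matching the Gene Length and Protein Length rows.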
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.601967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCG TCGTGGAATT GCCGAAGGCC TACCAGAAGG TCAAGCCCGT CGAACGCGGG CGCTGCCGCC CCGACCAGGT CCCGGCGGTC GACGCCGCGG CGATGCAGCA ACTGTACGAA CGCCGCCGGG GCAGCGTCGC GCACCACCAG ACCAAGATCG ACGCCGCCAT GAGGAACCCG GCCAGCGACA AGGACTTGGT CTCCTCGATG AAGGGCGTCG ACTTCGGCGA CCTGAGCACG GGGACGCCGC TGCAAGCTGC CGTCGCCGAA GCCGCGCTCG TCACCGTGGA GGACTTCGGC GACGACGCCG CGATCGGTGT CTGGCATCAC ATACGCGGCG TGCCCTTCGC GCTGGAAGCC TTCGTCGACT ACTGCCATCT GGACTCCTGG CCTCGCGTCG GATGGTTGCA GCAGATCCCC GTCCTGCTCG ACCGGATGGG AATGGCGAGG CGGCTGCTGG ATCTGGTGCG CGCGGCGCCG GACGACGTGC GGGCCGACAC GATGGCCCTC GCCGAGCGGC TGCGCGCGGA GGACTTGGTC GGCTGCGGGT TCGCCCGGAC TGTGGCGACG TTCCTGTTCC CCGAGCGGTC CGACTGGCTG GAGGAAGACC TGGCGGCTGC TGGCAGCAGA GAGCTTCCCG CTGCTTCGTT GATCCCCTCG GTCGCCACAC CCGAGCAAGC CGCGGCGGTC GGGGAGATGA TGCGCGCCGG TGCGAAGTGG ACATGGGATG GCGATCCCGC CATCGAGGCG ACGTTCCTGA CGGTCGCCGG CGACCAAGCC CTCCTCTTCC TGCTTGCCTG GCACGACGGC CAGTTCACCG GCACCAAGCG TCTGACTGCC GCCCGGAGAA GAGTTCTGGA TCTGATCTCC CGGATCCCCG GCGATGCGGC CGTCCGGGCG CTCGCTGAGC GGGACACCGG AAACCCGGAG ACCCTGGCGT ATCTGCACGC CGCCGCCGAG CGCTTCCCGG AGAGCGCCCT GCGCGTGTTG TCCGCGATGG AACCGACGCC GGCGACGACG CACGTCCTCG CCGCGCTGAA TCGGACCGAG CCGGCCGGCG CGGCTGCGAC AGAGATACAG ATCCCTGCCA TCCTGTCCCG GCCGCCGTGG TCGGACCCGA AGGCCAAGCG GCCGAAACCG CTCGTCGTCG AAGGGCTGAC GGCGCCGTCG GACGTGTCCG CCGCCTGGCT GCCGGGCGAG CGCGAGCGCT GGCTCGGCCG GGGCGCCGGC TGGGAGCCGA GCCAGGGCTG GGCCGCGATC GCCGAGCAGA TCGCCGTGTG GCGCACGCAG CCGGCGAAGC CCGACGACCC CCTGACGCCG GTGGCGCTCT ACTTCGCCGC CTACGCCTCG CCGGATCTGG TGCGGCCGCT GCTGGCGGAC GGCTGGAAGC CGGCGCTGCG CCGCGCCGAC GACCGAGCGC TGGCTTTCGT CGCGCGCTAC GAAGTGCTTG CGCTGCCTGC GGTTCAGGCC ATCCGCGGAC TGCCGGCCGA CCGGGGACGG CTGCTCCAGC CGTTCGTGAG CGCGGACGTC GCCACCTCGA TGGTCGAGGG GATGACGCGA CGGTGCAGCG TGGCGCGCGG CGCGGCCCTG GGCTGGCTGC GCCGCCACGC CGAGCCTGCC GTGGGCTTCC TGCTGCCCGC CGCCTTGGGC AAGGCGGGTC CGCAGCGGCG CAACGCTGAC GCCGCGCTGC GGTGGCTGGC GGCGGAGGGG ACCGACGTCG TGGGCGTCGC CATCGCGGCG CACGGCGAAG CGGTCGGCGT GGCCGTCAAG GAACTGCTGA GCGAGGGCGG ACTCGGGACC 
TATCCGCGCA GCATGCCCGA GACGCCGATG TGGGCCATGC CTTCGGTGCT GCCGGCGATC CGGCTGAAGG ACGGGACGGC GACGCTGCCG GCGCGTGCGG CGCAGACCGT GGTGGAGATG CTGCAGATCT CCAAGCCCGA GGCGCCGCAT CCGGGACTGG AGGTCGTCAA AGAACTCTGC GACGAGCCCT CGCTGGGCGA GTTCGCCTTC GCCCTGTTCG AGAACTGGCG CGCGACCGGC TCCGACACCT CGCACCGGTG GGCCTTCGAC GCCATGGGAC TGCTCGGCGA TGACGGCACC GTGGCCCGAC TGGAGCCGCT GATCGGTCCC TGGACCACCG CGCGCGACTA CGCGCTCGTC GGCGATGCGC TGACGGTCCT GGGCATGATC GGCGGCAGCC GGGCCCTGGC GGCGCTGCAC ACCGTGGTCC AGAAGGGGCG GCGCAAGCCG GTCCGGCGCC GGGCCGCGGA GCGGTTCCAG GACGTCGCCG CCTCGCTTGA CCTCGCCGCC GACGACCTCG CCGACCGCGT GGTCCCCACC CTCGGTCTGG GCGCCGACGG CACGCTGCGG CTGGACTACG GCCCGCGCTC GTTCGGCGTC GGCTTCGATG AGCTGCTGCG GCCGCGGATC ACCGACGAGT CCGGCAAGTT GCTCCCGCGC ATGCCCCGGC CGGTCAAGAA GGACGACGAG GAACTGGCGA CGGCTGCCTA TCAGCAGTTC ACGGAGGTAA AGAAGGCCGC GCAAATCATT GCTGTGGACC AGATCAAGCG CCTGGACAAA GCCATGTTCA CGCGCCGCCG CTGGGACCCG GAGGGCTTCG CCGCGCACAT GGTCGCGCAT CCGCTGCTGT GCCACATCAC GCGACGGCTC GTCTGGGGCG TCTACGGATC CGACGGTGCG CCGATCGGCT CCTTCCGCAT CGCCGAGGAC CTGAGCTACG CCGACGTCCA CGACGCGCAC TACGAGATTC CCGATGGCGC GACCATCGGC GTCGCGCATC CCGTGGAACT CGGCCCGGCG GTGGCGGAGT GGAGCGAGGT CTTCGCCGAC TACGAGATCC TGCAGCCGCT GGACCAGCTG GGCCGTCCGG CGTTGGAGCT GACCGCCGCC GAGCGCGCCA GCTGGATGCT CGAGCGGTTC CAGCGTGTGG TGCTGGACGC GAACCACATC GCCGCGTTGG AACCGCGCGG CTGGTACAGC GGTCCGGTGG GCGATCCGCC GGTGTGGTCG CGGTTCCTGC GGCCGCTGGG GGAGGACGAC CGGTACCTGA TCGTGGACAT CTCGCCCGGG TTGAAGGGCG GCGTGGCGGC GGGCTCGGGC TACCAGAGGA TCGAGCGGGT CTGGCTCAGC GTGGCCGGCG AGCAGGCGTC CCCCGTTTTG CACGCCGTGC CGCTGTCCGA GCTCGATCCC GTTCCGGCTT CGGAGATCCT GCGGGATCTG ACCGAGGTGA CCGGACGATG A
|
Protein sequence | MSGVVELPKA YQKVKPVERG RCRPDQVPAV DAAAMQQLYE RRRGSVAHHQ TKIDAAMRNP ASDKDLVSSM KGVDFGDLST GTPLQAAVAE AALVTVEDFG DDAAIGVWHH IRGVPFALEA FVDYCHLDSW PRVGWLQQIP VLLDRMGMAR RLLDLVRAAP DDVRADTMAL AERLRAEDLV GCGFARTVAT FLFPERSDWL EEDLAAAGSR ELPAASLIPS VATPEQAAAV GEMMRAGAKW TWDGDPAIEA TFLTVAGDQA LLFLLAWHDG QFTGTKRLTA ARRRVLDLIS RIPGDAAVRA LAERDTGNPE TLAYLHAAAE RFPESALRVL SAMEPTPATT HVLAALNRTE PAGAAATEIQ IPAILSRPPW SDPKAKRPKP LVVEGLTAPS DVSAAWLPGE RERWLGRGAG WEPSQGWAAI AEQIAVWRTQ PAKPDDPLTP VALYFAAYAS PDLVRPLLAD GWKPALRRAD DRALAFVARY EVLALPAVQA IRGLPADRGR LLQPFVSADV ATSMVEGMTR RCSVARGAAL GWLRRHAEPA VGFLLPAALG KAGPQRRNAD AALRWLAAEG TDVVGVAIAA HGEAVGVAVK ELLSEGGLGT YPRSMPETPM WAMPSVLPAI RLKDGTATLP ARAAQTVVEM LQISKPEAPH PGLEVVKELC DEPSLGEFAF ALFENWRATG SDTSHRWAFD AMGLLGDDGT VARLEPLIGP WTTARDYALV GDALTVLGMI GGSRALAALH TVVQKGRRKP VRRRAAERFQ DVAASLDLAA DDLADRVVPT LGLGADGTLR LDYGPRSFGV GFDELLRPRI TDESGKLLPR MPRPVKKDDE ELATAAYQQF TEVKKAAQII AVDQIKRLDK AMFTRRRWDP EGFAAHMVAH PLLCHITRRL VWGVYGSDGA PIGSFRIAED LSYADVHDAH YEIPDGATIG VAHPVELGPA VAEWSEVFAD YEILQPLDQL GRPALELTAA ERASWMLERF QRVVLDANHI AALEPRGWYS GPVGDPPVWS RFLRPLGEDD RYLIVDISPG LKGGVAAGSG YQRIERVWLS VAGEQASPVL HAVPLSELDP VPASEILRDL TEVTGR
|
| |
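The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration rather than the pipeline that produced the record: it uses only the standard library, builds the codon table from the standard amino-acid assignments (which translation table 11 shares for non-initiating codons), and runs on the first 30 bases of the gene sequence instead of the full 3261 bp. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses these same amino-acid assignments; it differs
# from the standard code only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    prefix = "ATGAGCGGCG TCGTGGAATT GCCGAAGGCC"
    print(translate(prefix))  # MSGVVELPKA, the first block of the protein
    print(translate("TGA"))   # *, the final codon of the gene sequence
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein sequence fields. Because this gene begins with ATG, the table 11 rule that alternative start codons are read as methionine does not come into play here.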