Gene Information |
Locus tag | Caci_1412 |
Symbol | |
ID | 8332751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 1605167 |
End bp | 1606837 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644954560 |
Product | type III restriction protein res subunit |
Protein accession | YP_003112176 |
Protein GI | 256390612 |
COG category | [K] Transcription; [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
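The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that adds no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(1605167, 1606837)
print(length)                     # 1671
print(protein_length_aa(length))  # 556
```

Both results match the Gene Length and Protein Length fields listed in the record.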
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00105152 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0540272 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGGTG GGTTCCTCTC AGCTGAGGCG CTGCTCGCCG CTGGCCCACG CGGGCTGCCG CATGCCGTCG AACGCGCGCT ATGGCATCTC GGTTTCTCCG ACGTTCGAAT CGTCGACGGC TCCGGGGACG ACGGCGCGGA CCTTCTCGCC GTTCGTGATC GGGAGCAGTG GGTCTTCCAA TGCAAGTGGT CGAGTAACCG CGCGATCGAT CGGCAAGGCG TCGATGACCT CGAGCGCGCA CGTCACACGT ACCGTGCCGA CAAGGCCGTA CTGGTCACGA ACATCGGCTT GAACCGCTCC GCCGAGGAGA GGCGAAGTGC CCTGGAGTCC ATTGGCATCA ACATCACGAC GTGGACCGGC CCGACCCTCG CGTCAATCTG GGAGCGGATG CCGGTTCGGG TTCCAGCCAC ATTCGACCTT CGGGACTACC AAGTCACGGC GGCCAACCGA GTGGAGGCCG ACTTGCGTGA TCACGGAAGC TCCCTGCTCG TCCTCGCCAC TGGTTTGGGC AAGACGGTCA TTGGCGGCGA AGTGATCAGA AGGCACCTCG CCGATCGGCC GGACGCGGCC GTGCTCGTCG CCGCGCACAT GAAAGAGCTC GTCGAACAAT TGGAACGGGC ACTCTGGCGG CATCTAGGCA AGGACGTCCC CACCCGCCTG GTCACTGGAG ACCACAAACC TCCAGCCCTC GACGGTGTCG TGGTGGGGAC GGTGGAGTCA GTGTTGGGCC TTGTGCGCTC GGGAAGACTC ACCCCGTCGC TGGTGATGAT CGATGAGACG CATCACGTCA GCGAGAACGG AAGATTTGCG GAACTGCTTG ATCTGTGTGG TGATGCCGCC AGGTTTGGCG TGACAGCCAC GCCGTGGCGC GGGGACAAGT TCGACATCAC GTCCCGCTTC GGGCGCCCAA GTTTCAAGAT GAGCATCGCC GAAGGCATGT CCGCCGGCTA CCTCTCAGCG GTCGACTACA GGATCTTCGT CGACAACATC GACTGGGAGT TCGTACGGCG AGCGAGTGAT CACCAGTACT CCATCAAAGA GCTGAACCGA CAACTATTCC TGCCGCAGCG CGATGAGGAG ATCTTGGAGT TCTTCCGTAC CGCGTGGCGA GAGACCCGCG ACCCAAGAGC CATCCTGTTC TGCCAGACCA TCGAACACGC GGAGCACATC GCGAAACTCC TCGCCACAGC CGACTCCGCC TGGCAACGAG CCACATTCCT GCACAGCGGG TTGTCTCGGC AAAGGCGCCA GATCCTGCTC AACGAGTTCC GACTCGGCCG GGTACCGGTG ATCACCTGCG TGGACGTGTT CAACGAAGGT GTCGACGTAC CCGACGTCAA CTTGATCGGG TTCCTTCGCG TCACGCACAG CCGGCGGATC TTCGTCCAGC AACTCGGCCG AGGCCTACGT CTAAGCCCGG GCAAGAAGGA ACTCAAGGTG CTCGATTTCG TCACAGACAT CCGGCGCGTA GCGGCCACCC TTGACCTGCG AAGATCGCTC GATGAATCAG AGTCCGAACA TCTGAGGTTG GCCGCGCCAC ATGCCGGCGT CCGCTTCAGT GACGAGACCG CCGGAAGCCT TCTCGATAAT TGGATCAAGG ACGCGGCTGA TTTGGAGACG GCCGCGGACG AGGTAAGACT GCAGTTTCCT GAGAATTGGG GGATTGACTA A
|
Protein sequence | MSGGFLSAEA LLAAGPRGLP HAVERALWHL GFSDVRIVDG SGDDGADLLA VRDREQWVFQ CKWSSNRAID RQGVDDLERA RHTYRADKAV LVTNIGLNRS AEERRSALES IGINITTWTG PTLASIWERM PVRVPATFDL RDYQVTAANR VEADLRDHGS SLLVLATGLG KTVIGGEVIR RHLADRPDAA VLVAAHMKEL VEQLERALWR HLGKDVPTRL VTGDHKPPAL DGVVVGTVES VLGLVRSGRL TPSLVMIDET HHVSENGRFA ELLDLCGDAA RFGVTATPWR GDKFDITSRF GRPSFKMSIA EGMSAGYLSA VDYRIFVDNI DWEFVRRASD HQYSIKELNR QLFLPQRDEE ILEFFRTAWR ETRDPRAILF CQTIEHAEHI AKLLATADSA WQRATFLHSG LSRQRRQILL NEFRLGRVPV ITCVDVFNEG VDVPDVNLIG FLRVTHSRRI FVQQLGRGLR LSPGKKELKV LDFVTDIRRV AATLDLRRSL DESESEHLRL AAPHAGVRFS DETAGSLLDN WIKDAADLET AADEVRLQFP ENWGID
|
| |
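The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to run basic sanity checks (GC fraction, reading-frame shape). The helper names are hypothetical, and the start-codon check only looks for ATG, which is what this record's gene sequence begins with; translation table 11 also allows other start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def is_orf_like(seq: str) -> bool:
    """True if seq is a whole number of codons, starts with ATG, ends with a stop."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Demonstration on the first two blocks of the gene sequence only;
# paste the full "Gene sequence" field to check the whole record.
head = clean_sequence("ATGAGTGGTG GGTTCCTCTC")
print(len(head))                   # 20
print(f"{gc_fraction(head):.2f}")  # 0.55
```

Applied to the full gene sequence, `gc_fraction` can be compared against the GC content field, and `is_orf_like` against the CDS annotation. The listed sequence starts with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand.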