Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3971 |
Symbol | |
ID | 8335324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4504691 |
End bp | 4507873 |
Gene Length | 3183 bp |
Protein Length | 1060 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644957086 |
Product | helicase domain protein |
Protein accession | YP_003114689 |
Protein GI | 256393125 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00406642 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.666286 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCATA CCCATACTCC GAAGCCGCCA GCGGTTGTCA CCAACGGCGA CGGAGCCTCC GTCCTTACCG CGGCGGGGAA CTGGCTGTCC GATATGGCTC CTGGATCGCA AGGCTCAGTC GTATCACTGG CCGCCGGATA CCTGAGCATC CACGGCCTGA TGCTACTGAC CGGCCGGCTG AGGGCTCTGC TGGACAATGG GGCAACGCTT CGGCTGCTGT TCGGTGTCGC CCCGACCGGC ACGGCAGCCG TCACCGTCGA TGCTGGGGAC AAGGACATCG CCGACCTGCC CGAGCTGCTG GCCGCGGACG AAGGCGCGCT GCGCGCCGAG ATCGCAGGGA TCCCGATGTC AGCGGCCAAC GCCAGCCGCC TGGCTGACTT GCTCAGCGTG CTGTCCCACA AGGATGTCGA GATCCGACGA TTTGAGCGTG GCTTCTTTCA CGCCAAGGCT ATTGTCGCGG ACTATCCAGT TGGCGCGGTT GCCCTAGTCG GTTCCTTCAA CCTGACTCGT GGCGGCATGC TCAGCAACAT CGAACTCGGT GTCGGTGTCG CTCGTTCCCA GGGCCGGGCG GTGGCCGAAC AGGTCCACGG CTGGTGGAAC CAATCCGAGC CCTACGACCT CAACGAACTC ATCTCCACTC TGTTTCTGCT GGCTCCCATC GAACTCGTGT ATCTACGGAT CCTCTCCGCG CTGTTCGCCG AGGAACTCGC GGTGTGTGCC TCGCCGATCG GGCTTACCGG TTTCCAGGAG GCAGCGGTTG CCAAAGCGCT GCTCACACTT CGCCACCGTG GCGGCGTGCT GCTGGCTGAC GACGTCGGCC TTGGCAAGAG CTATATCATC GCCGACCTCG CCCGTCGTGA AACGATCGCC GAAGTAGGCA CTGCCCGGAT CGCGCTATTC TGCCCCGCGC ACCTGAAACC CATGTGGCTG GCCTACAAGA ACCGTTGGAA TCTGCATGTT GACATCCACA GCTACAACGC TCTGTCTGAT ATGTACAAGA AGGTCCGCCG TGGCGGCGGA GTCTGGCATC AATATTCGAT GATCATCTGC GACGAGTCGC ACTACTTGAA CAATCGCGAC CGCAAGCGTT ACAAAGCACT GGTAGACCTC CTTGCGGCCA AGGGCAGACG CCCAAAGATC ATCCTGGCCA CTGCCACGCC AGCCAACAAC AGTGGCGAAG ATTTGGCAAC GCAACTCAAC CTCGCCTGCC CGTTCCCAGA GACCGGACGT CCCCGCGCCC ATGGCTGGTC GCCGTGGCCG ACCGTGCGCA TGTCCAGAAC TCGGCTGTTC GACCTGTGCC GCCATGCGAC CAAGCTGCCG AAGCCCGTCT TGAGGGACCT CCACGCTGAG ATCGATGCCC TCACCGTGCG GCGCACTCGC CCGTTCATCA AGGCGACATG GGCCAGTCCC GCCAAAAGCC TGCAATTCCC TGTCGTGCGG CAGCACGCCC TCTACTACCA ACTCGGCGAC CAGATGCGCG ACCTGTTCGC CGACGTGCTG GACGCAGCAA CCATCGGGCC GGCGGCCAAG GACGATCAGT TCCGAGCCGC CATGCGAGAC CTGCGCGGAC CGACCGCGCG GGTCAGGCCA CTGACGCTGG CGGCGTTCAT GCCCCAGTGC TACGCACTCG ACGAGGCGCC GCCATTGTGG ACCGATCTCC TGCCGGCCCT GATGAAGATC GCCCTTCTGA AACGGATCGA ATCTTCCACC GCGGCCTTCG CTGTCACCGC CGCTGTCCTG GCTCAACGTA CCCAGGAAGC AATCGACGAG CTCGATCGGC GTGACCGCGT CCGCATTACC ATTGGCCGCG AACGCCGCGA TCGTTTGCAG GACCTCATGG CTGCGCTGCT CGACCAAGGC GCGGACCGGG AGCGCATCGA CGCCATCTTC ACAAACCTGC TAGATGGGGA CAGCAAGCTC GGACCGGCAG GCGGCATCCC GCACTCGATG TACCGTGCCG CGCACCGATT CGACCGCGAA CGCTTGTCGC AGGACCTGCA GGCCGACCGG GACACGCTCG AACAGTTCGC TGCACGCGCC GCCGCTGCGA CGGCCGCTGG CGATCCCAAG GCCGACGCGT ACACCACGTT GCTGGACGCC ATCGCTGGTA AGACGCTCAC CTTCGCTGCG GCCCGGGCCA CGACCGCCGA CCTGGGATTG CGTATCGATG CCCACCTCGG CGCCGGCGGC GCGCCGCACT ACGCTGGACG TTTCGCCACT ATCGGTACGA AAGACCCGCC GACCAAAGCA ACAGTTGCAC GGATCCTGGC TGGCTACTGT CCCAAAACCG CCTCCGCAAC GGAGGCCCTT GGGACTCGCC GCTCCCGTGA CGAGTACGAC TTATTGCTGG CCGGCGACGG AATCTCCGAA GGCGTCAACC TCCAGCAAAC CCGCGTCATC ATCAACTACG ACCTGCCCTG GGCCCCCGGT CGACTCGCCC AGCGCATCGG ACGTGCCGAT CGCATTGGCT CACCCCACAA GGTCATCGAC GTCTACACCG TTCTGCCAGA CCAGGTGCTG GACGCTTACC TGCGGCTAAT GGACACCCTT GCCGCTAAGG CCGAAACTGC CGCCGTCCTG GTCGGCTCCA CCACCGCACT GTTCCCCGGA GCTGCCATTC GGCCGCTGAA CTTCACTGCC ATGTACGAAG ATCTCGTCAA GCCCGACAGC GAGCCCGTCA TCGATGTACC GCCGTCAGAA ACGCGCCGAG CAGTTGCGCG GCGCGCGCAC GATGAACCGA CGGTCGCTCG CCGGTTGGCC GAAGCGACCA CATGGGCCGG AGCAATTCAC CCTGAACCTG CCGATGACCC AGTCGCAGTC TTCTGTTTTG AGCTGCACGG CGTCGAAGGT CAGGCCGCGG TGCCGGTCCT GTGCCTGGTT GGAGCCGGAC GCCGCCTCGG CTTCATCACT ACCGACCCCG AAGTGTGCTT GGCCGAGATC GAGGTTGATC CCACAGACTG GATGGAGCAG GTCGCAGCCG GCACGCATAC CGACCTGCGA GTGCCGACCC CACCTGCTGC CCACCAGCTC ATTTTCGAGC TGTGTGCACG GGCCAAGTCA GACCTGGCGT CCACCTATGG AATCGACCAC GATCACCTCG ACGAACGTCT CCGGCTCGTC GGCTGGATCC TGCGCCCCGA CAAGAACATG GCACGGGCCG CCCGTGACAA GTGCTGCTAC TGA
|
Protein sequence | MSHTHTPKPP AVVTNGDGAS VLTAAGNWLS DMAPGSQGSV VSLAAGYLSI HGLMLLTGRL RALLDNGATL RLLFGVAPTG TAAVTVDAGD KDIADLPELL AADEGALRAE IAGIPMSAAN ASRLADLLSV LSHKDVEIRR FERGFFHAKA IVADYPVGAV ALVGSFNLTR GGMLSNIELG VGVARSQGRA VAEQVHGWWN QSEPYDLNEL ISTLFLLAPI ELVYLRILSA LFAEELAVCA SPIGLTGFQE AAVAKALLTL RHRGGVLLAD DVGLGKSYII ADLARRETIA EVGTARIALF CPAHLKPMWL AYKNRWNLHV DIHSYNALSD MYKKVRRGGG VWHQYSMIIC DESHYLNNRD RKRYKALVDL LAAKGRRPKI ILATATPANN SGEDLATQLN LACPFPETGR PRAHGWSPWP TVRMSRTRLF DLCRHATKLP KPVLRDLHAE IDALTVRRTR PFIKATWASP AKSLQFPVVR QHALYYQLGD QMRDLFADVL DAATIGPAAK DDQFRAAMRD LRGPTARVRP LTLAAFMPQC YALDEAPPLW TDLLPALMKI ALLKRIESST AAFAVTAAVL AQRTQEAIDE LDRRDRVRIT IGRERRDRLQ DLMAALLDQG ADRERIDAIF TNLLDGDSKL GPAGGIPHSM YRAAHRFDRE RLSQDLQADR DTLEQFAARA AAATAAGDPK ADAYTTLLDA IAGKTLTFAA ARATTADLGL RIDAHLGAGG APHYAGRFAT IGTKDPPTKA TVARILAGYC PKTASATEAL GTRRSRDEYD LLLAGDGISE GVNLQQTRVI INYDLPWAPG RLAQRIGRAD RIGSPHKVID VYTVLPDQVL DAYLRLMDTL AAKAETAAVL VGSTTALFPG AAIRPLNFTA MYEDLVKPDS EPVIDVPPSE TRRAVARRAH DEPTVARRLA EATTWAGAIH PEPADDPVAV FCFELHGVEG QAAVPVLCLV GAGRRLGFIT TDPEVCLAEI EVDPTDWMEQ VAAGTHTDLR VPTPPAAHQL IFELCARAKS DLASTYGIDH DHLDERLRLV GWILRPDKNM ARAARDKCCY
|
| |