Gene Caci_3971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3971 
Symbol 
ID8335324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4504691 
End bp4507873 
Gene Length3183 bp 
Protein Length1060 aa 
Translation table11 
GC content64% 
IMG OID644957086 
Producthelicase domain protein 
Protein accessionYP_003114689 
Protein GI256393125 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00406642 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.666286 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCATA CCCATACTCC GAAGCCGCCA GCGGTTGTCA CCAACGGCGA CGGAGCCTCC 
GTCCTTACCG CGGCGGGGAA CTGGCTGTCC GATATGGCTC CTGGATCGCA AGGCTCAGTC
GTATCACTGG CCGCCGGATA CCTGAGCATC CACGGCCTGA TGCTACTGAC CGGCCGGCTG
AGGGCTCTGC TGGACAATGG GGCAACGCTT CGGCTGCTGT TCGGTGTCGC CCCGACCGGC
ACGGCAGCCG TCACCGTCGA TGCTGGGGAC AAGGACATCG CCGACCTGCC CGAGCTGCTG
GCCGCGGACG AAGGCGCGCT GCGCGCCGAG ATCGCAGGGA TCCCGATGTC AGCGGCCAAC
GCCAGCCGCC TGGCTGACTT GCTCAGCGTG CTGTCCCACA AGGATGTCGA GATCCGACGA
TTTGAGCGTG GCTTCTTTCA CGCCAAGGCT ATTGTCGCGG ACTATCCAGT TGGCGCGGTT
GCCCTAGTCG GTTCCTTCAA CCTGACTCGT GGCGGCATGC TCAGCAACAT CGAACTCGGT
GTCGGTGTCG CTCGTTCCCA GGGCCGGGCG GTGGCCGAAC AGGTCCACGG CTGGTGGAAC
CAATCCGAGC CCTACGACCT CAACGAACTC ATCTCCACTC TGTTTCTGCT GGCTCCCATC
GAACTCGTGT ATCTACGGAT CCTCTCCGCG CTGTTCGCCG AGGAACTCGC GGTGTGTGCC
TCGCCGATCG GGCTTACCGG TTTCCAGGAG GCAGCGGTTG CCAAAGCGCT GCTCACACTT
CGCCACCGTG GCGGCGTGCT GCTGGCTGAC GACGTCGGCC TTGGCAAGAG CTATATCATC
GCCGACCTCG CCCGTCGTGA AACGATCGCC GAAGTAGGCA CTGCCCGGAT CGCGCTATTC
TGCCCCGCGC ACCTGAAACC CATGTGGCTG GCCTACAAGA ACCGTTGGAA TCTGCATGTT
GACATCCACA GCTACAACGC TCTGTCTGAT ATGTACAAGA AGGTCCGCCG TGGCGGCGGA
GTCTGGCATC AATATTCGAT GATCATCTGC GACGAGTCGC ACTACTTGAA CAATCGCGAC
CGCAAGCGTT ACAAAGCACT GGTAGACCTC CTTGCGGCCA AGGGCAGACG CCCAAAGATC
ATCCTGGCCA CTGCCACGCC AGCCAACAAC AGTGGCGAAG ATTTGGCAAC GCAACTCAAC
CTCGCCTGCC CGTTCCCAGA GACCGGACGT CCCCGCGCCC ATGGCTGGTC GCCGTGGCCG
ACCGTGCGCA TGTCCAGAAC TCGGCTGTTC GACCTGTGCC GCCATGCGAC CAAGCTGCCG
AAGCCCGTCT TGAGGGACCT CCACGCTGAG ATCGATGCCC TCACCGTGCG GCGCACTCGC
CCGTTCATCA AGGCGACATG GGCCAGTCCC GCCAAAAGCC TGCAATTCCC TGTCGTGCGG
CAGCACGCCC TCTACTACCA ACTCGGCGAC CAGATGCGCG ACCTGTTCGC CGACGTGCTG
GACGCAGCAA CCATCGGGCC GGCGGCCAAG GACGATCAGT TCCGAGCCGC CATGCGAGAC
CTGCGCGGAC CGACCGCGCG GGTCAGGCCA CTGACGCTGG CGGCGTTCAT GCCCCAGTGC
TACGCACTCG ACGAGGCGCC GCCATTGTGG ACCGATCTCC TGCCGGCCCT GATGAAGATC
GCCCTTCTGA AACGGATCGA ATCTTCCACC GCGGCCTTCG CTGTCACCGC CGCTGTCCTG
GCTCAACGTA CCCAGGAAGC AATCGACGAG CTCGATCGGC GTGACCGCGT CCGCATTACC
ATTGGCCGCG AACGCCGCGA TCGTTTGCAG GACCTCATGG CTGCGCTGCT CGACCAAGGC
GCGGACCGGG AGCGCATCGA CGCCATCTTC ACAAACCTGC TAGATGGGGA CAGCAAGCTC
GGACCGGCAG GCGGCATCCC GCACTCGATG TACCGTGCCG CGCACCGATT CGACCGCGAA
CGCTTGTCGC AGGACCTGCA GGCCGACCGG GACACGCTCG AACAGTTCGC TGCACGCGCC
GCCGCTGCGA CGGCCGCTGG CGATCCCAAG GCCGACGCGT ACACCACGTT GCTGGACGCC
ATCGCTGGTA AGACGCTCAC CTTCGCTGCG GCCCGGGCCA CGACCGCCGA CCTGGGATTG
CGTATCGATG CCCACCTCGG CGCCGGCGGC GCGCCGCACT ACGCTGGACG TTTCGCCACT
ATCGGTACGA AAGACCCGCC GACCAAAGCA ACAGTTGCAC GGATCCTGGC TGGCTACTGT
CCCAAAACCG CCTCCGCAAC GGAGGCCCTT GGGACTCGCC GCTCCCGTGA CGAGTACGAC
TTATTGCTGG CCGGCGACGG AATCTCCGAA GGCGTCAACC TCCAGCAAAC CCGCGTCATC
ATCAACTACG ACCTGCCCTG GGCCCCCGGT CGACTCGCCC AGCGCATCGG ACGTGCCGAT
CGCATTGGCT CACCCCACAA GGTCATCGAC GTCTACACCG TTCTGCCAGA CCAGGTGCTG
GACGCTTACC TGCGGCTAAT GGACACCCTT GCCGCTAAGG CCGAAACTGC CGCCGTCCTG
GTCGGCTCCA CCACCGCACT GTTCCCCGGA GCTGCCATTC GGCCGCTGAA CTTCACTGCC
ATGTACGAAG ATCTCGTCAA GCCCGACAGC GAGCCCGTCA TCGATGTACC GCCGTCAGAA
ACGCGCCGAG CAGTTGCGCG GCGCGCGCAC GATGAACCGA CGGTCGCTCG CCGGTTGGCC
GAAGCGACCA CATGGGCCGG AGCAATTCAC CCTGAACCTG CCGATGACCC AGTCGCAGTC
TTCTGTTTTG AGCTGCACGG CGTCGAAGGT CAGGCCGCGG TGCCGGTCCT GTGCCTGGTT
GGAGCCGGAC GCCGCCTCGG CTTCATCACT ACCGACCCCG AAGTGTGCTT GGCCGAGATC
GAGGTTGATC CCACAGACTG GATGGAGCAG GTCGCAGCCG GCACGCATAC CGACCTGCGA
GTGCCGACCC CACCTGCTGC CCACCAGCTC ATTTTCGAGC TGTGTGCACG GGCCAAGTCA
GACCTGGCGT CCACCTATGG AATCGACCAC GATCACCTCG ACGAACGTCT CCGGCTCGTC
GGCTGGATCC TGCGCCCCGA CAAGAACATG GCACGGGCCG CCCGTGACAA GTGCTGCTAC
TGA
 
Protein sequence
MSHTHTPKPP AVVTNGDGAS VLTAAGNWLS DMAPGSQGSV VSLAAGYLSI HGLMLLTGRL 
RALLDNGATL RLLFGVAPTG TAAVTVDAGD KDIADLPELL AADEGALRAE IAGIPMSAAN
ASRLADLLSV LSHKDVEIRR FERGFFHAKA IVADYPVGAV ALVGSFNLTR GGMLSNIELG
VGVARSQGRA VAEQVHGWWN QSEPYDLNEL ISTLFLLAPI ELVYLRILSA LFAEELAVCA
SPIGLTGFQE AAVAKALLTL RHRGGVLLAD DVGLGKSYII ADLARRETIA EVGTARIALF
CPAHLKPMWL AYKNRWNLHV DIHSYNALSD MYKKVRRGGG VWHQYSMIIC DESHYLNNRD
RKRYKALVDL LAAKGRRPKI ILATATPANN SGEDLATQLN LACPFPETGR PRAHGWSPWP
TVRMSRTRLF DLCRHATKLP KPVLRDLHAE IDALTVRRTR PFIKATWASP AKSLQFPVVR
QHALYYQLGD QMRDLFADVL DAATIGPAAK DDQFRAAMRD LRGPTARVRP LTLAAFMPQC
YALDEAPPLW TDLLPALMKI ALLKRIESST AAFAVTAAVL AQRTQEAIDE LDRRDRVRIT
IGRERRDRLQ DLMAALLDQG ADRERIDAIF TNLLDGDSKL GPAGGIPHSM YRAAHRFDRE
RLSQDLQADR DTLEQFAARA AAATAAGDPK ADAYTTLLDA IAGKTLTFAA ARATTADLGL
RIDAHLGAGG APHYAGRFAT IGTKDPPTKA TVARILAGYC PKTASATEAL GTRRSRDEYD
LLLAGDGISE GVNLQQTRVI INYDLPWAPG RLAQRIGRAD RIGSPHKVID VYTVLPDQVL
DAYLRLMDTL AAKAETAAVL VGSTTALFPG AAIRPLNFTA MYEDLVKPDS EPVIDVPPSE
TRRAVARRAH DEPTVARRLA EATTWAGAIH PEPADDPVAV FCFELHGVEG QAAVPVLCLV
GAGRRLGFIT TDPEVCLAEI EVDPTDWMEQ VAAGTHTDLR VPTPPAAHQL IFELCARAKS
DLASTYGIDH DHLDERLRLV GWILRPDKNM ARAARDKCCY