Gene Caci_1412 details


Gene Information

Locus tag: Caci_1412
Symbol:
ID: 8332751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 1605167
End bp: 1606837
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 62%
IMG OID: 644954560
Product: type III restriction protein res subunit
Protein accession: YP_003112176
Protein GI: 256390612
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
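
The start, end, gene length, and protein length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1605167, 1606837)
print(length)                     # 1671
print(protein_length_aa(length))  # 556
```

Both values agree with the Gene Length and Protein Length fields of this record.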


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00105152
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0540272
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGGTG GGTTCCTCTC AGCTGAGGCG CTGCTCGCCG CTGGCCCACG CGGGCTGCCG 
CATGCCGTCG AACGCGCGCT ATGGCATCTC GGTTTCTCCG ACGTTCGAAT CGTCGACGGC
TCCGGGGACG ACGGCGCGGA CCTTCTCGCC GTTCGTGATC GGGAGCAGTG GGTCTTCCAA
TGCAAGTGGT CGAGTAACCG CGCGATCGAT CGGCAAGGCG TCGATGACCT CGAGCGCGCA
CGTCACACGT ACCGTGCCGA CAAGGCCGTA CTGGTCACGA ACATCGGCTT GAACCGCTCC
GCCGAGGAGA GGCGAAGTGC CCTGGAGTCC ATTGGCATCA ACATCACGAC GTGGACCGGC
CCGACCCTCG CGTCAATCTG GGAGCGGATG CCGGTTCGGG TTCCAGCCAC ATTCGACCTT
CGGGACTACC AAGTCACGGC GGCCAACCGA GTGGAGGCCG ACTTGCGTGA TCACGGAAGC
TCCCTGCTCG TCCTCGCCAC TGGTTTGGGC AAGACGGTCA TTGGCGGCGA AGTGATCAGA
AGGCACCTCG CCGATCGGCC GGACGCGGCC GTGCTCGTCG CCGCGCACAT GAAAGAGCTC
GTCGAACAAT TGGAACGGGC ACTCTGGCGG CATCTAGGCA AGGACGTCCC CACCCGCCTG
GTCACTGGAG ACCACAAACC TCCAGCCCTC GACGGTGTCG TGGTGGGGAC GGTGGAGTCA
GTGTTGGGCC TTGTGCGCTC GGGAAGACTC ACCCCGTCGC TGGTGATGAT CGATGAGACG
CATCACGTCA GCGAGAACGG AAGATTTGCG GAACTGCTTG ATCTGTGTGG TGATGCCGCC
AGGTTTGGCG TGACAGCCAC GCCGTGGCGC GGGGACAAGT TCGACATCAC GTCCCGCTTC
GGGCGCCCAA GTTTCAAGAT GAGCATCGCC GAAGGCATGT CCGCCGGCTA CCTCTCAGCG
GTCGACTACA GGATCTTCGT CGACAACATC GACTGGGAGT TCGTACGGCG AGCGAGTGAT
CACCAGTACT CCATCAAAGA GCTGAACCGA CAACTATTCC TGCCGCAGCG CGATGAGGAG
ATCTTGGAGT TCTTCCGTAC CGCGTGGCGA GAGACCCGCG ACCCAAGAGC CATCCTGTTC
TGCCAGACCA TCGAACACGC GGAGCACATC GCGAAACTCC TCGCCACAGC CGACTCCGCC
TGGCAACGAG CCACATTCCT GCACAGCGGG TTGTCTCGGC AAAGGCGCCA GATCCTGCTC
AACGAGTTCC GACTCGGCCG GGTACCGGTG ATCACCTGCG TGGACGTGTT CAACGAAGGT
GTCGACGTAC CCGACGTCAA CTTGATCGGG TTCCTTCGCG TCACGCACAG CCGGCGGATC
TTCGTCCAGC AACTCGGCCG AGGCCTACGT CTAAGCCCGG GCAAGAAGGA ACTCAAGGTG
CTCGATTTCG TCACAGACAT CCGGCGCGTA GCGGCCACCC TTGACCTGCG AAGATCGCTC
GATGAATCAG AGTCCGAACA TCTGAGGTTG GCCGCGCCAC ATGCCGGCGT CCGCTTCAGT
GACGAGACCG CCGGAAGCCT TCTCGATAAT TGGATCAAGG ACGCGGCTGA TTTGGAGACG
GCCGCGGACG AGGTAAGACT GCAGTTTCCT GAGAATTGGG GGATTGACTA A
 
Protein sequence
MSGGFLSAEA LLAAGPRGLP HAVERALWHL GFSDVRIVDG SGDDGADLLA VRDREQWVFQ 
CKWSSNRAID RQGVDDLERA RHTYRADKAV LVTNIGLNRS AEERRSALES IGINITTWTG
PTLASIWERM PVRVPATFDL RDYQVTAANR VEADLRDHGS SLLVLATGLG KTVIGGEVIR
RHLADRPDAA VLVAAHMKEL VEQLERALWR HLGKDVPTRL VTGDHKPPAL DGVVVGTVES
VLGLVRSGRL TPSLVMIDET HHVSENGRFA ELLDLCGDAA RFGVTATPWR GDKFDITSRF
GRPSFKMSIA EGMSAGYLSA VDYRIFVDNI DWEFVRRASD HQYSIKELNR QLFLPQRDEE
ILEFFRTAWR ETRDPRAILF CQTIEHAEHI AKLLATADSA WQRATFLHSG LSRQRRQILL
NEFRLGRVPV ITCVDVFNEG VDVPDVNLIG FLRVTHSRRI FVQQLGRGLR LSPGKKELKV
LDFVTDIRRV AATLDLRRSL DESESEHLRL AAPHAGVRFS DETAGSLLDN WIKDAADLET
AADEVRLQFP ENWGID
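
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation of the kind that underlies the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two groups of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base groups of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGAGTGGTG GGTTCCTCTC")
print(fragment)               # ATGAGTGGTGGGTTCCTCTC
print(gc_fraction(fragment))  # 0.55
```

Applying the same two functions to the full gene sequence block would give a value that can be compared with the GC content reported in the Gene Information section.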