Gene Caci_2338 details

Gene Information

Locus tag: Caci_2338
Symbol:
ID: 8333687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 2650980
End bp: 2653520
Gene Length: 2541 bp
Protein Length: 846 aa
Translation table: 11
GC content: 71%
IMG OID: 644955491
Product: Hedgehog/intein hint domain protein
Protein accession: YP_003113097
Protein GI: 256391533
COG category: [L] Replication, recombination and repair
COG ID: [COG0419] ATPase involved in DNA repair
TIGRFAM ID:
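
The coordinate and length fields above are related by simple arithmetic: the gene length is the inclusive span between the start and end positions, and the protein length is the codon count minus the stop codon. The following is a minimal sketch of that consistency check. The function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon, which this record does not state explicitly.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2650980, 2653520)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the Gene Length and Protein Length fields.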


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGACCT TCGCGATCGG CGGCCGGCTG CGCCGCTCCG TAGCGGCGGT GGCGACCGCC 
GCACTGGGCC TCGCGGCGTT CGTCGCACCG ACCACAGCGC AGGCACAGGC CGCCGCCGGA
CCGGGGAGCG CGGCTCAGCA TCCTGCCTCA GCCCTCGGCA GGTCGGGCTT CGCCATGGCC
GACCCCGCGC CGCCGCCCGC GCTGCCGGCG CCTCTGGACT CCTCGGGTCC GGCGCAGGTC
CAGGCGCTGG AGCAGCAGTA CCAGACCGAC GCCCAGAACG ACCAGCAGAA GTACGACCAG
CTCGCCGCCG AGAAGGGCAC GCTGGAGCAG CGCGCCGGGG ACGTGCAGAA CCGCGAGAGC
TCGCTGGAGA CGCAGGCCAC GAACCTGGAG GACCAGAGCA ACGCGCTCAA CAGCCAAGCG
GAGACCCTCA ACAGCGAGAT CGACGCGCAC AACGCCGAAC CGCACACCTT CGAGCTCCCG
GACGAAGAGG CGGAGTACGC GGCCTACAAC GAGGAGAAGG CCAACCTCGA CGAGCAGAAA
GCGAACCTGC AGAACCAGAT CGACGCGCTG ACCGCGCAAA GCAAAAAGCT GCAGAGCGAC
CAGGCGCAGG CCGACGCCGA CCAGACGCAG CTCGAGACGG ACGTCCAGAC CCACAACGAC
GCCGTCTCGG CGCTGGAGGG CGACGTCGGG AAGCTCGAAG CCGAACGCCA GCAGATCCTC
ACCCAGATCG ACAGCCTGCT GCAGGACTAC GCCGGCGCCG AGCCGGGCGG CGAGGGCGCC
CCGCTCGCCG CCGAGGGCGG CGACGAATCC GAACCGGCGG CCGCCGCCGC ACCCTCCCCG
GGAGCCCGGG CCCTGCCCAG CGGAGGCGGC GACCAGTCCG CACCGCCGCG CAGCACCTAC
GCACCGGTCC AGAACGCACC GTCCGGCTCG GGATCGGGCC AGGCCCAGGC TCAGGCCGCG
CCGGCGCCCG CCCCGACCCA GACCCCGGTG ACCGTGACGC TCGCGCCGTC CACGGTGTCC
GGCCTGCCCG CGAGCGAAGC CGAGAACCTG CAGCCCAGCG AGACCTTCGA CGGGCTGATC
CCCGAGGCGA ACGGCGACTA CGCCGCCGAG GAGATCCAGC CGCCCGCGGG CGAGTCCGTA
CCGCCGGCGC AGAAGGCGTT CGACAACGTC GTCAACAAGG GCGGCAAGGC CTCCACCCGG
ATCGGCGGCC GACCGGCGAC CATCGACAAG ATCGTGCCGG AGGCCGCCGC CCCGGCCGAG
AACCAGGGCG GGGACACACC GCGCCCGGCG AAGGCGGCGG CGCCACCGGC GCGGTCGAGC
TGGGTCCCGG CCGTGAACCA GCCCAACCCG GCGAACGGTC CGCCGGTCAG CATCGACGCG
CTGAAGTCGC TGCTGGACCA GCAGGGCCTG GGATCGGACG CGGACCAGTT CGACCTCGAA
TACTCGCCGA CCGTCCTGGG CCAGGACGGC GAACCCGCCT ACGCGGTCGC GCCGACGGAC
GCGGCGGGGA ACCCCGAACT CGGCGCCGAG GGCAAGCCGA TCCTCCGGTT CTCCAACCTC
GGCCTGCAGA ACCCCGAAGT GGCGCAGGAC GCCTTCGAGA ACGAGGGTCT GGACGTCGAG
CCGGCCCAGG ACGACTCCCC GTGTCCGCAC AGCTTCTCCG GTGCCACCAG GGTCCTGATG
GCCGACGGGT CCACGAAGGC GATCGCGGAG GTCGGGGTCG GCGACCTGGT CGAGAACGCC
GAGCCGGGCG GCCGCGCCGA GGTCCACCGG GTGGACCAGG TCCACACGAC CACCACGGAC
GCCGCCTTCG TCGACGTGGT CGTGGCCTCG GCCGCACCGG GCGGCGGCGG AACGCTCACC
GGCACCGCGA ACCACCCCTA CTACGACGCC ACCGCCGGCG GGTTCGTCGA CGCCGGAGCA
CTGCGCGCCG GCGACCGGCT CCAGAGCGCC GGCGGCGGGC AGGCGACGGT CAGCGGGGTC
CACGCGCGGT TCGGCCCGCT CGTCACCTAC GACCTGACGA TCGACGGACT GCACACGTAC
TTCGTCGTCG CCGGCAGTGC ACCAGTCCTC GTTCACAACT GTGACGGGGA CCTGCTGGAC
ATCGCCAAGG ACTCCGCGAC CCGCGCGAGC GGCTATCAGA AGCTCGCCGG CAGCGACTGG
GAGGTCAAGA ACAAGACCAC CTCGGTGATC CGGGCGAGGT TCCCCAGCGG AGACCCGAAG
AATCCCTGGG TCTACAAAAA CGTCGTGTCG AGCAGCGGCT CCGGCTTGTC GCCCGCGCAG
ATCAGCGCCA TCGAAGCCAA CGGCGATGTC GCGGTCACGG ATAACCTGGA AGGCTTCACC
CACGCCGAGT ACAATGGCCT GCGATATATC GACAGCATGG GCGGCCAACC GATAGCGGGC
GGCGCGTCCA GAAGCGTGTG CACCACCATC TGCGGCCCAT TCATCAGAGG GACCGACGGC
AACATATCCG GCCCGGTCTA TCAGCTTGAG CACGGAACAA AGATAAGGAC GTTCTATTGG
CCGGGATCGA CGCCGGGGTA G
 
Protein sequence
MATFAIGGRL RRSVAAVATA ALGLAAFVAP TTAQAQAAAG PGSAAQHPAS ALGRSGFAMA 
DPAPPPALPA PLDSSGPAQV QALEQQYQTD AQNDQQKYDQ LAAEKGTLEQ RAGDVQNRES
SLETQATNLE DQSNALNSQA ETLNSEIDAH NAEPHTFELP DEEAEYAAYN EEKANLDEQK
ANLQNQIDAL TAQSKKLQSD QAQADADQTQ LETDVQTHND AVSALEGDVG KLEAERQQIL
TQIDSLLQDY AGAEPGGEGA PLAAEGGDES EPAAAAAPSP GARALPSGGG DQSAPPRSTY
APVQNAPSGS GSGQAQAQAA PAPAPTQTPV TVTLAPSTVS GLPASEAENL QPSETFDGLI
PEANGDYAAE EIQPPAGESV PPAQKAFDNV VNKGGKASTR IGGRPATIDK IVPEAAAPAE
NQGGDTPRPA KAAAPPARSS WVPAVNQPNP ANGPPVSIDA LKSLLDQQGL GSDADQFDLE
YSPTVLGQDG EPAYAVAPTD AAGNPELGAE GKPILRFSNL GLQNPEVAQD AFENEGLDVE
PAQDDSPCPH SFSGATRVLM ADGSTKAIAE VGVGDLVENA EPGGRAEVHR VDQVHTTTTD
AAFVDVVVAS AAPGGGGTLT GTANHPYYDA TAGGFVDAGA LRAGDRLQSA GGGQATVSGV
HARFGPLVTY DLTIDGLHTY FVVAGSAPVL VHNCDGDLLD IAKDSATRAS GYQKLAGSDW
EVKNKTTSVI RARFPSGDPK NPWVYKNVVS SSGSGLSPAQ ISAIEANGDV AVTDNLEGFT
HAEYNGLRYI DSMGGQPIAG GASRSVCTTI CGPFIRGTDG NISGPVYQLE HGTKIRTFYW
PGSTPG
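
The gene and protein sequences above are printed in blocks of ten characters, and the protein starts with M even though the first codon is GTG. A minimal sketch of how the two sequences relate is given below. It builds the codon table shared by the standard code and translation table 11, and reads an alternative start codon as methionine. The names `translate` and `gc_fraction` are hypothetical, and the start-codon set is a simplification (ATG, GTG, TTG only) rather than the complete list for table 11.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG ordering.
# Table 11 uses the same mapping as the standard code for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a simplified set of start codons; table 11 allows more.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the printed sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last codons of the gene sequence shown above.
print(translate("GTGGCGACCT TCGCGATCGG C"))
print(translate("CCGGGATCGA CGCCGGGGTA G"))
```

Applied to the first 21 bases and the last 21 bases of the gene sequence, this reproduces the beginning and end of the protein sequence shown above.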