Gene Caci_1137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1137 
Symbol 
ID8332472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1281839 
End bp1284727 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content73% 
IMG OID644954285 
Productserine/threonine protein kinase 
Protein accessionYP_003111904 
Protein GI256390340 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGGGG GACGCTACCG ACTGGTCGAC CGGTTGGGTT CCGGGGCGTT CGGCCAAGTG 
TGGCGCGCGC ACGACATGCA CCTCGGGGTG GATGTCGCGG TGAAGCAGGT CCTGCTGCCG
ATGGAGGGCA AGGGCCCGCA GGCCGCCGAG CGGGTCGCTC GAGCCGCCCG CGAGGCGCGC
AACACCGCGC GGCTGCGGGA CTGCCCGAAC ATCGTGACCG TCCACGACGT GCACATCGAG
AACGGCGCGC CCTGGATCGT CATGCAGCTG GTCAAGGGCG AGAGCCTGGC CGAGCGGCTG
TCGCACCGCG GACCGCTCAC CGAGAACAAC GCCGTGGCGC TCGCGCGCGA TCTGTTACGC
GCGCTGGCAG CAGCGCACGC CGCAGGGATC GTGCACAGAG ATGTGAAGCC GGGAAACGTC
CTCCTGGCCG CCGACGGCAC CGCACTGCTC GCGGACTTCG GCATCGCCGT CCGCTTCGAC
GACGCGCGGG TCACGCGCAC CGGGACCTTC ATCGGATCGC CCGGATACGT CGCCCCCGAG
CGCGTGATCT CGCAGGACGC AGGCGCCATC GGCGACCTGT TCTCCCTCGG AGCCACTCTT
TATGAGGCGG TGGAAGGCAT TCCCGCCTTC CGCGCCGACA TCATCGGCTC GGTCGTCGCC
GGACAGCCGG CCTCGATGCG CCGGGGCGGG CGGCTGACCG CTTTGATCAC CCGGCTGCTC
GACAAGGATC CCGCGACGCG GCCCGACGTG GCGGCGGCGC TCGCGCTGGT CGGCGACTGG
ACGCCGCCTC CTACGGTCGA GGCGCCCGGG TCGCTGCTGC CGTCGCTGGC GCAAGATGTC
CAGCTCGGCG TGGAGTGGGT CGAGCACTTC GCGTTCGTCA ACGAGCGGCT GCGTAATGAC
GACATCGCGG CCAGCCGTGG CGCGCGCACG TCCAACGCGG TGGAGGCGCT GGAGAGCATC
CGCACCGAGA TCGGCGTCGA CCAGAACGAA TATCTGGCGG CGGGCGGTCG CGAGGTGCAC
GCGGTCGTCT CGGTCAGCAC TGCCGATTCC GGGCGCAGCG CTCCCACGCA CGCGGCCGCC
TCCGCTGCCG CGCTGCGTGC CGATCGCAGC TACGGCGCGA AGCCCGTGCC GGTGGCGAGC
GCCGACGGAC CGCCGTCGGT CGTCTTCATC GCCGACTGCT CGCGGTCGCT GGCCGAGCCG
GGGCGGTTGG CGGGCGTCAA GGCCGCGCTG CACGCCGGGA TCGACAGCCT GCCGGACGGC
GCGCGGTTCG CGGTGATCGC CGGGCGCGCC GACAGCCAGG CGGTCTATCC CGAGGACGGC
GGCACCGAGG CGGTGACGAA GGCTTCGCGT GCGGCGGCGA AGGCTGCCGT GGACCGGCTG
ACGGCCTACG GCGGCCGCGA GATCGGGCTC TGGCTCAGCC GGGCGGCGCG GTTGTTCGCG
CTCGGGTCCG GACCGGTGCG GCACGCGATC ATCGTCGTGA ACGGCCGCGA CGAGGGTCTG
CATCCGTCCC GGTTGGCGAT GACCGCGCGC GCCTGCACCG GGTTGTTCAC CGCTGACTGC
CTGGGTTTGG GCGACGATTG GGACGTCCAC GAGATGCGCC TGGTCTCCGA GCAGCTGTCC
GGGACGGTCG CGCTGGTCGC CGAGCCGGCG GCGCTGCCGC AGGCGGTCCG GGAGGCGACG
GTCGGCGCGG CGGCGAAGCG CGTCGCAGAC GTCGAACTCA GTGTGTGGAC GCCGAGCGGC
GCGGTGATCC GGTACGTGAA GCAGGTCGAA CCGGTGCTGC GGGACCTGAC CGACAACCGG
ATCCCCGGCG ACGCCGTGCG CACCGTCGGC TTCCCGACCG GTGCGTGGGG ACCGGAGACG
CGGACGTTCC ACATCTGCGT CGAGGTGGCG CCGGGCGAGC TCGGGCAGGA GAAGCTGGCG
GCGCGCGTGC AGCTCACGGC GCGCGGTCCC GAAGCGGTGG AAGTGCTCGG CGAGGGCAAG
ATCCGCGCGG TGTGGGTCGA CGACGAGACG CTGGTGATGC GCGTCGTGTC GAAGGTCGGT
CCCTACGGCG GGCAGTCCGA GGCCGAGGTG TGGGTCGATC CCGAAATCGG GGACCTGGAA
CGGGAACTGG CCGGGCTCGC CGACGAGCTG TCGCAGGCCG AGGCCGAACT CGCCGAGGTG
CGCAACCTGC TGGCGGTGTT CGGCCGCGCG CACGCGCGGA TGTTCGCGCC GCTGCTGGCC
GAACTCGACG AGATCGAGGC GCGCGTCGCC GAGGTGCACG CCGCGCGCAG CGGCCGCGCG
GACGACCAGC GCGACGCCGA GACGGCCCGC GAGCGCGCGC AGCAGTCGGC GCGCCAGGCC
GACGAGGAGA AGGTGCGCGC GACCCGCGCC GAGCCGCCGC GCCCGGCGCC GACCGGCGAA
GCCAAGCGCA TGTACCGCAA ACTCGCCCGC CGCTGCCACC CCGACCTCGC CGACGACGAG
TCCGACCGCC AGCGCCGCGA GGTGTTCATG GCGCGCGTGA ACGACGCCTA CACCCGCGGC
GACATCGGGC TGCTGGCGCA ACTGTCGCGC GAGTGGGACG CCGAGGGCGG CGCCGGCACG
TCGGCACCGC CGAAGGACCC CGGCGAGCGG CGCAAGCTCA AGGAGCACCT GCAGCAGGCG
CTGTCCTCGG TCCACAACCG GCTGGACCGC ATCCGCGACG AGCTGTCGGC GGCCCGCGAA
TCCGAACTCG GACGCATCGT CTTCGCGCCG GGACAGGACG CGGGCATGCC GGCCGCGATG
CGGCGCCTGG ACAAGATGGC GGACAAGCTC AAGGCCTTGG TCGACGAACG GCGGCGGGTG
CTGGACGGGT TGGTGGACGC CGCCGCGGCC GCGGCCGACG CCGGGATGGC GGACGAGCGC
CGGCGGTGA
 
Protein sequence
MIGGRYRLVD RLGSGAFGQV WRAHDMHLGV DVAVKQVLLP MEGKGPQAAE RVARAAREAR 
NTARLRDCPN IVTVHDVHIE NGAPWIVMQL VKGESLAERL SHRGPLTENN AVALARDLLR
ALAAAHAAGI VHRDVKPGNV LLAADGTALL ADFGIAVRFD DARVTRTGTF IGSPGYVAPE
RVISQDAGAI GDLFSLGATL YEAVEGIPAF RADIIGSVVA GQPASMRRGG RLTALITRLL
DKDPATRPDV AAALALVGDW TPPPTVEAPG SLLPSLAQDV QLGVEWVEHF AFVNERLRND
DIAASRGART SNAVEALESI RTEIGVDQNE YLAAGGREVH AVVSVSTADS GRSAPTHAAA
SAAALRADRS YGAKPVPVAS ADGPPSVVFI ADCSRSLAEP GRLAGVKAAL HAGIDSLPDG
ARFAVIAGRA DSQAVYPEDG GTEAVTKASR AAAKAAVDRL TAYGGREIGL WLSRAARLFA
LGSGPVRHAI IVVNGRDEGL HPSRLAMTAR ACTGLFTADC LGLGDDWDVH EMRLVSEQLS
GTVALVAEPA ALPQAVREAT VGAAAKRVAD VELSVWTPSG AVIRYVKQVE PVLRDLTDNR
IPGDAVRTVG FPTGAWGPET RTFHICVEVA PGELGQEKLA ARVQLTARGP EAVEVLGEGK
IRAVWVDDET LVMRVVSKVG PYGGQSEAEV WVDPEIGDLE RELAGLADEL SQAEAELAEV
RNLLAVFGRA HARMFAPLLA ELDEIEARVA EVHAARSGRA DDQRDAETAR ERAQQSARQA
DEEKVRATRA EPPRPAPTGE AKRMYRKLAR RCHPDLADDE SDRQRREVFM ARVNDAYTRG
DIGLLAQLSR EWDAEGGAGT SAPPKDPGER RKLKEHLQQA LSSVHNRLDR IRDELSAARE
SELGRIVFAP GQDAGMPAAM RRLDKMADKL KALVDERRRV LDGLVDAAAA AADAGMADER
RR