Gene Cagg_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3042 
Symbol 
ID7266573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3697972 
End bp3700986 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content57% 
IMG OID643567862 
Productserine/threonine protein kinase with WD40 repeats 
Protein accessionYP_002464336 
Protein GI219849903 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00005148 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTCGCAA TCGATACCTT GCTAAACGGA CATTATCGGA TTACCGTTGT GCTCGACGCT 
TACCCCGATG CTGAGCTTTA CCGAGCGATT GATCAGCGTT CATCGCTGCG CGTCTTGATT
ACAGCCTTGC CACAGCCAGA TCAGACGGCG GTGAACGATG TCTTGCGGTT AGCCCGCGAG
CTGGCGCAGG TGCAGATGCC GGGCTTCTTG GCGCTGCGCG ACTATTTTGC GATTGAACAC
GTATGCTATC TGGTAGCCGA TGATCCGGGT GGGTCGGATT TAGAACGGTT TGCGCGAGAA
CGTGGATCAC CGTTGTCTGA ACAAGAGACG CTGGCGATAG TCGACCGCCT ATTGGCGGTT
CTCGAACGGT TGCATCGTCA TCAGCCGCCG CTCTTGTTGG GTGATGTACG TACTTGTGAT
TTGTGGTCGT CACCGGAAGG CGGGTTGAGT TTGGCACCGT TCGCCTGTGC GCGCCATATT
GGGGCAGAGG CAACACCGTA TCGCGCGCCG GAGTTGTACG ATCACGCGGT TGAGCCGGCT
CCGGTGAGCG ATATTTATGC GATGGGGGCC GTGTTGTACC ACTTGCTGAC CGGCTGGCCG
CCACCGCCGG CCAATCAGCG TCAAGCCGGG ATGCCGCTTA ACGCACCGCG CGTGTTGAAT
CCACAGGTGT CGGTGTTGGC CGAACAATTG ACCTTGCGAG CACTGGAATT GAAACCGGCT
AACCGCTATC AGCAGGTAAG CGAGATGCGG AGTGCGCTAG AGACGGTTCG GCTGATGGCG
GGACGACCGA TGGGGGCTAC CCCGCCAATT GAACGTCCGG TTACCCCGGT AACGCCTGCC
CCTCCTCCAA CGGCATCGGC GACGACTGTA TCACCTCCGG CGCCGACGAC CGCCGTACCT
CCTCCGGCGT TGGCTGCGCC GTTGCCTCCC ACCCCACCAC CGATTGCCGC ACCGACAGCA
CCGGTAGCCG CCGCGCCGTC GCGGCCTTTC CTGAGTACCT CGTGTTTGCT GGCAATTGTG
GGCGGTTTGG CCGTGATTGC GTTCGGGGTG TGTGTCCTGG TGGCGGTACT GGTTGGTTTG
TATATGACCA ATAGCTCGGT CTTCGGATGG ATCGGCAGTA CCGCGGCGAT GTCACCGACG
GCATCCGCCT TGCCTACTCC GTCTGCCGCA GTGACGACAG AATTGCGACA ACAGGTTGAG
GCGATTACGC AAACCGCTCA GTTACGTGAA GACGGTCTAG GTGCTGCAAC GTATAGCCCC
GACGGTCAAC TCGTTGCGGT CGCGGTTGGT AAGGGGGTAC AGTTGCGGGA TGCTGAGACA
TTGGCGTTAC AGCAATCGCT CAATGGTCAT ACGGGTGATG TTAGTGCGCT AGTGTTTAGT
CCTGACGGTA CAATCCTTGC CTCTGGTGCG CAAGATGATC CGGTCGTGCG GGTGTGGAAT
GTGCGCAACG GTCGTGAGGT GCTCCAGTTG CAAGGTCACG AAGATTGGAT TCGCTCGCTG
GCGTTTAGTC CTGATGGCCG ATTGCTCGCT TCGGGGAGTG CTGACCGCAC GATTAGGATT
TGGGACGTTG CCCGTGGCGA GACGCTCGTG GTACTGCGAG GACATACCGA CCTGCTCGGC
AATGTGGCGT TTAGTCCTGA TGGTCGGCGA TTGGCCTCGG CCTCGCGCGA TGGAACGGTG
CGCTTGTGGG ATGTAGCGAG CGGGCAGCAG ATTGATACGT TTCGGTTTAC CGCGCCGGTT
GACACCCAGA GTAATGCCCC GTTCTGGATG ACGGGGATCG CGTTTTCTCC TGATGGTCGT
CAAATCGCAG CCGGATCGAT TAACGGTAAT GTCTATCTCC TCGATGCTGA GACAGGTAAT
GTTCAACGCG AACTGCGTGG TCATGATGGG TGGGTGGTGA TTCGTGGTGT CGCGTACAGC
CCGGATGGTC GCCTGTTGGC TAGTGCCAGC CTTGATGGCA GTGTACGGCT CTGGAATCCG
GTGAATGGGG TCGAGCGTGA CGTGTTGCGG CAACGCGGTC TCCGTCTACT TGGCTTGAGC
TGGAGTCCCG ATGGTTCGCG TATTCTCTCA TCGAGTGATA TGGGCGGGAA TCTGGCCATT
TGGGATGTGG CCTCGGCCCA GATTGTGCAG AGTTTTCAAA TAACGCAAGG GGTTGTAACG
GGCGTTCACT ATAGCCCTGA CGGCAAGTTA CTGGTTGCGA GCGGTGCGAA CGGTGCGGTA
CGAGTGCATG TCCTCGAGAG TGGTCGTACT TTGAACCTTG ACGGTGGCGC AGCGACGAAT
GATTATATCG AGTGTATTAG CAATAACGAA GTGGTGGCAA TTAGCGAAGC CGGTGAGATT
GTCGTCATTG ATTTAACCAA TCGCCGTCCC AACGAAATGC TCGACGGTAT GAATGGTTTT
CCGCTCAATC TGGCGGTAAG TCCAGATCAT AGTCTGATCG CAGTTGGGAA CGAGCGGGGT
GAAATCTACC TGTGGGAAAC GGTGAGCCGC ACCTACTTGC GTCGGTTGGA CGGTCTGAGT
GGGCCGGTTT ACACGTTGGC CTTCAGCGCC GACAACGCAT ATCTCGCTGC TGCGACGAAT
CAGCCTGCTG ATGCACCGCA AATCGCCGTC TGGGATCTAG CGCGTGGGGG GAATCCGCAA
ATTCTCCGCG GCCATAATGG ACCGATTGCG AAATTAGTCT TCTCTGGCAC GCTTCTCTTC
AGCGCTAGTA GCGATGGTTC GTTGCGGGTG CGTGATGTAG CGCACGATAA TACCGAAGTG
TTGCAGATGA GTCTGCCGGC AGATCGCGGC TGGATGACGA GTGTTGCCAT TACGCCCAAT
GGTAAGGTGT TGGTTGCCGG TACGATTAGT GGTCATCTGG GCTTTTACAA CATCAGCAAC
GGCGAATTAC TACGAGAGAT CGATTTAGCG TCCGGTGCGG TGCTCGATCT CGCTATTACC
CCTGATGGTC GGCAATTGGC CGTCAGTACG CGCGATGAGG GTATCTTGTT GTTCGATCTA
TCGTCGGTAC GCTAG
 
Protein sequence
MLAIDTLLNG HYRITVVLDA YPDAELYRAI DQRSSLRVLI TALPQPDQTA VNDVLRLARE 
LAQVQMPGFL ALRDYFAIEH VCYLVADDPG GSDLERFARE RGSPLSEQET LAIVDRLLAV
LERLHRHQPP LLLGDVRTCD LWSSPEGGLS LAPFACARHI GAEATPYRAP ELYDHAVEPA
PVSDIYAMGA VLYHLLTGWP PPPANQRQAG MPLNAPRVLN PQVSVLAEQL TLRALELKPA
NRYQQVSEMR SALETVRLMA GRPMGATPPI ERPVTPVTPA PPPTASATTV SPPAPTTAVP
PPALAAPLPP TPPPIAAPTA PVAAAPSRPF LSTSCLLAIV GGLAVIAFGV CVLVAVLVGL
YMTNSSVFGW IGSTAAMSPT ASALPTPSAA VTTELRQQVE AITQTAQLRE DGLGAATYSP
DGQLVAVAVG KGVQLRDAET LALQQSLNGH TGDVSALVFS PDGTILASGA QDDPVVRVWN
VRNGREVLQL QGHEDWIRSL AFSPDGRLLA SGSADRTIRI WDVARGETLV VLRGHTDLLG
NVAFSPDGRR LASASRDGTV RLWDVASGQQ IDTFRFTAPV DTQSNAPFWM TGIAFSPDGR
QIAAGSINGN VYLLDAETGN VQRELRGHDG WVVIRGVAYS PDGRLLASAS LDGSVRLWNP
VNGVERDVLR QRGLRLLGLS WSPDGSRILS SSDMGGNLAI WDVASAQIVQ SFQITQGVVT
GVHYSPDGKL LVASGANGAV RVHVLESGRT LNLDGGAATN DYIECISNNE VVAISEAGEI
VVIDLTNRRP NEMLDGMNGF PLNLAVSPDH SLIAVGNERG EIYLWETVSR TYLRRLDGLS
GPVYTLAFSA DNAYLAAATN QPADAPQIAV WDLARGGNPQ ILRGHNGPIA KLVFSGTLLF
SASSDGSLRV RDVAHDNTEV LQMSLPADRG WMTSVAITPN GKVLVAGTIS GHLGFYNISN
GELLREIDLA SGAVLDLAIT PDGRQLAVST RDEGILLFDL SSVR