Gene Information |
Locus tag | Cagg_3042 |
Symbol | |
ID | 7266573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 3697972 |
End bp | 3700986 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643567862 |
Product | serine/threonine protein kinase with WD40 repeats |
Protein accession | YP_002464336 |
Protein GI | 219849903 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
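The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3697972
end_bp = 3700986
protein_length_aa = 1004

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count toward the length.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 3015
print(remainder)                # 0
print(expected_protein_length)  # 1004
```

Under these assumptions the computed values agree with the reported Gene Length (3015 bp) and Protein Length (1004 aa).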
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00005148 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTCGCAA TCGATACCTT GCTAAACGGA CATTATCGGA TTACCGTTGT GCTCGACGCT TACCCCGATG CTGAGCTTTA CCGAGCGATT GATCAGCGTT CATCGCTGCG CGTCTTGATT ACAGCCTTGC CACAGCCAGA TCAGACGGCG GTGAACGATG TCTTGCGGTT AGCCCGCGAG CTGGCGCAGG TGCAGATGCC GGGCTTCTTG GCGCTGCGCG ACTATTTTGC GATTGAACAC GTATGCTATC TGGTAGCCGA TGATCCGGGT GGGTCGGATT TAGAACGGTT TGCGCGAGAA CGTGGATCAC CGTTGTCTGA ACAAGAGACG CTGGCGATAG TCGACCGCCT ATTGGCGGTT CTCGAACGGT TGCATCGTCA TCAGCCGCCG CTCTTGTTGG GTGATGTACG TACTTGTGAT TTGTGGTCGT CACCGGAAGG CGGGTTGAGT TTGGCACCGT TCGCCTGTGC GCGCCATATT GGGGCAGAGG CAACACCGTA TCGCGCGCCG GAGTTGTACG ATCACGCGGT TGAGCCGGCT CCGGTGAGCG ATATTTATGC GATGGGGGCC GTGTTGTACC ACTTGCTGAC CGGCTGGCCG CCACCGCCGG CCAATCAGCG TCAAGCCGGG ATGCCGCTTA ACGCACCGCG CGTGTTGAAT CCACAGGTGT CGGTGTTGGC CGAACAATTG ACCTTGCGAG CACTGGAATT GAAACCGGCT AACCGCTATC AGCAGGTAAG CGAGATGCGG AGTGCGCTAG AGACGGTTCG GCTGATGGCG GGACGACCGA TGGGGGCTAC CCCGCCAATT GAACGTCCGG TTACCCCGGT AACGCCTGCC CCTCCTCCAA CGGCATCGGC GACGACTGTA TCACCTCCGG CGCCGACGAC CGCCGTACCT CCTCCGGCGT TGGCTGCGCC GTTGCCTCCC ACCCCACCAC CGATTGCCGC ACCGACAGCA CCGGTAGCCG CCGCGCCGTC GCGGCCTTTC CTGAGTACCT CGTGTTTGCT GGCAATTGTG GGCGGTTTGG CCGTGATTGC GTTCGGGGTG TGTGTCCTGG TGGCGGTACT GGTTGGTTTG TATATGACCA ATAGCTCGGT CTTCGGATGG ATCGGCAGTA CCGCGGCGAT GTCACCGACG GCATCCGCCT TGCCTACTCC GTCTGCCGCA GTGACGACAG AATTGCGACA ACAGGTTGAG GCGATTACGC AAACCGCTCA GTTACGTGAA GACGGTCTAG GTGCTGCAAC GTATAGCCCC GACGGTCAAC TCGTTGCGGT CGCGGTTGGT AAGGGGGTAC AGTTGCGGGA TGCTGAGACA TTGGCGTTAC AGCAATCGCT CAATGGTCAT ACGGGTGATG TTAGTGCGCT AGTGTTTAGT CCTGACGGTA CAATCCTTGC CTCTGGTGCG CAAGATGATC CGGTCGTGCG GGTGTGGAAT GTGCGCAACG GTCGTGAGGT GCTCCAGTTG CAAGGTCACG AAGATTGGAT TCGCTCGCTG GCGTTTAGTC CTGATGGCCG ATTGCTCGCT TCGGGGAGTG CTGACCGCAC GATTAGGATT TGGGACGTTG CCCGTGGCGA GACGCTCGTG GTACTGCGAG GACATACCGA CCTGCTCGGC AATGTGGCGT TTAGTCCTGA TGGTCGGCGA TTGGCCTCGG CCTCGCGCGA TGGAACGGTG CGCTTGTGGG ATGTAGCGAG CGGGCAGCAG ATTGATACGT TTCGGTTTAC CGCGCCGGTT GACACCCAGA GTAATGCCCC GTTCTGGATG ACGGGGATCG CGTTTTCTCC TGATGGTCGT 
CAAATCGCAG CCGGATCGAT TAACGGTAAT GTCTATCTCC TCGATGCTGA GACAGGTAAT GTTCAACGCG AACTGCGTGG TCATGATGGG TGGGTGGTGA TTCGTGGTGT CGCGTACAGC CCGGATGGTC GCCTGTTGGC TAGTGCCAGC CTTGATGGCA GTGTACGGCT CTGGAATCCG GTGAATGGGG TCGAGCGTGA CGTGTTGCGG CAACGCGGTC TCCGTCTACT TGGCTTGAGC TGGAGTCCCG ATGGTTCGCG TATTCTCTCA TCGAGTGATA TGGGCGGGAA TCTGGCCATT TGGGATGTGG CCTCGGCCCA GATTGTGCAG AGTTTTCAAA TAACGCAAGG GGTTGTAACG GGCGTTCACT ATAGCCCTGA CGGCAAGTTA CTGGTTGCGA GCGGTGCGAA CGGTGCGGTA CGAGTGCATG TCCTCGAGAG TGGTCGTACT TTGAACCTTG ACGGTGGCGC AGCGACGAAT GATTATATCG AGTGTATTAG CAATAACGAA GTGGTGGCAA TTAGCGAAGC CGGTGAGATT GTCGTCATTG ATTTAACCAA TCGCCGTCCC AACGAAATGC TCGACGGTAT GAATGGTTTT CCGCTCAATC TGGCGGTAAG TCCAGATCAT AGTCTGATCG CAGTTGGGAA CGAGCGGGGT GAAATCTACC TGTGGGAAAC GGTGAGCCGC ACCTACTTGC GTCGGTTGGA CGGTCTGAGT GGGCCGGTTT ACACGTTGGC CTTCAGCGCC GACAACGCAT ATCTCGCTGC TGCGACGAAT CAGCCTGCTG ATGCACCGCA AATCGCCGTC TGGGATCTAG CGCGTGGGGG GAATCCGCAA ATTCTCCGCG GCCATAATGG ACCGATTGCG AAATTAGTCT TCTCTGGCAC GCTTCTCTTC AGCGCTAGTA GCGATGGTTC GTTGCGGGTG CGTGATGTAG CGCACGATAA TACCGAAGTG TTGCAGATGA GTCTGCCGGC AGATCGCGGC TGGATGACGA GTGTTGCCAT TACGCCCAAT GGTAAGGTGT TGGTTGCCGG TACGATTAGT GGTCATCTGG GCTTTTACAA CATCAGCAAC GGCGAATTAC TACGAGAGAT CGATTTAGCG TCCGGTGCGG TGCTCGATCT CGCTATTACC CCTGATGGTC GGCAATTGGC CGTCAGTACG CGCGATGAGG GTATCTTGTT GTTCGATCTA TCGTCGGTAC GCTAG
|
Protein sequence | MLAIDTLLNG HYRITVVLDA YPDAELYRAI DQRSSLRVLI TALPQPDQTA VNDVLRLARE LAQVQMPGFL ALRDYFAIEH VCYLVADDPG GSDLERFARE RGSPLSEQET LAIVDRLLAV LERLHRHQPP LLLGDVRTCD LWSSPEGGLS LAPFACARHI GAEATPYRAP ELYDHAVEPA PVSDIYAMGA VLYHLLTGWP PPPANQRQAG MPLNAPRVLN PQVSVLAEQL TLRALELKPA NRYQQVSEMR SALETVRLMA GRPMGATPPI ERPVTPVTPA PPPTASATTV SPPAPTTAVP PPALAAPLPP TPPPIAAPTA PVAAAPSRPF LSTSCLLAIV GGLAVIAFGV CVLVAVLVGL YMTNSSVFGW IGSTAAMSPT ASALPTPSAA VTTELRQQVE AITQTAQLRE DGLGAATYSP DGQLVAVAVG KGVQLRDAET LALQQSLNGH TGDVSALVFS PDGTILASGA QDDPVVRVWN VRNGREVLQL QGHEDWIRSL AFSPDGRLLA SGSADRTIRI WDVARGETLV VLRGHTDLLG NVAFSPDGRR LASASRDGTV RLWDVASGQQ IDTFRFTAPV DTQSNAPFWM TGIAFSPDGR QIAAGSINGN VYLLDAETGN VQRELRGHDG WVVIRGVAYS PDGRLLASAS LDGSVRLWNP VNGVERDVLR QRGLRLLGLS WSPDGSRILS SSDMGGNLAI WDVASAQIVQ SFQITQGVVT GVHYSPDGKL LVASGANGAV RVHVLESGRT LNLDGGAATN DYIECISNNE VVAISEAGEI VVIDLTNRRP NEMLDGMNGF PLNLAVSPDH SLIAVGNERG EIYLWETVSR TYLRRLDGLS GPVYTLAFSA DNAYLAAATN QPADAPQIAV WDLARGGNPQ ILRGHNGPIA KLVFSGTLLF SASSDGSLRV RDVAHDNTEV LQMSLPADRG WMTSVAITPN GKVLVAGTIS GHLGFYNISN GELLREIDLA SGAVLDLAIT PDGRQLAVST RDEGILLFDL SSVR
|
| |
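The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the reported GC content; `clean_sequence` and `gc_fraction` are hypothetical helper names, not functions of any database tool.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# The first two blocks of the gene sequence, used only as a short demo.
demo = clean_sequence("ATGCTCGCAA TCGATACCTT")
print(len(demo), gc_fraction(demo))
```

Passing the full Gene sequence field through `clean_sequence` should give a string whose length matches the Gene Length field, which is a quick check that no blocks were lost when copying.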