Gene Cagg_0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0120 
Symbol 
ID7266858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp165829 
End bp167793 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content50% 
IMG OID643564992 
Productserine/threonine protein kinase 
Protein accessionYP_002461508 
Protein GI219847075 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00183398 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTATGA CGATAAATAA ACCGACCTTT ACCAATCTGC CGCGCACATT TGGGCAATAT 
GACATCGACC ACCTCATCGG GCGCGGCGAA TCGAGTCAAG TGTTTCGCGC TCATCATCGC
TATATCCCTG AGCATAAAGT TGCGCTGAAA GTGCTCTTGA GTCAAGAGAC GGCGCGTATC
CAACGTTTTA CCCAAGAGGC GAGTATTGCT GCTCGCCTTC GTCATCCCCA TATCTCACGC
CTGATCGATT ACGGTGTACA GAATCCGTTT CACTATACGG TGTTTGAATA TATTAACGGT
AACTCGTTAC GCGATTTGGT GAAAAGTAAA GAACATCGTT TACCGCCCGA TAAGGTGTTG
CGCTATTTCC AGCAAATTGC CGATGCGCTT GATTATGCGC ATAGTCTCAA TATTGTGCAT
CGTGATGTAG CACCGGGTAA TATTTTGATC GATAGTGATG CAGAAAATGC TTATGTGATC
GATTTTGGCA TTGCTCGTGA TCCCGATCAA TCGCTCACCT CAACCGGAAT GGTGATGGGT
ACAACCGGTT TTATTGCTCC TGAATGCATG ATTTCTGCAA ATAATGCGAC TCATCTATCC
GATATTTTTA GCCTGGGTGT TGCTTTATTT TTTATGCTGA CCGGTGAATT GCCGTGGTAT
GAAGTGCCAA AAATGGTTGA TTCATCGCTC ACAGTGTTTC AACGGGTGCG CACGTTGGCC
GAGGCTGGGG TTAAGTTACC CGGAGAAGTT GACCGGATTA TTCGGGTATT GCTGGCGCTT
GATCCATCAC ATCGCTATGC ACACGCCGGT ATAGCGGCTG CTGAGTTGGA AGCGGTGCTC
GGCCCCCATT TCTCACAAAC TCAGATCGTT ACCGGAACAA CGTCCGTACA GCCGATTCGC
AAGACGAAAC AGATCGTTCT CATCGAGCCA AACGAAGTAG AGCAAGCGTT GAGTGGTTTA
CTCGTTCATG AACCGCTTGA ACGCGCGCTT GAACGGGCAC GTATGCTCGA TGAGGTCCGC
ATTAGCCAGT TGCTCGATCA GTGGTCGCAA GAGCGTCCGT TACGCTTGCC ATTGCTTGGC
CGTTTGGTCC GGATTCATGA AATCAAACAC TACAATGTGT TCTTTTTCCA TCTTCGCTTG
CTCATCGAGC AACGCCGTGA CGCCGGAGTG ACCGAAGAAC CCGATACCAA CCAAAAACCA
ATACCACTTC AGCCTGAGTA CGATCGCTGG CAGATTGAAT TGCCGTCGCC AAAAGAGTTT
ACGCACGAAC AGGGAAAACC TATGGTCGTA CCCGGTTCGG AACGGGTGAT CAATTGTCCG
AATTGTAACG GGCTTGGCAT TATCGTTTGC CAAAAGTGTA AAGGTGCCAG GCGGATAACC
ATCGAGGAGC GTGATTCTGC ACAATCGGCA ACCGATGATC GGTCCACGTC ATCGAACACA
CCGGTAGTAC GCCAACGGGT GATTTCTTGT CCCACATGCG AGGGCCGCGG TAAGATACCC
TGCGAACGGT GCAAAAGTAT CGGACGGTTG CTGCAACGTA AGCTGATGGA ATGGTCGCGT
TGGCCAAAGT TCGATCGGGC GCAGAACGAC TTACCTGAGG TTGATGAAAA TTGGCTACAC
CGCACTTGCC GCGAGGAATT GGTCTATCGC AAACGAGAAA ACCGCATTCC CACAGAGTGG
TTGCAGATCA CCGAAGTCAA AGCGATGATC GAACGTCAAC AGCGCGAGCT TGATCAGGAT
AGCCGAATTG TTATGGCCGA ATTACAGATC AATTTTATTC CGCTCACCGA GATTGAGTTC
GATCTCGGCA ATCCAGCACA ACCGTACCAA TTGGCGATCT ATGGTTTCGA GAACCTGATC
CCGTCTGATT GGCGCTTTTT GCACTGGGAG CGGATCCTCT TTTCTGCCGG TATTGGTCTC
CTTTCGTTCT TTTTGGTGAT AAGTTTGGTA TTCCTCGCAA TGTAA
 
Protein sequence
MAMTINKPTF TNLPRTFGQY DIDHLIGRGE SSQVFRAHHR YIPEHKVALK VLLSQETARI 
QRFTQEASIA ARLRHPHISR LIDYGVQNPF HYTVFEYING NSLRDLVKSK EHRLPPDKVL
RYFQQIADAL DYAHSLNIVH RDVAPGNILI DSDAENAYVI DFGIARDPDQ SLTSTGMVMG
TTGFIAPECM ISANNATHLS DIFSLGVALF FMLTGELPWY EVPKMVDSSL TVFQRVRTLA
EAGVKLPGEV DRIIRVLLAL DPSHRYAHAG IAAAELEAVL GPHFSQTQIV TGTTSVQPIR
KTKQIVLIEP NEVEQALSGL LVHEPLERAL ERARMLDEVR ISQLLDQWSQ ERPLRLPLLG
RLVRIHEIKH YNVFFFHLRL LIEQRRDAGV TEEPDTNQKP IPLQPEYDRW QIELPSPKEF
THEQGKPMVV PGSERVINCP NCNGLGIIVC QKCKGARRIT IEERDSAQSA TDDRSTSSNT
PVVRQRVISC PTCEGRGKIP CERCKSIGRL LQRKLMEWSR WPKFDRAQND LPEVDENWLH
RTCREELVYR KRENRIPTEW LQITEVKAMI ERQQRELDQD SRIVMAELQI NFIPLTEIEF
DLGNPAQPYQ LAIYGFENLI PSDWRFLHWE RILFSAGIGL LSFFLVISLV FLAM