Gene Cagg_1065 details

Gene Information

Locus tag: Cagg_1065
Symbol:
ID: 7268517
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1314843
End bp: 1316939
Gene Length: 2097 bp
Protein Length: 698 aa
Translation table: 11
GC content: 55%
IMG OID: 643565910
Product: hypothetical protein
Protein accession: YP_002462415
Protein GI: 219847982
COG category:
COG ID:
TIGRFAM ID:
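
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the final codon is an untranslated stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1314843
end_bp = 1316939
gene_length_bp = 2097
protein_length_aa = 698

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
computed_protein_length = computed_gene_length // 3 - 1

print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under those assumptions both checks agree with the listed Gene Length and Protein Length.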


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAGC ACAGAAACAA ACAGACACCA AAGCGTGGAC AGATGCATCA AAGCGGAGGC 
AAGGCCAATC GCCCCGAAAA GCGGGATCGT CATCAGACAC CGGTGCAGGT GGTTCAACCA
TCGCCACCGA CGGTGCCGTG GCCACGTGCC AAAGAGTCAA CCAAGGCGCG GGAAGCCTAC
CGCTTCCTCA ACCCATACAA TTTCGTCCGC TATTTGCCAC CACCCGATAT ACCGGAAACC
GATCCCGATG CGCAGTTGCT GGGCCGTTGC CCACCACCGC CTCACGACCG GTATGTGGGT
TTGACCGGGC GTATCACCTG CACCCTCGAA GCGGTCACCC CTCTTTTTAT TGCGGATAGC
CACGATGTGC AATCGACTAC TATTCTGCTG GCCGATAACC GCGAAGTTCA GCATAAAAAC
TACCGTTTTT TCCAGTATGA TGGTCAAGAC GCTATACCTG CTACGAGTCT GCGCGGTATG
ATCCGTGCCT TGTTTGAAAC GGTGACCAAT TCACCGTTCA GTGTCTTCAA TGGCGAGGAA
CGGCTCGAGT ATCGTATCGA CCCGATTGAG TCCAAACGGT TTAAACCCGG CATCGTGCTG
AGTTTGCCGG ACGGTGATCA ACCGGGCGTC ATTGCCCTCT GCGAGGAGGC GACAATCGGT
GCGTATCACG AAGACCGTAA CCTGAACGTT TTGCACGGCG ATTGGCGTTG TGGTGAAACG
GCGTATGCCG TTCTGAGCAC CGCCAAGAAT GGTGTGAAGA AGGTGGAAGC ACTTGCGCGT
GAGCAAGATA AAAATCGCTT ACTAAAGTAC AACAAACCCT TGGTTAAGGG GTGGGTGAAA
ATAACCGGCC GCACTATCGA GACAAAACGC AATGAGCGTT TCTTTTATTT CAAAGAAGGT
GCTCCGGCCA AAGCCAAGCA CGTCCATTTC GATGCCGAGC GCGAAGCCGA CTTTAACGCG
GTGTTGCAGG CCCAACTCCA CGAACGACGT GACGATTTTC ACAGCCAGGG GCAGAGTGAT
CGGCTGGCAT CGGGCAATTT GGTGTACGTT GAACTGGAAC CGGATCAAAA GACCGTGCGC
AATATTGCCC TGGCAAAAGT GGCGCGGCTG CGCTATCGCC ATTCTATCGG CGATCTGCTC
CCTGAGCACC TGAAGCCGAG TGAAGAGTAC GAGCGGCTAG ACATTGCTTC GCGCGTATTC
GGATGGGTAC GCGCCACGCC AGCCGAGGAT CGCAAGGATC GCGTTGCCTA TGCCGGACGG
GTGCGCTTTA GCCACGCAGT ATTGATTGAT GACAAAGGCG TATATGCCGA ACCCATGCCG
TTGGCCGTCT TGGGTTCACC CAAACCGACC ACCACGCTTT TCTACCTGCG CAAGAAAGAC
GGCGAATGGA GTGAGGAGGA GCGCAAGAGA CCAGGCTCTG CAACCACCAT TGGCTACGAT
GGACCCAATC AGTTGCGTGG ACGGAAGTTC TACCGTCACC ACGGAAATAG TCTGAATCGG
CTCGAATATG AGCGTGCCGG ACAGCGTCGC GATCATCAGA ATCGCACGGT GCGCGGAGTA
CGTGTGCCGG GTAACGTATT TCAGTTCACA ATCGATTTTC ATAACCTGGC ACCGGTTGAG
CTTGGTGCGT TGCTATGGAC GCTAAGCTTG GGCGAAGAAA AATGCTTTTT CCGGCTCGGT
TATGCTAAAC CGCTCGGATT TGGCAGCGTT AAGTTGACGG TAGATCAGGT TGATCTGCTC
GAACTCAGCA CTCGCTATTG TTCATTGCAG CAATCGGGGT GGCGGCGAGC TGAGGTTGGC
AAACGAAGTG AATGGGTGGC AGCGTTTGCA CAGGCGATGC AGCGATGCTA CGGTCAATCA
TTGGACCGAT TACCGCATAT TACCGATCTC CTCGCCTTGC TGCGCGATCC GGTTCCACCG
CTTCCCATCC ATTACCCGCG CACCGATGTT CATCCAGACC CAGAGGGAAA GAACTTCGAG
TGGTTTGTGG CCAATAAGGT AAAATCGAAC AAAATGGCCG ATGCCGGTCC TAATCTCGTG
CTCGAAGAGC CGGGCGATGA GCAGGGTTTG CCGTTACTCA CAAAGGAGAA GGAGTAG
 
Protein sequence
MAKHRNKQTP KRGQMHQSGG KANRPEKRDR HQTPVQVVQP SPPTVPWPRA KESTKAREAY 
RFLNPYNFVR YLPPPDIPET DPDAQLLGRC PPPPHDRYVG LTGRITCTLE AVTPLFIADS
HDVQSTTILL ADNREVQHKN YRFFQYDGQD AIPATSLRGM IRALFETVTN SPFSVFNGEE
RLEYRIDPIE SKRFKPGIVL SLPDGDQPGV IALCEEATIG AYHEDRNLNV LHGDWRCGET
AYAVLSTAKN GVKKVEALAR EQDKNRLLKY NKPLVKGWVK ITGRTIETKR NERFFYFKEG
APAKAKHVHF DAEREADFNA VLQAQLHERR DDFHSQGQSD RLASGNLVYV ELEPDQKTVR
NIALAKVARL RYRHSIGDLL PEHLKPSEEY ERLDIASRVF GWVRATPAED RKDRVAYAGR
VRFSHAVLID DKGVYAEPMP LAVLGSPKPT TTLFYLRKKD GEWSEEERKR PGSATTIGYD
GPNQLRGRKF YRHHGNSLNR LEYERAGQRR DHQNRTVRGV RVPGNVFQFT IDFHNLAPVE
LGALLWTLSL GEEKCFFRLG YAKPLGFGSV KLTVDQVDLL ELSTRYCSLQ QSGWRRAEVG
KRSEWVAAFA QAMQRCYGQS LDRLPHITDL LALLRDPVPP LPIHYPRTDV HPDPEGKNFE
WFVANKVKSN KMADAGPNLV LEEPGDEQGL PLLTKEKE
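
The gene and protein sequences above are printed in blocks of ten characters, so the spaces have to be removed before the two can be compared. The following is a minimal sketch, not the database's own tooling: it builds a codon table whose amino-acid assignments are those of the standard genetic code (which translation table 11 shares, differing only in permitted start codons) and translates the first line of the gene sequence. The function name `translate` and the constants are hypothetical names chosen for this example.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of the standard code,
# which translation table 11 also uses for sense and stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna):
    """Translate a space/newline-blocked DNA string, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCGAAGC ACAGAAACAA ACAGACACCA AAGCGTGGAC AGATGCATCA AAGCGGAGGC"
peptide = translate(first_line)
print(peptide)
```

The printed peptide can be compared with the first two blocks of the protein sequence (`MAKHRNKQTP KRGQMHQSGG`). Pasting the full gene sequence into `translate` should allow the same comparison for the whole protein.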