Gene Cagg_1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1124 
Symbol 
ID7268578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1389098 
End bp1391131 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content56% 
IMG OID643565967 
ProductPglZ domain protein 
Protein accessionYP_002462470 
Protein GI219848037 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0434385 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCACC ATGCACCACC TACCCCACAC ACCGAATCAT CCTGGCGCAG TCTCATCCTG 
CGCGAATTCA CACCAAAGGT CGCCCGGCTG ACTCTCGTCG CCGATCCAGA CGGACTACTG
CTCGAAGAAG GTGTCCTTGA CGGAATCCGA TCCCAGGGCT TTGAGCTGAT CCCGTTTGAC
GATCACATCG CCTTTCGCTA TGTCTACGAA GCGAAGTTTC GCTCACGCTG GGATCGGGGC
GAGGACACCG ATCTCGTCGT CGTGCTGAGA TCGCCATCAA GCGATCTCAC CACCTTACCT
TACGATCTGC TTCAGGCGGG CCGCAAGCTC TCGTTCAACC TCGGCGATCT GTTCCCAAAG
CTCAGCTATC CCGTGGTCGC AGCGCTCGAC AAGACCTGGT TCGACACCTT GTTCGAGGCG
CAGGCGAAGT ACGCCCAGGA GCAGCTCGGC GATACCGCAA CAAAGGCGTT CGTTCTTCGG
CACGTGTTCG AGATCGCGCC AGAACTCATC AAACAGCCAT CAGACCTGCT CCACATCTTG
CTCCGCCGCC ATTACCGCCG ACAACGAATT CCGGCCATCC TCGATGAGCA CCTGCTTCAG
GAGCTTCGTC AAACCAGCCT GTTTGCCGAT TGGCCGCTGG AAGCCATCAT TCCAGACGCG
CAAGCGTTCT TTGCCTTCTT GCAAGAGCGC TGGCCGGTGT TCCTCGACCG CTTTGCAGCC
GAGATGCCTG GTGAGCAGAT GCACGTGCGC GAGCGAGCAC CAGCCTACGA ACTACCCGCC
CTGCACCTAC CGTTCGACCA CCCGGATGTG CGGATTTATA TCGACAACCT TTTTCTCGAA
GGTATCCTTC AGCCGGTTGC TCACGATCAA TCGCACGAAC TCGCCAAAAC CTGGGTGAAG
TACGGTATCA AAATCTCCCC TGCGGAAGAC CGTCTGCGCC GGGTCAAGGG ATTGCTGGCT
TCCTGCGAAA CATCGCTTCC TAACGAGAAA GCAAACTATA GAGAGTGGAT CCATTTTGCT
CAACAGTGGG CACAACTAAA CGCATTGATT ACCGAAGGCG AGCTACAAGC AGAAACTGAT
GCCAAGATCC GACAATTTAC CAATGGGGTA GATGCCCAAT TCCTCGCATG GGTGAAGCAG
CATTACGGAA ATCTCTTCAA TCTGCCGGCT ACCGATCCGG TCATGCTTCA CCATCTGCCA
CGATACTTCG CCCACTCACT ACCGCCAAAA TCAAAACTCG CGTTTCTGCT CGTCGATGGT
CTCTCCCTCG ACCAATGGAT CACCTTACGC GAGGTGCTCC ACGAACAAGA GCCAACCCTT
CTCTTCCGTG AGCGTGCTGT GTTCGCGTGG ATTCCTACCA TCACCTCGGT GTCGCGACAG
GCGGCTTTTG CCGGCCAACC GCCGATCTAC TTCTCAGAGA GCATCCATAC CACTCATAAG
GAACCGGAGC TTTGGACAAC ATTTTGGGAA AAGCAAGAGC TTTCTAAAGC GGAAGTGGCG
TATCTCAAGG GGTTGGGCAA CGGGCAAGGT GATGTGCAGA AGATTGACGA ACTCATCGCT
CAACCTAAGC TGCGCGTCGT TGGATTGGTC ATCAACAAGG TTGACGATAT TATGCACGGC
ATGCAGCTCG GCTCGGCAGG CATGCACAAC CAGATCCGGC AATGGGCCAG ACAGGGATTC
ATGCGCGATC TTATCACGCT ACTGCTGGGT AAAGGGTTCC ACATCTACCT TGCCTCAGAC
CATGGTAACA TTGAGGCGAG CGGCGTGGGA CGACCGAGTG AAGGGGTCGT AGCCGAAGTA
CGCGGCGAGC GTGTCCGGGT CTATTCCGAT TTTCGGCTCA GGCAACAGAC AGAAGCGGTA
TTTCCGGGAT CGTTGGCATG GGGATCGGTG GGTCTGCCGG CCAATTACTT CGTCCTCATT
GCGCCGAATC GAGCTGCGTT TGTCCCGAAA AACGAAATTA TCGTCAGTCA TGGCGGCATA
AGTGTTGAGG AACTGATCGT TCCGTTTGTC CAGATCGAGA GGAGGAGTCA ATGA
 
Protein sequence
MTHHAPPTPH TESSWRSLIL REFTPKVARL TLVADPDGLL LEEGVLDGIR SQGFELIPFD 
DHIAFRYVYE AKFRSRWDRG EDTDLVVVLR SPSSDLTTLP YDLLQAGRKL SFNLGDLFPK
LSYPVVAALD KTWFDTLFEA QAKYAQEQLG DTATKAFVLR HVFEIAPELI KQPSDLLHIL
LRRHYRRQRI PAILDEHLLQ ELRQTSLFAD WPLEAIIPDA QAFFAFLQER WPVFLDRFAA
EMPGEQMHVR ERAPAYELPA LHLPFDHPDV RIYIDNLFLE GILQPVAHDQ SHELAKTWVK
YGIKISPAED RLRRVKGLLA SCETSLPNEK ANYREWIHFA QQWAQLNALI TEGELQAETD
AKIRQFTNGV DAQFLAWVKQ HYGNLFNLPA TDPVMLHHLP RYFAHSLPPK SKLAFLLVDG
LSLDQWITLR EVLHEQEPTL LFRERAVFAW IPTITSVSRQ AAFAGQPPIY FSESIHTTHK
EPELWTTFWE KQELSKAEVA YLKGLGNGQG DVQKIDELIA QPKLRVVGLV INKVDDIMHG
MQLGSAGMHN QIRQWARQGF MRDLITLLLG KGFHIYLASD HGNIEASGVG RPSEGVVAEV
RGERVRVYSD FRLRQQTEAV FPGSLAWGSV GLPANYFVLI APNRAAFVPK NEIIVSHGGI
SVEELIVPFV QIERRSQ