Gene Cagg_0654 details


Gene Information

Locus tag: Cagg_0654
Symbol:
ID: 7266905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 808686
End bp: 810128
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 54%
IMG OID: 643565516
Product: O-antigen polymerase
Protein accession: YP_002462026
Protein GI: 219847593
COG category:
COG ID:
TIGRFAM ID: [TIGR01131] ATP synthase subunit 6 (eukaryotes), also subunit A (prokaryotes)
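
The coordinates and lengths in this record can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a terminal stop codon that counts toward the gene length but not the protein length; the function name `feature_lengths` is illustrative and not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final
    codon is a stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = feature_lengths(808686, 810128)
print(gene_len, protein_len)  # 1443 480
```

Under those assumptions the start and end positions reproduce the listed gene length of 1443 bp and protein length of 480 aa.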


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00317709
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCATTA ATATTCCCGC GCAACCGCCA ACGTTGCGCG ACCAGCTTGT TCAATCACCA 
CTCCTCATTG CCGGGCTGGC GCTGGCTTTT GGTGGCCTGA TCGGTGGGAT CACCGCTTTC
GGGCCACTGT ACGCCGTCGC CGGCCTGCTC GCCCTTGCGT TAGCTACTAC CCTGTTGGTC
AGTGTCAAGG CAGGATTAAT TGCCGCATTG GCGATTGCAA CGATTATCCC ATTCGGCACG
CTTCCCTTTA AAGCCATTAT TACCCCGAAT TTTCTGACGG TAACCCTCGT TACGCTGAAT
GCTGTCTGGT TTTTACGTGC ATTGGCTCGT TCAGACACCT ATGACGTTCG CTTCGGATCG
CTCGGTCTCC CCCTGATCGG GTTTCTCGGT TTGACCCTCT TTTCGCTCAT ATTGGGCGCA
CGTGGTTTGC CCGATCCGCA GACCTTACAC AATTACGCGA AATTTGTCTT GGGGGTGTTC
TGTTATCTTA CCGTGATCAA CTGTGTGCGT GACCGCGATA CCGCCCGTTT AGTTGTTCGT
GCATTGATTA TTGTCGGTGG GATCTCGGCC CTCATCGGTT TGATCTTGTG GGTATTACCT
GATGCAACAG CTCTGCAGTT GTTGGTAGCA CTGGGCCGGA TCGGCTATCC AACAAGTGGT
CGAGTGCTAC GCTACGTCGA AGACGATCCG AACGGCCTCG AACGGGCTAT TGGCTTGAAT
GTCGATCCGA ACAGCTTTGG CGGGATGTTA GCACTCGTCG CCGTGCTTAC CCTAACCCAA
TTGGCAGCAC CACGCCCCCT TTTGCCGCGA TGGTTGTTGG CAACCCTTGG CGGGATACAG
GTATTGACAC TTTTGCTCAC CTTTTCGCGC GCCGCCCTCT TCGGCTTGGT CATCGCTGCT
GCGTATCTCG CGACGGTGCA GTATCGACGG CTGTGGCGCT ATATGATCAT CGCCGGAGTC
ACGGGGGGTG TGTTGCTTAT GGGATTGGGA TATGCCGATG ACTTTATCAA CCGCGTGCTC
TCCGGTGTCC AGTTTCGCGA TCAAGCCCAG CAGATGCGGC TCGATGAATA TGCCAATGCA
ATCGCGATTA TTCAGCGCTA CCCGGTGTTT GGTATCGGAT TTGGTGCTGC ACCAGACCTT
GATCTGTCGG CCGGCGTCAG TAGCATTTAT CTGGCAATCG CTCAGCGCAT GGGTCTGGTC
GGCCTGATCG CCTTTATCGG CCTCATCGGC TTTTGGTATA CCCGCAGTCT CGACATTTTG
CCGCAGCTCG ATGACGAATC GACCAGTTGG CTCCTCGGTT GTCAGGGGGC TGTTGTAGCA
GCATTGGCCG TGGGGTTGGC CGATCACTAT TTCTTTAATA TTGAGTTTAG CCATATGGCA
ACCTTGTTAT GGTGTACGAT GGGGCTAGGT AGTGCCATTG AATGGCTGAT CAATGAGTCG
TAA
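
The gene sequence above is printed in blocks of ten bases, so it has to be joined before any analysis. As a minimal sketch, the helpers below (hypothetical names `clean_sequence` and `gc_percent`) strip the whitespace and compute a GC percentage that can be compared with the GC content listed in the record. Only the first line of the sequence is used here for illustration.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCCATTA ATATTCCCGC GCAACCGCCA ACGTTGCGCG ACCAGCTTGT TCAATCACCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 2))
```

Passing the full gene sequence instead of `first_line` gives the value for the whole CDS.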
 
Protein sequence
MSINIPAQPP TLRDQLVQSP LLIAGLALAF GGLIGGITAF GPLYAVAGLL ALALATTLLV 
SVKAGLIAAL AIATIIPFGT LPFKAIITPN FLTVTLVTLN AVWFLRALAR SDTYDVRFGS
LGLPLIGFLG LTLFSLILGA RGLPDPQTLH NYAKFVLGVF CYLTVINCVR DRDTARLVVR
ALIIVGGISA LIGLILWVLP DATALQLLVA LGRIGYPTSG RVLRYVEDDP NGLERAIGLN
VDPNSFGGML ALVAVLTLTQ LAAPRPLLPR WLLATLGGIQ VLTLLLTFSR AALFGLVIAA
AYLATVQYRR LWRYMIIAGV TGGVLLMGLG YADDFINRVL SGVQFRDQAQ QMRLDEYANA
IAIIQRYPVF GIGFGAAPDL DLSAGVSSIY LAIAQRMGLV GLIAFIGLIG FWYTRSLDIL
PQLDDESTSW LLGCQGAVVA ALAVGLADHY FFNIEFSHMA TLLWCTMGLG SAIEWLINES
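
The protein sequence can be checked against the gene sequence by translating the CDS. The sketch below builds the codon table in the usual NCBI base order (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons, so this table is assumed to be adequate for an ATG-initiated gene. The name `translate` is illustrative.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (ten-base blocks, line breaks) is removed first.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence from this record.
print(translate("ATGTCCATTA ATATTCCCGC GCAACCGCCA"))  # MSINIPAQPP
```

The output matches the first ten residues of the protein sequence listed above; the same call on the full gene sequence allows a residue-by-residue comparison.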