Gene Cagg_2760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2760 
Symbol 
ID7269830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3386932 
End bp3390051 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content55% 
IMG OID643567581 
Productglycosyl transferase group 1 
Protein accessionYP_002464059 
Protein GI219849626 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACCA GCGCGCTGAT TGGATGTTTA ACCATTCATA GACATCTTAT GAAATCACAA 
GCACTCTACG GGGTTGAATA CACCTTATTC GATCCATTGC CGACACGGTT GGTCGAAAAT
GAACCGCTCT TGATCGCGTT GCAGTTGCGC AACACCGGCC TGACTCCGTG GTTGACGACT
GGCTACCCGG TCATGCTGGT CGTGCGTTGG AAAACCCTTG ATGGGATCGT TGTGCAAGAG
CTGCCGTGGC AATCACTACC ACGACCGGTA CCGTGCGGTG ATGCTATCAC CATCGAGTTT
CGGGTGACAG CTCCAGTCGT GGCGGGTGAG TATGTGTTCA CCATAGAGCT GGTTGAACAC
ATGATCGCTT GGTTTCACGA GCGAGGCGTT GCACCATTTT GTTCACCAAT TACGGTTGAG
CCGCCGCGCA ATCCACGGAT TACCATTATC AACGGCAATT GTTTAGCAAA TGATGCGGTA
GGAACCCATG TAGCGGCGCA GGTGCAGGCA TTAGCGGAAG CTGGGTTTCA GCCGTTGTTA
TTGACGAGCT TTGTTGATGA ACGATTGCCA CGTGCGTTGC GCCGGTTTAT GGTGGTTGTG
CGGCCTGATG AGATTATCAA TCCTACTGAC TGGAGCCGGC CATTCGCTGA GCATTTTCGT
CGCTCGGCTG CTATTATTGT AAACTATTCG ACCTATTACG ATCTTGTCGA ACTGATCCGG
ATTGCACCGG CGCCAGTGCT GTTTGATTAT CACGGTGTCA CTCCGCCACA GATTTGGGGG
GTTGGGCAGC CGGGGTACGA AGATGTCGCG CGTGGTTTAG CCAATATGCA TCTCGTTCAG
TTCGCCGATT ATGCTGTGGC TCACAGCCAA TACATGGTCG ACGAGCTGGT GGCGACCGGT
CTGATCGCAC CAAGCCGTAC TAGTGTCTTA CCATATCCGG TCACTCGCGC TGCAGCGTAC
GCTGGGCCGC CCGATCCTCA GTTGCAACAG CAATACGCTT TGGCCGGGAA ACGGGTGTTG
TTGTATGTGG GTCGTATGGC GCGCAATAAG CGAATTAATA TCCTTGTGGA AGCGCTGCCA
TTGATCCGCG CGCAGTACCC TAACACGGTA TTGTTGCTAG TGGGTGAAAC CGGTCACGCT
TACGCGGAGT ATGTCGCGGA AACAAAGGCG CGCGCGGAAG AGTTAGGGGT GGCCGATGCC
GTGATCTTTA CCGGCGCTCA AAATCGTGAT CAGATCGGCG CTTTCTACCA GTTGTGCGAT
GTCTTTGTGA TGGCGAGCAT CCATGAGGGC TTTTGTATGC CAGTCATTGA GGCGATGGCG
CTAGGCAAAC CGGTGGTGGC GGCGGCGGCT ACCGCATTGC CTAGCACCGT CGGTGATGCC
GGCCTCCTCT TTACCCCGGA TGATCACGAG GAGTTGGCGC GGCAGGTGCT GCGTGTGCTC
GCTGCGTATG AAGAGCCGTC GCCTGATCCA CTCGATAGCG CCCAAGCCCA ACACCTCCTG
CGCAACTGTC CGATAGCCTT CGTGACACCA CGGTATGGAC GCGATGTGTT AGGCGGCGCC
GAGCGCGGTG CGCAAGCATG GGCCGAACAA TTGGCGACCC GCGGGTTTGC CGTCGAGGTT
TTAACAACCA ATGCCATTGA TTTGGTCGGC TGGCGCACGG CCCCGTTGTC CGAAATCGAA
GTGATTAATG GCGTTACGGT ACGTCGGTTT GCAGTTGATC CGGTCGATCC GAGCGGCTTT
CACGACGTAC AGATGAAAGC CGCTCGGGGT GAGGTTATTA CCCGGCGGGA CGAAGAGCGA
TTTATGCAGC ATAACTTGCG CAGCCGGGCG CTCGAGGAGT ATATTGCCCG TCACCGCGAC
GAGTATGCCG CCTTTATCTT CACCCCGTAT CTGTTTGGTA CTACGTATTA CGCCGCGAAG
CAGGCCGGCG ATCGCGCTTT TCATATCCCA TGTCTACACG ATGAGTCGGC AGCGCAATTT
GCCATCTTCC GCGAGATGTT AGAAGAAGCG CGAGGCATCT TCTTTAATAC TTCCGCCGAA
GAGCAACTAG CGCGACAGAA GCTGCACGTA GCTAATCCAT TTTCAACCGT GCTTGGTTAT
GGCTTCCCTG ATGAGCCATT ACGAGGGGAT CCAGTGCGTT TTCGAGAACG TACCGGTGTG
AAATCTCCGT TCTTGCTTTA CAGCGGGCGT CTAGAAGAAG CCAAGAATGT GCCATTACTT
ATCGAATGGT TTATAACCTA CAAGACAGCT CATCCTGAAA GCGATTTGAA GTTGGTGTTG
GCAGGAAAAG GTGAGATACC AATACCTGCT CGGTCAGATA TTGTGCATAT AGGCATGATA
GTCGATCGAC AAGAACTGGC TGATGCGTAT GCAGCAGCGC TTGCCCTCTG CCAGCTTTCG
CGCAACGAGA GTTTCTCGAT TGTGATGATG GAATCATGGC AGCAAGGTCG TCCGGTTATC
GTGCATGCTG ACTGTGCCGT AACTCGTGAG CATGTTAAAC GAAGTGGCGG TGGTTATAGT
TGTGATAGCG TGGCTAGTTT TAGTGCTGCT ATCGATGATC TACTCGCCGA TCCTCAACGT
GGCGCAGTGC TTGGAGAACA GGGCCGCACG TATGTGCAAG CTCATTTTGG CTGGAATACG
CTCGTTGATA AAATGATAGC AGCACTCGCT TCCTTCTTAC AGCCGAGACC ACTCATAAGC
GAGTTTGCCC AGCGTGGTAT TCGGCGCGCA CTCGATTTTA CCTATGCGCG CTTTGAAGCG
CATCTCATCG GGTTGATCCA GCATCTCTGC CATCAGAGCG CTGGTTGGCA ACTGTTCGAG
CAGGTGCAGG GCCGGTTGGC TGCACTGGCG AATAGTCATG GTGGGTTAGC CAACAAACCC
GAACCGGTGC CGGTTATGGC CCGTGCCCGC CGATGGGTGG ATCGGTTGCG TCAGCAGCCG
CATCCTTCTG CAGGGCTCCC GCCGGTTAAC GGCCATCAGC ACGAGCAGCA CGCAGTGATG
CAACTCGTCG CTGAATTACT CGATCTGCTC GCCTATACTC GCCACGAGCA GCGCCGGCTT
GAGCGCGAAT TGGCCTTGTT GCGCGATCAA ATGGCCATTC AGAAACAGAA TGCATCGTAA
 
Protein sequence
MRTSALIGCL TIHRHLMKSQ ALYGVEYTLF DPLPTRLVEN EPLLIALQLR NTGLTPWLTT 
GYPVMLVVRW KTLDGIVVQE LPWQSLPRPV PCGDAITIEF RVTAPVVAGE YVFTIELVEH
MIAWFHERGV APFCSPITVE PPRNPRITII NGNCLANDAV GTHVAAQVQA LAEAGFQPLL
LTSFVDERLP RALRRFMVVV RPDEIINPTD WSRPFAEHFR RSAAIIVNYS TYYDLVELIR
IAPAPVLFDY HGVTPPQIWG VGQPGYEDVA RGLANMHLVQ FADYAVAHSQ YMVDELVATG
LIAPSRTSVL PYPVTRAAAY AGPPDPQLQQ QYALAGKRVL LYVGRMARNK RINILVEALP
LIRAQYPNTV LLLVGETGHA YAEYVAETKA RAEELGVADA VIFTGAQNRD QIGAFYQLCD
VFVMASIHEG FCMPVIEAMA LGKPVVAAAA TALPSTVGDA GLLFTPDDHE ELARQVLRVL
AAYEEPSPDP LDSAQAQHLL RNCPIAFVTP RYGRDVLGGA ERGAQAWAEQ LATRGFAVEV
LTTNAIDLVG WRTAPLSEIE VINGVTVRRF AVDPVDPSGF HDVQMKAARG EVITRRDEER
FMQHNLRSRA LEEYIARHRD EYAAFIFTPY LFGTTYYAAK QAGDRAFHIP CLHDESAAQF
AIFREMLEEA RGIFFNTSAE EQLARQKLHV ANPFSTVLGY GFPDEPLRGD PVRFRERTGV
KSPFLLYSGR LEEAKNVPLL IEWFITYKTA HPESDLKLVL AGKGEIPIPA RSDIVHIGMI
VDRQELADAY AAALALCQLS RNESFSIVMM ESWQQGRPVI VHADCAVTRE HVKRSGGGYS
CDSVASFSAA IDDLLADPQR GAVLGEQGRT YVQAHFGWNT LVDKMIAALA SFLQPRPLIS
EFAQRGIRRA LDFTYARFEA HLIGLIQHLC HQSAGWQLFE QVQGRLAALA NSHGGLANKP
EPVPVMARAR RWVDRLRQQP HPSAGLPPVN GHQHEQHAVM QLVAELLDLL AYTRHEQRRL
ERELALLRDQ MAIQKQNAS