Gene Cagg_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1960 
Symbol 
ID7268876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2393178 
End bp2394458 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content59% 
IMG OID643566798 
Productglycosyl transferase group 1 
Protein accessionYP_002463291 
Protein GI219848858 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000390342 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00283274 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCATTG CGATGCTGAG TGTGCATAGC AGTCCGTTGG CCCGTCTTGG CGGGAAAGAA 
GCCGGTGGAA TGAACGTGTA TGTCCGCGAG TTGGCCCGCG AGTTGGGACA GCGCGGGATA
GGGGTTGATA TTTTTACCCG TCGCCAAGAT CGAACAACAC CAACGGTGCT TCCACTAGGA
CGTGGTGTCC GCTTGATCAG TTTGCACGCC GGTCCGGCTG CACCCTACGA TAAAAATTTG
CTCCTGACCT ACTTGCCGGA GTTTGTTAGC CGGGTACGCT GTTTTGCCGA TAGTGAAGAT
GTGAGTTATG ATCTCATCCA TAGTCACTAT TGGCTGTCGG GTGAAGCAGC GTTGCGCCTA
CGTCGTGTGT GGCGCGCACC GGTCGTGCAG ATGTTTCACA CCTTGGGAGC GATGAAGAAT
AGTGTCGCTC GCTCGGAGGA AGAGGTGGAG ACTAAGCGCC GCATTGCGAT TGAACGACGT
TTACTGCGAG AAGTCGATGT CGTTGTCGCT GCTACACCGC TTGACCGGGC GCAGATGGTG
TGGCACTATG GCGCTGATGC GAACCGCATT AGGGTGATAC CATGTGGGGT TGATTTACGC
CGATTTCAAC CGGGCGACCG TATGCAGGCG CGGGCAGCGC TCGGGATTGC ACCTGATGCG
ATAATGCTGG TATGCGTGGG GCGGATGGAG CCGCTGAAGG GGATGGATGC GCTGATCCGG
GCTGCAGCGC GGTTGCTAGC CCAACATCCA GACTGGAAAG AGCGATTGCA GGTAGTGTTA
GTCGGCGGTG AAGATGAGAC TCAACCGGAC CGTTGGAATA GCGAGCAACG CCGACTCGAT
GCGTTACGTC ACGAGCTTGG TCTGCCAGCA CACGTCATCT TTGCCGGCGC ACAACCCCAA
GATCGGTTGC CACTGTATTA CACGGCGGCT GATGTAGTCG CTGCGCCTTC TCATTATGAG
TCGTTTGGCC TGGCAGCGCT GGAAGCGTTG GCGTGCGGGG CGGCGGTAGT GGCGTCAAAT
GTGGGCGGGT TGGCGCTGAC GATTGAAGAT CGGCGCAGCG GTTTGTTGGT TCCGCCGGAT
GATGATGCGG CGCTCGCTGA CCAGATCGAG CGCATTCTGA CCGACGCAGC ATTGGCCGCA
CGGCTGCGTT CGGGAGCAGT ACAGCGTGCG GCTGAGTATG GCTGGCCGGC GATTGCCCGG
CGCATTGCCG CGCTCTACGA TGAGCTGACG GCAGCGAACG CGGCGATCTG GTCGGCGCGG
CGATTGGTGC GGGTATCGTG A
 
Protein sequence
MRIAMLSVHS SPLARLGGKE AGGMNVYVRE LARELGQRGI GVDIFTRRQD RTTPTVLPLG 
RGVRLISLHA GPAAPYDKNL LLTYLPEFVS RVRCFADSED VSYDLIHSHY WLSGEAALRL
RRVWRAPVVQ MFHTLGAMKN SVARSEEEVE TKRRIAIERR LLREVDVVVA ATPLDRAQMV
WHYGADANRI RVIPCGVDLR RFQPGDRMQA RAALGIAPDA IMLVCVGRME PLKGMDALIR
AAARLLAQHP DWKERLQVVL VGGEDETQPD RWNSEQRRLD ALRHELGLPA HVIFAGAQPQ
DRLPLYYTAA DVVAAPSHYE SFGLAALEAL ACGAAVVASN VGGLALTIED RRSGLLVPPD
DDAALADQIE RILTDAALAA RLRSGAVQRA AEYGWPAIAR RIAALYDELT AANAAIWSAR
RLVRVS