Gene Cagg_1979 details


Gene Information

Locus tag: Cagg_1979
Symbol:
ID: 7268895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2417616
End bp: 2418989
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 56%
IMG OID: 643566814
Product: exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
Protein accession: YP_002463307
Protein GI: 219848874
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
            [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
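
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of that arithmetic follows. It assumes 1-based inclusive coordinates and a single untranslated stop codon at the end of the CDS; the function name `cds_lengths` is hypothetical and is not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates copied from the record for Cagg_1979.
print(cds_lengths(2417616, 2418989))  # (1374, 457)
```

The result matches the reported Gene Length (1374 bp) and Protein Length (457 aa).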


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0160004
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGAAC GCCTGCGCTC GCGAGTGATC AGGAGTGTCC TTGGATGGGA TCTCATCGCG 
ACGCTGCTCT GCCTCATGCT GATCGCAGGA GTTGATCCTG CGACGATGCT CCGCCAATGG
TCATTGTATG CCGGAGCGGT AGTAATCTGG TCAATCGTCT TTATCGTGCT GGCTCCACAG
CAGGCTGTCT TTGAACGTAC TATCCTCGAA GCATTTACCC GACTGTTGCT AGCCGTACTA
CTGGCAGCAA CCTCCTTTGC CGGCTTGCTC TACTTGATCA ATGGCAATCT CTTTGAGCGT
GGCAGCTTCC TTCGCTTTGT CATGCTCGAT CTCGTCATAG TGAGCGTCAT TCATTTGAGT
TTTCGTACTT TGGCTCGCCG GCGACAAACC CGCAGCAGGC GGCGCGTACT TGTGGTCGGT
CAGGCGAATG CGGCTGCCCG GTTGGCCGAG GAGTTTGGTC GCCGCCCGTG GACGGGGGTG
CAGATCGTGG GGTATGCCTC TGATGAGTGG GAAGCGCCGA GTACCCTGCC CCGCCTGGGT
GAGTTGAGCG ATCTTGTGAC GATCGTGCAG CGCCACCGGA TTGATGAAGT CATTTTTGCG
TTGCCACCGG CGCAGTACGA CCGCGTTGCC GAGTTATCCC TCCTGCTGTT GCGCGAACCG
GTGATGCTCC ATACAGTTCC GACTGCGCTC GATTTGGTCT TTGCCCGGAC ACCGGTGGAT
AGCGTCGGCG GGATCCCCCT CGTCTCACTT CGTGAGTCGG CCCTCACTCC CTCGCAGCGC
ATTGTGAAAC GACTGTTTGA TATTGTCGTG AGTTTTGGGC TGATTGTCCT CCTCGCACCA
CTGATGCTGG TGATTGCGCT CTTGATCAAG CTCGAATCAC CCGGTCCGGT ACTGTTTAAA
CAAGAGCGAA TTGGTGAGCA TGGTCGTCGT TTCACGATGT TCAAATTTCG CAGTATGTAC
GTGGACGCTG AACAGCGTTG GCAAGCGGTC GCGAAACGTG ATCCGACGAC CGGTAAATTG
ATCCACAAGC TGAAAGACGA TCCACGGGTA ACTCGCGTGG GCCGGAAGCT ACGCCGTACC
TCGCTCGATG AATTACCGCA GTTGTTTAAC GTCCTGCGCG GGGAAATGAG CCTGGTCGGG
CCACGCCCCG AAATGCCATA CATCGCTGCT GAGTATGAGC CGTGGCAATG GCAGCGCTTT
CGTGTCCCAC CCGGTATGAC GGGCTGGTGG CAAGTGAATG GGCGGAGTGA GAAACCGATG
CACTTGCATA CCGAAGATGA TCTGTATTAC ATTCAGAACT ACTCGTTCTG GCTTGATCTG
CGCATCTTAG CCAAGACGCT GGTAGTAGTG TGGCAAGGGC ATGGCGCGTA TTGA
 
Protein sequence
MLERLRSRVI RSVLGWDLIA TLLCLMLIAG VDPATMLRQW SLYAGAVVIW SIVFIVLAPQ 
QAVFERTILE AFTRLLLAVL LAATSFAGLL YLINGNLFER GSFLRFVMLD LVIVSVIHLS
FRTLARRRQT RSRRRVLVVG QANAAARLAE EFGRRPWTGV QIVGYASDEW EAPSTLPRLG
ELSDLVTIVQ RHRIDEVIFA LPPAQYDRVA ELSLLLLREP VMLHTVPTAL DLVFARTPVD
SVGGIPLVSL RESALTPSQR IVKRLFDIVV SFGLIVLLAP LMLVIALLIK LESPGPVLFK
QERIGEHGRR FTMFKFRSMY VDAEQRWQAV AKRDPTTGKL IHKLKDDPRV TRVGRKLRRT
SLDELPQLFN VLRGEMSLVG PRPEMPYIAA EYEPWQWQRF RVPPGMTGWW QVNGRSEKPM
HLHTEDDLYY IQNYSFWLDL RILAKTLVVV WQGHGAY
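
The GC content field in the gene information can be recomputed from the gene sequence. The sketch below shows one way to do this with the blocked, space-separated layout used here. The function name `gc_content` is hypothetical, and the choice to ignore every character other than A, C, G, and T is an assumption made for this example.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Spaces, newlines, and any character other than A, C, G, T are ignored
    (an assumption made here so that blocked sequence text can be pasted in).
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above: 5 of 10 bases are G or C.
print(gc_content("ATGCTCGAAC"))  # 50.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the reported GC content of 56%.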