Gene Cagg_0142 details


Gene Information

Locus tag: Cagg_0142
Symbol:
ID: 7266881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 189495
End bp: 190985
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 56%
IMG OID: 643565014
Product: O-antigen polymerase
Protein accession: YP_002461529
Protein GI: 219847096
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
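
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(189495, 190985)
print(length_bp)                  # 1491, matching "Gene Length"
print(protein_length(length_bp))  # 496, matching "Protein Length"
```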


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0300218
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATTTAT TCACCGGTCA ACGATTAGAG CGATCTGTGT GGTTATGGCT AGCCGCGGCT 
GCCTTAGTTG GGGTTGGTAT TGCGCTCGTC CCACCACTCT TCGCCGTAAG CTGGCTCCTC
GGTCTGGCCG CGCTCGGTTT GGCGGTGTGT GATCCGATTT GGCCGGTAGC GCTGGCCGTG
CTCTCGGTAC CATTCCAGCA ATTGGTCACG CTGCCCGGTG GGCTGAGTGT GACCCAGTTC
TGTTTCATCC TGGTGGCCCT TAGCTTCCTC TGGCAATTGT CCCAACGACG GTGGCCCTGG
CCGGATATGC CGGGCATTGC TCTGGCCATT TTCCTTTGGA CACTCGCCGT GACCGCCGCT
TTGACACCAC TTAGCCGCAG TGAGGGACTA AAAGAGACAC TCCGTTGGGG AACAGTACTC
CTGATCTACC TTGCTGCAAT GAGTGCGCTG CAAGACCCTG ATCGAGTACA ATGGCGACGG
GCCGTACTCG TTGCCTGTTT GCTTGCTGCC CCGGCGATAA CGGCGTTGAT CGGTATTGGT
CAGCACCTGA CCGGAATCGG CCCGGCGAGT TTTGCCGTTG GAGACGGGCG GGTGCGCGCC
TATGGCACGA TTGGTCAACC AAACTCGTTT GCCGGCTACC TGAATCAGGC GTGGCCGTTG
GCAGCCGGTT TTGGCCTGGT GATGATCGTC ACACATCATT GGCACACCTG GCGCGACAGG
TTGCGCTTAG GCATCGTCTT CATCACGGCG GGTAGCTTGA TCGGTGGGTT ACTGGCGAGC
TTTTCGCGTG GCGGCTGGGT AGGAGCAGCA CTCGGTGCGA CGGTCATGAC GGTTGTGCTT
GGCGCCTGGT ACGGACGACG GATGCTGCGA CAGAGCATAC CGGTTATCCT TGTGGCAGTA
TTTGGGGGAA TGATCCTGGT GAATAGTGGG TTGCTACCGA CCGCGCTGAG TAGTCGGCTT
ACATCCATTA TCGCCAATCT CCAGCCGTTC GATGTGCGTA ATGTTAACAT CACACCGGAC
AACTTCGCAG TAGTCGAGCG AATGGCGCAC CTGCAAGCAG CGTGGAATAT GGTGCAAGAA
CGGCCGCTAT TGGGAGTAGG ACCGGGAAAT TTCACCATCG CCTACGAACG GCTGGTGTAT
AGTGGGCAAA CACCCACATG GATTAAACCA TGGTATGATT CTCGTGGTCA CGCTCACAAC
TACTACCTGC ACATCGCTGC CGAAAGTGGT TTGATCGGAT TGAGTGCGTA TCTGCTCTTG
CTAGGTAGCG TTTGGCGTAC TGCGGTGCGA GCAGTTCAAC AAGCGAACGA TTGGTTTACA
CGCGGTATCG CACTGGGTGG CATAGGAGTA GTGAGCACAC TGAGCGGTCA CAATCTCTTT
GAAAATCTGC ATGTTTTGAA TATGGGAGTG CAGTTTGCGG CAATCATTGC GCTTATCGCG
ACCATCAATA CCGGTCGCAC TGAACTGCAC AGTTGCAACG AGGACCTATG A
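
A GC content figure like the one reported in the gene information can be recomputed from a sequence in this blocked layout. This is a minimal sketch, assuming the value is simply the percentage of G and C among the A/C/G/T bases; `gc_content` is a hypothetical helper, and the database may compute or round its figure differently.

```python
def gc_content(seq: str) -> float:
    """Percentage of G/C among A, C, G, T bases.

    Spaces, line breaks and any other characters are skipped, so the
    10-base blocks shown on this page can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied with its spacing intact.
first_line = "ATGTATTTAT TCACCGGTCA ACGATTAGAG CGATCTGTGT GGTTATGGCT AGCCGCGGCT"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence instead of a single line gives the gene-wide value to compare against the reported percentage.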
 
Protein sequence
MYLFTGQRLE RSVWLWLAAA ALVGVGIALV PPLFAVSWLL GLAALGLAVC DPIWPVALAV 
LSVPFQQLVT LPGGLSVTQF CFILVALSFL WQLSQRRWPW PDMPGIALAI FLWTLAVTAA
LTPLSRSEGL KETLRWGTVL LIYLAAMSAL QDPDRVQWRR AVLVACLLAA PAITALIGIG
QHLTGIGPAS FAVGDGRVRA YGTIGQPNSF AGYLNQAWPL AAGFGLVMIV THHWHTWRDR
LRLGIVFITA GSLIGGLLAS FSRGGWVGAA LGATVMTVVL GAWYGRRMLR QSIPVILVAV
FGGMILVNSG LLPTALSSRL TSIIANLQPF DVRNVNITPD NFAVVERMAH LQAAWNMVQE
RPLLGVGPGN FTIAYERLVY SGQTPTWIKP WYDSRGHAHN YYLHIAAESG LIGLSAYLLL
LGSVWRTAVR AVQQANDWFT RGIALGGIGV VSTLSGHNLF ENLHVLNMGV QFAAIIALIA
TINTGRTELH SCNEDL
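
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce it, assuming the gene sequence is given in the coding direction and in frame. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons; since this gene begins with ATG, no special handling of the first codon is included. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is removed first, so blocked sequence lines can be pasted
    directly. Codons containing characters other than A/C/G/T raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGTATTTAT TCACCGGTCA ACGATTAGAG"))  # MYLFTGQRLE
```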