Gene Cagg_2074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2074 
Symbol 
ID7269233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2538390 
End bp2539439 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content52% 
IMG OID643566909 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_002463398 
Protein GI219848965 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCGCT TGATCATTCT GCGCTTACTA GAAAGCTTCT TCCGCCGGCC TTTGCTGAGT 
GTTGCACCTT TTGTGATCGG GGTGCTATTA GGTGCCGGCT ACATCTTACT CTCTCCACCA
GAATTTTTGT CGAGTGGCAA GATTTATATT GAAAAAGATA GTCTCCTTGC GTCGCTCACC
TCATCCAAAA GCGACGCTTC ATGGTGGGTA ACGCCGGCCC AAGCAACGAC GAACGAACTG
TATGAGTTAC TCGCTACCAA CGCTTTTGTG CGGGCGGCGA TTCAGCAAAC CAAACTCGAA
CCGTACATGT CCGGTGGGCC AGATGTCGTC TGGGAAACGT TTACCTTCTT CCGTGACACG
ATTAGCATTA ACCCATTAGG CGATAAACTG GTTGAAATTC GCGCCACGAC CGACGATCCT
GAACTATCAT ATCAGATGGT CGTAGCGACG ATGGATACCT ACTTGAAATG GAAGTTGAAC
ACCGATTTTC AAGAGAGTGT CGCGGCCCAA AAGTTTTTTG AAGATCTGAT CGCTCCGTAT
CAAGCCGACG TTGATCAGGC TCGTCAGGCA TTAATCGACT TTCTCAGTGC TAATCCCGAA
CCGGTACGCG GCGATCGCCC GCCCGGTGAG CAGTTTCAAC TCGACCAATT ACGGGCAGCA
CTGGCCCGCG CCGAAGAACG TCTGAGCACG GCCCAAGAGA ACGAAGAGAG CGCACGCTTG
GCGTTGGTCA AGAACGAGAG CTTGATCCGG CAGACATACC AGATCGTTGA CCAGCCCGAA
ATCCCGCTCA GAGCCGAATT CTCGATCACG ACGTTCGTCA AGAATATGAT CATTTTTGTC
GTGATTGGTT TATTCCTTTC AGTGAGTTTG ATCGGTGGCG GTGCTCTCAT CGATCGCAGT
CTGCGTTTTC CGATTGACGT GCGTAATAGT CTGAATCTGC CGCTGCTCGC AGTGGTACCG
CTGAGTTGGG AACCGCTCAC ACCGACACCG ATTGCAACGA TCACAGAGAC TGACCAACCG
ACACTGCAAG CTCAAGTACA GGTGAAATGA
 
Protein sequence
MVRLIILRLL ESFFRRPLLS VAPFVIGVLL GAGYILLSPP EFLSSGKIYI EKDSLLASLT 
SSKSDASWWV TPAQATTNEL YELLATNAFV RAAIQQTKLE PYMSGGPDVV WETFTFFRDT
ISINPLGDKL VEIRATTDDP ELSYQMVVAT MDTYLKWKLN TDFQESVAAQ KFFEDLIAPY
QADVDQARQA LIDFLSANPE PVRGDRPPGE QFQLDQLRAA LARAEERLST AQENEESARL
ALVKNESLIR QTYQIVDQPE IPLRAEFSIT TFVKNMIIFV VIGLFLSVSL IGGGALIDRS
LRFPIDVRNS LNLPLLAVVP LSWEPLTPTP IATITETDQP TLQAQVQVK