Gene Cagg_0830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0830 
Symbol 
ID7268282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1032275 
End bp1033744 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content53% 
IMG OID643565680 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_002462189 
Protein GI219847756 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.691353 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTGA CCAGCGTCGA ACTAAAAGCT AACAGTTCTG AGCGGATCGG ATGGCAATTC 
CATCGTCTCG CCTTAATCGG TGCGCTCATA GTCGGTGATG CCTTCATCGT GACCGTTAGT
TTTGTACTTG CCTATGTCGT GCGATTTTTG ATCAATCTAC CCTTCTTTAA CGAAGGTGCG
ATGCAGCCGG AATTTTATAC GCTGCTGATT ATGATGTTGG TCCCATGCTG GAGCGCACTG
TTTGCCGTAT ATCACCTCTA CGATGACAAG CTGTTGTTCA ACGGCACCCA AGAGTACCAA
CAAATTACCA ACGCTACGAG CATGGGTATG CTCATTGTCG TTCTCCTCAC CTTCTTCTGG
GACAACCTTG TGGTAGCGCG CGGTTGGTTG CTCCTCAGTT GGTTCCTCTC GCTGAGTCTG
ATGACCCTGT GGCGCTTCGG TGTTCGTCGG TTTGTTTACC GACTGCGGAA ACACGGTCAC
TTGCACAAAC GAGTCCTGAT TATCGGCGCC ACTGAGGAGG GTCAGGCAAT TGCCGAGCAG
CTTTTGGCCG AGAAACGCGC CGGTGCAACT ATCGTTGGCT TTATCGATAA CACGCTACCG
GTGGGCAGCA ATGTAGGGCA TGGTAAGGTG AAGGTTCTAG GTACCACCGG TGATTTTACC
CAATTGGTCC AACAGAACAA CATTGAGGCA ATCATTATTG CCGATACAAA CCTGATCCGT
GAGCAGCTCA TTACCATCAA CGGTGCAATG GATGTGCTCA GCCGATTGGA GGTTATGCTC
GCCCCTGGCC TTTTCGATCT CTTAACCATT GGGGTACAGG TCCGCGAGCA GGGTGCGGTG
CCACTGCTCA GCCTGAACAA AACGCGCATT ACCGGACTGC ACGCCATCGG TAAAAAGATC
GTCGACGTGG TGGGCGCGTT GGTAGGCCTG ATTCTACTGT CTCCACTCTT GATCTGCGTC
GCGATTGCCA TCAAATTGGA TAGTCCGGGT CCGATCATCT ACCGGCGGCG CGTTATTGGG
GTCGGCTATC GTGAGTTTTC CGCCTTTAAG TTTCGCACGA TGTACATCGA TGGCGACCGG
CGACTAACAC CAGAACAACG CGCCGAGTTG GCCCAGAAGG GAAAATTGAT CGACGATCCG
CGGATTACGC GCGTGGGCAA GTGGCTGCGG CGCACGAGTA TCGATGAATT ACCGCAACTC
CTCAATGTCT TGCTCGGCCA GATGAGCCTT GTCGGACCAC GAATGATCAC CGCCGGCGAG
ATGCATCATT TTGGGCGTTG GCAGCACAAT CTGCTCACGG TCCGACCCGG TTTAACCGGC
CTCTGGCAGA TCAGTGGGCG AAGCAATCTT GGCTACGCCG ACCGTGTGCG ACTCGATATG
CACTACATCC GCAACTATTC GATCTGGCTC GATCTATTCA TTATCTACCG TACTATCCCG
GTCTTGCTGA AGGGAGAAGG CGCCTACTAG
 
Protein sequence
MSLTSVELKA NSSERIGWQF HRLALIGALI VGDAFIVTVS FVLAYVVRFL INLPFFNEGA 
MQPEFYTLLI MMLVPCWSAL FAVYHLYDDK LLFNGTQEYQ QITNATSMGM LIVVLLTFFW
DNLVVARGWL LLSWFLSLSL MTLWRFGVRR FVYRLRKHGH LHKRVLIIGA TEEGQAIAEQ
LLAEKRAGAT IVGFIDNTLP VGSNVGHGKV KVLGTTGDFT QLVQQNNIEA IIIADTNLIR
EQLITINGAM DVLSRLEVML APGLFDLLTI GVQVREQGAV PLLSLNKTRI TGLHAIGKKI
VDVVGALVGL ILLSPLLICV AIAIKLDSPG PIIYRRRVIG VGYREFSAFK FRTMYIDGDR
RLTPEQRAEL AQKGKLIDDP RITRVGKWLR RTSIDELPQL LNVLLGQMSL VGPRMITAGE
MHHFGRWQHN LLTVRPGLTG LWQISGRSNL GYADRVRLDM HYIRNYSIWL DLFIIYRTIP
VLLKGEGAY