Gene Cfla_0729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0729 
Symbol 
ID9144600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp790786 
End bp792366 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content75% 
IMG OID 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_003635839 
Protein GI296128589 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00280675 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00149377 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGCAG AGCACGGGGC CACGACGCAC GCGCGGGCGG CCGCCCGGCG CCGCACGGGA 
GCACCGCGGG ACGTCCCCGT GACGACGCCC GCCGCCCCAC CCGCACCGGA GGCCCCACCC
CCGCACGCGC TGCGGTGGGA CCCCGCCCGC ATGCACACCC GCGGCCCGCG ACGGCCCGTG
TGGCTCGTGC GGTTCCACGC GCTGCTCATC GCGAACGACA CCGCCGTGGT GGTCGCCGCG
ACCGCCCTCG GCGCGTGGCT GTGGGGCGGC CCGCGGCCCG TGACGTTCTT CGGCGCGCCG
GTGCCCACCG TGTCGTGGCT CGCCGCCGTG GTCGCGATCT GGCTGGTCGC GCTCGCGGCC
GTGCGCTCGC GGTCCGAGCT GATCCTCGCC GTCGGCGTCA CCGAGCTGCA GCGCGTCCTC
AACGCGTCGG TGTTCGCGCT CGCGGCCGTC ATGAGCACCG CGTACCTGGG CGACGCGCAG
ATCCCGCGCG GCACGCTCGC CGGAGCGTTC GGCTCGGGTC TGCTCGGGCT CATGGTGACG
CGCCTGGCGT GGCGGCACCG GCTCATCGCG TGGCGCGCCG GCGGGCGGTG CAAGCGCAAC
GCGCTGCTCG TGGGGCCGCA CCGGGACGTC GTGCGGCTGC TCGGGGACCT GCGGCGCAAC
CACCGCGCCG GGTTCCGGGT GGTCGGCATC GCGCTCACCG ACGTCGACCC GGCACCCGAC
GCGCGCATCG ACGACGTCGA GACCTTCGGG CTCGAGGAGC TGGTCGACCG CGCGCACCAC
CCGCGCGTCA CCAGCGTCGT GCTCGCCGGC GACCTTCCCG GCGGGCGCGC CGCGATCCGC
CGGCTCGGGT GGTCCCTGGA GGGCGCCGCG ACCGAGCTCG TCCTGCCGAG CCGCCTGACG
TACGTCGCCG GGCCGCGCAT CCACCTGCGG CCCGTCGAGG GCATGCCGCT GGTGCACCTG
TCGCTGCCCA CGTACACGGG CGTCGCGCAC GTCGCCAAGC GCGGCGTCGA CGTGGTCGTC
GCGTCGCTCG CGCTCGTCGT GCTCCTCCCC GCGCTGCTCG CCGTGGCCGT CGCGATCAAG
CTCGACGACG GCGGGCCCGT GCTGTTCCGT CAGGAGCGCG TCGGCAACCG TGAGCAGCTG
TTCACCATGT ACAAGTTCCG CACGATGGTG GTCGACGCCG AGGCGCGGCT CGCGGCGCTG
CAGGAGCGCA ACCAGGGCGC GGGCGTGCTG TTCAAGATGA CCGACGACCC GCGCGTCACG
CGCGTGGGCC GCGTGCTGCG CGCCTGGTCG CTCGACGAGC TGCCGCAGTT CCTCAACGCG
CTGCTCGGGA CCATGTCGGT CGTCGGGCCG CGGCCCCCGC TGCCGCGCGA GGTCGCGCTC
TACGACGGCG ACGTCCACCG GCGCCTGCTG TCCAAGCCGG GGATCACCGG CCTGTGGCAG
GTCAGCGGGC GGTCCGACCT GACGTGGGAG GAGAGCGTGC AGCTCGACCT GTCCTACGTC
GAGAACTGGT CGCTGTCCGG GGACCTCATG ATCATCCTGC GCACGTTCCG CAGCGTGCTG
GCGCGTGCCG GGGCGTACTG A
 
Protein sequence
MTAEHGATTH ARAAARRRTG APRDVPVTTP AAPPAPEAPP PHALRWDPAR MHTRGPRRPV 
WLVRFHALLI ANDTAVVVAA TALGAWLWGG PRPVTFFGAP VPTVSWLAAV VAIWLVALAA
VRSRSELILA VGVTELQRVL NASVFALAAV MSTAYLGDAQ IPRGTLAGAF GSGLLGLMVT
RLAWRHRLIA WRAGGRCKRN ALLVGPHRDV VRLLGDLRRN HRAGFRVVGI ALTDVDPAPD
ARIDDVETFG LEELVDRAHH PRVTSVVLAG DLPGGRAAIR RLGWSLEGAA TELVLPSRLT
YVAGPRIHLR PVEGMPLVHL SLPTYTGVAH VAKRGVDVVV ASLALVVLLP ALLAVAVAIK
LDDGGPVLFR QERVGNREQL FTMYKFRTMV VDAEARLAAL QERNQGAGVL FKMTDDPRVT
RVGRVLRAWS LDELPQFLNA LLGTMSVVGP RPPLPREVAL YDGDVHRRLL SKPGITGLWQ
VSGRSDLTWE ESVQLDLSYV ENWSLSGDLM IILRTFRSVL ARAGAY