Gene Cfla_3154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3154 
Symbol 
ID9147069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3504315 
End bp3506177 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content70% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionYP_003638235 
Protein GI296130985 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00273511 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCCACGA CCTTCGACCG ACTGATCCTC GACCCACGCC GTACCGACGA CACCCGCCTG 
CGCCCGCAGA GCACCCGCTC GCCCGAGAAC GAGGCCACCC AGAGCCCGTC GCTCGTGCTG
CTGGTGCTGC TGGCCGCGGC CGGCATCGTC GTGTACGCCG CGTTCCTGCT GAACCCCGCC
AACCGGGGTG ACTTCCTGCC GTACGTGCTG GTCATCGTCG CCGAGACCGT CCTGGTCGCC
CACGCCCTGC TGGCGATGTG GACCGTGCTC TCGGCCGGCT GGAACCCCCG CGGGTTCACG
TTCCATCACT CGCAGGAGCG GCTGTACGAC CTCGCGGAGA TCATCCGCGA CGGCGCCGAG
CACGAGCCGT GGCGCTGGCA GATGTACATG GACGACCGCC CCGTCGAGGT CGACGTCTTC
ATCACGACGT ACGGCGAGGA CCTCGAGACC ATCCGCCGGA CGGTCACCGC GGCCCTGCGG
ATCCAGGGCA GGCACCACAC GTGGGTGCTC GACGACGGCC GCTCCGACGA CGTGCGCGAC
CTGGCCGCCG AGCTCGGTGC GCGCTACGTG CGCCGGCTGT CCAGCGGCGG CGCCAAGGCG
GGCAACATCA ACCACGCCCT GTCCCTCGCG CGCGGCGACT ACTTCGCGGT GTTCGACGCG
GACTTCGTGC CCCGGCCCGG GTTCCTGCAC GAGACCGTGC CGTTCTTCGC GACGCAGGAC
GTCGCGTTCG TCCAGACGCC GCAGACGTAC GGCAACTACG ACAACGTCAT CAGCCGCGGC
GCCGGTTACA TGCAGGCGGT CTTCTACCGG TTCGTGCAGC CAGGCCGGAA CAGGTTCAAC
GCCGCGTTCT GCGTCGGCAC CAACGTCATC TACCGGCGGT CGGCGGTCGA CGCGATCGGT
GGCATCTACA CCGACTCCAA GTCGGAGGAC GTGTGGACGT CGCTCATGCT GCACGAGCGC
GGCTGGCGCA CGGTCTACAT CCCCACGACG CTCGCGGTCG GCGACACCCC CGAGACCGTC
GAGGCGTACA CCAAGCAGCA GCTGCGCTGG GCGACCGGCG GCTTCGAGAT CATGCTCACG
CACAACCCGC TGTCCCGGAA GCGCAACCTC ACGATGGACC AGCGCATCCA GTACCTCGTG
ACGGCGACGC ACTACCTGAC GGGCATCGCC CCGCTGCTGC TGCTGCTCGT GCCGCCGCTG
GAGATCTACT TCGACCTGTC CCCGATGGAC CTGACCATCA CGCCCGCGAC GTGGGCGCTG
TACTACGCCG GGTTCTACGT GCTGCAGATC CTGCTCGCGT TCTACACGCT CGGGTCGTTC
CGGTGGGAGG TGCTGCTGCT CGCGTCGGTG TCCTTCCCGA TCTACGTGCG TGCGCTGGTC
AACGCGGTGC TGCGCCGCGA GCAGGCGTGG CACGTCACAG GACGCAAGGG CGCGTACCGC
TCACCGTTCG CGTTCATGGT GCCGCAGGTG CTGTTCTTCC AGTTCCTGCT CCTGACGACC
GTGGTCGCCG CGTGGAAGAC GTACACGTCG GGCGTGTTCA CGCTCGCGCT CGCCTGGAAC
GCCACCAACA CGGTCATCCT GGGCGGCTTC ATGGTGACCG CGTGGCGCGA GGGGCGCCGC
GGACGCGCCG AGGCGCGCGC CCGGCAGCGG GCGCTCGCGA CCCACGACCG AGCCGTCGAC
GACCTCGCGG CCGACGCGGT CCTGCTCGAG CTGGAGGACG CCCGTACGCC CGCGGCCCGT
CTGCTGGAGC GCGCCGAGCT CCTGCGGGCC GACAGCCCCG CAGCAGACGC GCCCCGCCCC
GCCGGCGTCG AGCGGCCCGC CGGGCCTGCC GCCGCCCCCG CCGACCGAGA GGTGCAGTCA
TGA
 
Protein sequence
MATTFDRLIL DPRRTDDTRL RPQSTRSPEN EATQSPSLVL LVLLAAAGIV VYAAFLLNPA 
NRGDFLPYVL VIVAETVLVA HALLAMWTVL SAGWNPRGFT FHHSQERLYD LAEIIRDGAE
HEPWRWQMYM DDRPVEVDVF ITTYGEDLET IRRTVTAALR IQGRHHTWVL DDGRSDDVRD
LAAELGARYV RRLSSGGAKA GNINHALSLA RGDYFAVFDA DFVPRPGFLH ETVPFFATQD
VAFVQTPQTY GNYDNVISRG AGYMQAVFYR FVQPGRNRFN AAFCVGTNVI YRRSAVDAIG
GIYTDSKSED VWTSLMLHER GWRTVYIPTT LAVGDTPETV EAYTKQQLRW ATGGFEIMLT
HNPLSRKRNL TMDQRIQYLV TATHYLTGIA PLLLLLVPPL EIYFDLSPMD LTITPATWAL
YYAGFYVLQI LLAFYTLGSF RWEVLLLASV SFPIYVRALV NAVLRREQAW HVTGRKGAYR
SPFAFMVPQV LFFQFLLLTT VVAAWKTYTS GVFTLALAWN ATNTVILGGF MVTAWREGRR
GRAEARARQR ALATHDRAVD DLAADAVLLE LEDARTPAAR LLERAELLRA DSPAADAPRP
AGVERPAGPA AAPADREVQS