Gene Cfla_0155 details

Gene Information

Locus tag: Cfla_0155
Symbol:
ID: 9144021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 190622
End bp: 192124
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: von Willebrand factor type A
Protein accession: YP_003635273
Protein GI: 296128023
COG category:
COG ID:
TIGRFAM ID:
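
The length fields in this table follow from the coordinates. A minimal sketch of that arithmetic, assuming Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start_bp = 190622
end_bp = 192124

gene_length = end_bp - start_bp + 1   # inclusive span in bp
codons = gene_length // 3             # complete codons, stop codon included
protein_length = codons - 1           # drop the terminal stop codon

print(gene_length, "bp")    # 1503 bp
print(protein_length, "aa")  # 500 aa
```

Both results agree with the Gene Length and Protein Length fields listed above.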


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0565906
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGAGC ACGGCAGGGT GACGGGTCTG GTGGCCATGG TGTGCGGGGT GGCGGTCGCG 
CTGGGGGCGT GCTCGGCCGG CGGTTCCGCG GACGGGACGG CCGACTGGGA GGGCAGCGGC
GCGTACCAGC CCGGCCCGTA CCAGGAGGAC CTGCCGTACC CCGAGCCCGG CCCCACGGGG
CCGACCGCCG CCGGCATGAC CGACCCGGCG CGCGACGCGC TGTCGACGTT CGCGCTCGAC
GTGGACACCG GCGCGTACAC CCGGTTCCGC GACGCGGTGC GGCAGGGGTT CTCCGTGGAC
CCGTTCGGGG TGCGGACCGA GGAGTTCGTC AACTACTTCG CGCAGGACTA CGAGCCGCCC
GCCGAGGGGC TGGGTGTGAG CATCGACGCG ACCGCGCTGC CGTTCCGGCC CGACCACCGG
CTCGTGCGCG TGGGCATCAG CAGCGCGCCG GCGTCGGCGG TGTCGCGGGC CGACGCGGAC
CTCGTGCTCG TCGTGGACTG CTCCGGCTCG ATGGACGAGG CCGGGAAGAT GGAGACCACG
AAGTACGCGC TGCGCACCCT GGTGTCGTCG CTGCGGCGCA CCGACCGCGT CGCGATGGTC
TGCTACTCCA CCGAGGCCGA CGTCTACCTC GAGCCCACGC CCGTCGCCGA GCGTGAGGGC
GTGCTCGCCG CGATCGACCG GCTGGCGCCG CGGGACTCGA CGAACGCCGC CGCCGGCCTG
GCGCTCGGGT ACGACCTGGC GATGTCGATG CGCACCGAGG GGCGCCTCAC GCGCGTCGTC
CTGGTCAGCG ACGGTGTCGC GAACGTCGGC GAGACGGACC CCGAGGGCAT CCTCGCGCGC
ATCTCGTCGC AGGCGAAGGC CGGGATCAGC CTCATCTCGG TGGGGGTGGG CATCACGACG
TACAACGACC ACCTGCTCGA GCAGCTCGCC GACCAGGGCG ACGGCTGGCA CGTGTACGTC
GACGGCGAGG CGGAGGCCGA GCGGGTGTTC GCCACCGGCC TGACGGGCTC GCTCGTCGTG
GCCGGGACGG ACGCCCGCGC GCAGGTCGAG TTCGACCCCG CGCAGGTCGC CGGGTACCGG
CTCCTGGGCT ACGAGAACAG GGCGGTGGCC GACGAGGACT TCCGTAACGA CGCGGTCGAC
GGCGGCGAGG TCTTCGCCGG TCGCTCCACG ACCGCGCTCT ACGAGGTCGC CATGCGCGAG
GGGGCCGGGG ACGGGGCGTT CGTGCGGGCC ACCGTCCGCT ACCTCGACGA CGACGGGCGG
CCCGTGGAGC GTGACGCGTC GCTGAGCCGT GACGACTGCG CGGCGTCGCC CCGCGAGGCC
TCGCCGCGGC TGCGGCAGGA CCTCGTGGTC GCGCTGCTCA CCGACCACCT CACCGACGGT
CCGTGGTCGC AGGAGATCGC CCCGGCGGAC GTGCGCGCCG AGGCGCGCAC GTTGCTCGGG
GTGCTCGACG GCGACCGGGC CGTCCAGGAG CTCGTGGAGC TCGTGGACCG GGCCACCACC
TGA
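
The GC content field can be rechecked directly from the gene sequence above. The following is a sketch only: `gc_fraction` is a hypothetical helper, not something the source page provides, and it assumes the sequence is pasted as plain text with the spaces and line breaks shown here.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks used in the
    listing) is ignored, and the comparison is case-insensitive.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Usage: the first row of the gene sequence as an example input.
first_row = "ATGGGCGAGC ACGGCAGGGT GACGGGTCTG GTGGCCATGG TGTGCGGGGT GGCGGTCGCG"
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full 1503 bp sequence instead of a single row gives the value to compare with the GC content field.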
 
Protein sequence
MGEHGRVTGL VAMVCGVAVA LGACSAGGSA DGTADWEGSG AYQPGPYQED LPYPEPGPTG 
PTAAGMTDPA RDALSTFALD VDTGAYTRFR DAVRQGFSVD PFGVRTEEFV NYFAQDYEPP
AEGLGVSIDA TALPFRPDHR LVRVGISSAP ASAVSRADAD LVLVVDCSGS MDEAGKMETT
KYALRTLVSS LRRTDRVAMV CYSTEADVYL EPTPVAEREG VLAAIDRLAP RDSTNAAAGL
ALGYDLAMSM RTEGRLTRVV LVSDGVANVG ETDPEGILAR ISSQAKAGIS LISVGVGITT
YNDHLLEQLA DQGDGWHVYV DGEAEAERVF ATGLTGSLVV AGTDARAQVE FDPAQVAGYR
LLGYENRAVA DEDFRNDAVD GGEVFAGRST TALYEVAMRE GAGDGAFVRA TVRYLDDDGR
PVERDASLSR DDCAASPREA SPRLRQDLVV ALLTDHLTDG PWSQEIAPAD VRAEARTLLG
VLDGDRAVQE LVELVDRATT
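
The protein sequence can be reproduced from the gene sequence with the codon assignments of translation table 11. The sketch below is an assumption-laden illustration rather than the method used to build this record: `translate` and `CODON_TABLE` are hypothetical names, translation starts at the first base and stops at the first stop codon, and the alternative start codons that table 11 permits are not handled (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

# Codon-to-amino-acid assignments for NCBI translation table 11,
# written in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Usage: the first row of the gene sequence gives the first 20 residues.
first_row = "ATGGGCGAGC ACGGCAGGGT GACGGGTCTG GTGGCCATGG TGTGCGGGGT GGCGGTCGCG"
print(translate(first_row))  # MGEHGRVTGLVAMVCGVAVA
```

Building the table from two strings keeps all 64 assignments visible in four short rows, which makes it easy to compare against a published genetic-code listing.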