Gene Cfla_0991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0991 
Symbol 
ID9144866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1099034 
End bp1102084 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content78% 
IMG OID 
ProductVanW family protein 
Protein accessionYP_003636096 
Protein GI296128846 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.632754 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGGA CCGACCGCGA CACGCCGACC GGCGGCGCCG ACCGCGACTC CGCCGACGTG 
CCCCGCGGCG GTGTGCCGCA GCCCCACGAC GCGCCGGAGG CCACCGTCCC CGAGCCGCCC
ACGTCCGACG CCACCACGGA CGGCTCGACC GGTACCGACG CGCCGGCGGC CGACGCCGAC
GGGACGGCGG AGGCCACCGA CGCCCCGGCG GCGGACGCCG CCACCCAGGA CGCCCCGGCG
GGTGCGGACG CCGTCGACGC ACCCGTGACC ACCGGCGAGG ACGGTCCGCG CGGTTCGACG
CCGCCGCCCG CGGTCCCGCC GCCGGTCACC CGCGCGCGCC CGCTCGCGCC GGAGCCCGCC
GCGCCCCTGG CCGCGTGGCC GTCGTGGGAC GACGTCGCGG TCACCGCGCC ACCGCCGGAG
CCGGCGCCGC CGGCGGCCGG GAGCGCCGAG CCCGCGGCGC CGGAGCCCGC CGCCGAGCCG
TCCGCCACCC AGGCGTCGGA CGCCGGGCAG GAGCCGACCG CCCAGGCGCC CGCCCGACCG
GGTCGTGCCG ACGAGCGTGC CGACGAGCCC GAGGCGCCGG CCGCCGGACC GCGGTCCGGG
TCCGTGCCGA CCGGGACCGC GTCTGCCGGG ACTGCGTCTG CCGCCCCCGC TCCGGTGCCC
GGCGCCCCCG GGCCCCTGCC GCCGCTCGAT CTCGGCAGCC TGTCGGCCCT GGCCTTCACG
GCGCTCAACG CGGCATCGGT GGGCGTGCCC GCCCACGCGC TGCCGAAGGA CGCCCCGGCC
CGGCCGGGTG ATGCCGACGC CGACGGTCCC GACGGCGCCG CTGCCGCGGG CCCCGAGGCA
TCCGGGCCGG CCGCGGACAC CGAAGGGTCG GCCACCGAGA ACGAGGCCGC CGCCGCGCCG
GACGACGCCA CGGACGCCTC CGCGCAGGCA GCGGGCGAGC AGGGGCTCGC GGGGGACGCA
GCACCGGGAG CCGGCTCCGC GACCGGTGCG CCCACGCCGG TCGACGTGGA GCGGGTTGCC
GCGGAGCCGG TCGACGCCGA GCCGGTCGCC GCGGAGCCGG TCGACGCCGA GCCGGCTGCC
GAGCGTGCCG ACGAGGTCGA GGCCGTGCCG TCGGTGCCGC GCACCGACGA CACGGCCGTC
GTCCCACCGA CGCCGCGGCC CGACGAGACG ACGGTCCTGC CGGCCGCCGG TACGGACGTC
GCGACGCGTG CGACCCCCGC GGCCCCGACC GTCCCGCCGC GACGGACGTC CGCGCTCGCC
GCACCGCAGG CACCGGCGGA GCCGGTCGCC CCGGCCGGAC CCGAGCCCGA CGACGCGTCG
CCGCTCGCCG TCTTCGAGCC CGAGGAGTCG GACCGCCGGT GGCCGCGGGC GCTGGCGATC
ACGGGTGGTG CACTGGCCCT GCTCGCCGCT GTCTACGTCG GGTCGTCGTT CGCGCTCGCG
GACCGGGTGC CGCGCGGCGC GACGGTCGCG GGCGTCGAGA TCGGCGGCCT GTCGTCGGCG
CAGGCCGAGC AGCACCTGCG CGACGAGCTC GCCGAGCGCA CGACGTCCCC CGTGGCGGTC
GTCGCGCAGG AGGTGCAGGC GGAGGTGGAC CCGGTCGCCG CGGGCCTGGA GCTGGACGCT
GCCGCGACCG TCGCGCGACT GACGGGCGTG GACCTGGCGC AGCCCGCACG CGTGTGGCGC
CACGTGGTCG GTGTGGGGGA GCAGCGGCCC GTCACCGTCG CTGACGAGTC CGCGCTCGAC
ACGGCGCTCA CGCAGCTGTC GGGCTCGCTC GTGCTCGCCC CGGTCGACGG CACGGTCGTG
TTCGCCGACG GTGCGGCGCA CTCGACCGAC GCGGTCGACG GCTGGGAGCT CGACACCGCG
GGTGCCGCGG CTGTCCTCGA GGACGGCTGG CTCACGGCCG AGCAGCCGGT CGAGCTGCCC
ACGAGCGCCG TGCCCCCGGC GGTGACGCAG GAGGAGACCG ACCGCGCGCT CGCCGAGCTC
GCGCAGCCGC TCGCCGCCGC CCCGGTGACG GTGCAGGTGG CCGACCGCCA GGCGGTCCTC
GACGTCGCGA CCCTCACGGC GCACGCGGCC GTCGTGCCCG TCGACGGTCA GCTCCAGCTC
CAGCTCGACG GCGAGGCGCT GTCGCAGTCC GTGCTCGCCC AGCTGCCGGA CCTGCTCACC
TCGGCGTCCG ACGCGCGCTT CGAGTTCCAG GGCGGCGCGC CGGTGATCGT GCCCGGGACG
CCGGGCACGA CGCTCGACCC CGCGACGCTG TCCGCGTCGG TCGCGCAGGC GGCCACCGCG
GGCGAGGGAC GGCTCGCCGC GGTCGACCTC GTGGAGTCCG ACCCGGCGGA GACGACTGCT
GCGCTCGAGG CGCTCGGCGT CAAGGAGATC GTCTCGGAGT TCTCCACCCC GCTGACCAGC
GAGCCGCGGC GCACGTCGAA CATCGCGACC GGCCTGCGGA ACATCACCGG GACGCTCGTG
CGCCCCGGCG AGGTCTTCAG CCTCACGGAG GCGCTGGGGC CTGTCGACGC CGCCCACGGG
TTCGTCCAGG CCGGCGCGAT CGTCAACGGG GAGCACACGG ACGCGTGGGG CGGTGGCCTG
TCGCAGGTCT CGACCACGGC GTTCAACGCG GGGTACTTCG CCGGTTACGA GGACGTCGAG
CACAAGCCCC ACAGCGAGTG GTTCCAGCGT TACCCCGAGG GGCGGGAGGC CACGATCTTC
ACCGGCGTGC TCGACATGAG GTGGCGGAAC AACACGCCGT ACGGCGCGCT CGTGCAGGGT
TTCGTGGCCG ACGGTCGCGC GCACGTGCGC ATCTGGAGCA CCAAGCACTT CACGGTCGAG
ACCGAGAAGA GCGGTCGCTC GGGCGTGGTG GCCCCGACCA CGGTCTACTC GCAGTCGCCG
ACGTGCGAGC CGCAGAGCGC GGGCAACCCG GGTTTCACGG TGACCAACAC GCGCAAGGTG
TACCTCAACG GTGAGCTCGT CGCGACCGAG CCGTTCACGT GGCGCTACAA GCCGCAGAAC
AAGGTCATCT GCGGCACCGC GCCGGCGCCG GGAGCGTCCC CCACGCCCTG A
 
Protein sequence
MHGTDRDTPT GGADRDSADV PRGGVPQPHD APEATVPEPP TSDATTDGST GTDAPAADAD 
GTAEATDAPA ADAATQDAPA GADAVDAPVT TGEDGPRGST PPPAVPPPVT RARPLAPEPA
APLAAWPSWD DVAVTAPPPE PAPPAAGSAE PAAPEPAAEP SATQASDAGQ EPTAQAPARP
GRADERADEP EAPAAGPRSG SVPTGTASAG TASAAPAPVP GAPGPLPPLD LGSLSALAFT
ALNAASVGVP AHALPKDAPA RPGDADADGP DGAAAAGPEA SGPAADTEGS ATENEAAAAP
DDATDASAQA AGEQGLAGDA APGAGSATGA PTPVDVERVA AEPVDAEPVA AEPVDAEPAA
ERADEVEAVP SVPRTDDTAV VPPTPRPDET TVLPAAGTDV ATRATPAAPT VPPRRTSALA
APQAPAEPVA PAGPEPDDAS PLAVFEPEES DRRWPRALAI TGGALALLAA VYVGSSFALA
DRVPRGATVA GVEIGGLSSA QAEQHLRDEL AERTTSPVAV VAQEVQAEVD PVAAGLELDA
AATVARLTGV DLAQPARVWR HVVGVGEQRP VTVADESALD TALTQLSGSL VLAPVDGTVV
FADGAAHSTD AVDGWELDTA GAAAVLEDGW LTAEQPVELP TSAVPPAVTQ EETDRALAEL
AQPLAAAPVT VQVADRQAVL DVATLTAHAA VVPVDGQLQL QLDGEALSQS VLAQLPDLLT
SASDARFEFQ GGAPVIVPGT PGTTLDPATL SASVAQAATA GEGRLAAVDL VESDPAETTA
ALEALGVKEI VSEFSTPLTS EPRRTSNIAT GLRNITGTLV RPGEVFSLTE ALGPVDAAHG
FVQAGAIVNG EHTDAWGGGL SQVSTTAFNA GYFAGYEDVE HKPHSEWFQR YPEGREATIF
TGVLDMRWRN NTPYGALVQG FVADGRAHVR IWSTKHFTVE TEKSGRSGVV APTTVYSQSP
TCEPQSAGNP GFTVTNTRKV YLNGELVATE PFTWRYKPQN KVICGTAPAP GASPTP