Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0991 |
Symbol | |
ID | 9144866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1099034 |
End bp | 1102084 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | VanW family protein |
Protein accession | YP_003636096 |
Protein GI | 296128846 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.632754 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGGGA CCGACCGCGA CACGCCGACC GGCGGCGCCG ACCGCGACTC CGCCGACGTG CCCCGCGGCG GTGTGCCGCA GCCCCACGAC GCGCCGGAGG CCACCGTCCC CGAGCCGCCC ACGTCCGACG CCACCACGGA CGGCTCGACC GGTACCGACG CGCCGGCGGC CGACGCCGAC GGGACGGCGG AGGCCACCGA CGCCCCGGCG GCGGACGCCG CCACCCAGGA CGCCCCGGCG GGTGCGGACG CCGTCGACGC ACCCGTGACC ACCGGCGAGG ACGGTCCGCG CGGTTCGACG CCGCCGCCCG CGGTCCCGCC GCCGGTCACC CGCGCGCGCC CGCTCGCGCC GGAGCCCGCC GCGCCCCTGG CCGCGTGGCC GTCGTGGGAC GACGTCGCGG TCACCGCGCC ACCGCCGGAG CCGGCGCCGC CGGCGGCCGG GAGCGCCGAG CCCGCGGCGC CGGAGCCCGC CGCCGAGCCG TCCGCCACCC AGGCGTCGGA CGCCGGGCAG GAGCCGACCG CCCAGGCGCC CGCCCGACCG GGTCGTGCCG ACGAGCGTGC CGACGAGCCC GAGGCGCCGG CCGCCGGACC GCGGTCCGGG TCCGTGCCGA CCGGGACCGC GTCTGCCGGG ACTGCGTCTG CCGCCCCCGC TCCGGTGCCC GGCGCCCCCG GGCCCCTGCC GCCGCTCGAT CTCGGCAGCC TGTCGGCCCT GGCCTTCACG GCGCTCAACG CGGCATCGGT GGGCGTGCCC GCCCACGCGC TGCCGAAGGA CGCCCCGGCC CGGCCGGGTG ATGCCGACGC CGACGGTCCC GACGGCGCCG CTGCCGCGGG CCCCGAGGCA TCCGGGCCGG CCGCGGACAC CGAAGGGTCG GCCACCGAGA ACGAGGCCGC CGCCGCGCCG GACGACGCCA CGGACGCCTC CGCGCAGGCA GCGGGCGAGC AGGGGCTCGC GGGGGACGCA GCACCGGGAG CCGGCTCCGC GACCGGTGCG CCCACGCCGG TCGACGTGGA GCGGGTTGCC GCGGAGCCGG TCGACGCCGA GCCGGTCGCC GCGGAGCCGG TCGACGCCGA GCCGGCTGCC GAGCGTGCCG ACGAGGTCGA GGCCGTGCCG TCGGTGCCGC GCACCGACGA CACGGCCGTC GTCCCACCGA CGCCGCGGCC CGACGAGACG ACGGTCCTGC CGGCCGCCGG TACGGACGTC GCGACGCGTG CGACCCCCGC GGCCCCGACC GTCCCGCCGC GACGGACGTC CGCGCTCGCC GCACCGCAGG CACCGGCGGA GCCGGTCGCC CCGGCCGGAC CCGAGCCCGA CGACGCGTCG CCGCTCGCCG TCTTCGAGCC CGAGGAGTCG GACCGCCGGT GGCCGCGGGC GCTGGCGATC ACGGGTGGTG CACTGGCCCT GCTCGCCGCT GTCTACGTCG GGTCGTCGTT CGCGCTCGCG GACCGGGTGC CGCGCGGCGC GACGGTCGCG GGCGTCGAGA TCGGCGGCCT GTCGTCGGCG CAGGCCGAGC AGCACCTGCG CGACGAGCTC GCCGAGCGCA CGACGTCCCC CGTGGCGGTC GTCGCGCAGG AGGTGCAGGC GGAGGTGGAC CCGGTCGCCG CGGGCCTGGA GCTGGACGCT GCCGCGACCG TCGCGCGACT GACGGGCGTG GACCTGGCGC AGCCCGCACG CGTGTGGCGC CACGTGGTCG GTGTGGGGGA GCAGCGGCCC GTCACCGTCG CTGACGAGTC CGCGCTCGAC ACGGCGCTCA CGCAGCTGTC GGGCTCGCTC GTGCTCGCCC CGGTCGACGG CACGGTCGTG TTCGCCGACG GTGCGGCGCA CTCGACCGAC GCGGTCGACG GCTGGGAGCT CGACACCGCG GGTGCCGCGG CTGTCCTCGA GGACGGCTGG CTCACGGCCG AGCAGCCGGT CGAGCTGCCC ACGAGCGCCG TGCCCCCGGC GGTGACGCAG GAGGAGACCG ACCGCGCGCT CGCCGAGCTC GCGCAGCCGC TCGCCGCCGC CCCGGTGACG GTGCAGGTGG CCGACCGCCA GGCGGTCCTC GACGTCGCGA CCCTCACGGC GCACGCGGCC GTCGTGCCCG TCGACGGTCA GCTCCAGCTC CAGCTCGACG GCGAGGCGCT GTCGCAGTCC GTGCTCGCCC AGCTGCCGGA CCTGCTCACC TCGGCGTCCG ACGCGCGCTT CGAGTTCCAG GGCGGCGCGC CGGTGATCGT GCCCGGGACG CCGGGCACGA CGCTCGACCC CGCGACGCTG TCCGCGTCGG TCGCGCAGGC GGCCACCGCG GGCGAGGGAC GGCTCGCCGC GGTCGACCTC GTGGAGTCCG ACCCGGCGGA GACGACTGCT GCGCTCGAGG CGCTCGGCGT CAAGGAGATC GTCTCGGAGT TCTCCACCCC GCTGACCAGC GAGCCGCGGC GCACGTCGAA CATCGCGACC GGCCTGCGGA ACATCACCGG GACGCTCGTG CGCCCCGGCG AGGTCTTCAG CCTCACGGAG GCGCTGGGGC CTGTCGACGC CGCCCACGGG TTCGTCCAGG CCGGCGCGAT CGTCAACGGG GAGCACACGG ACGCGTGGGG CGGTGGCCTG TCGCAGGTCT CGACCACGGC GTTCAACGCG GGGTACTTCG CCGGTTACGA GGACGTCGAG CACAAGCCCC ACAGCGAGTG GTTCCAGCGT TACCCCGAGG GGCGGGAGGC CACGATCTTC ACCGGCGTGC TCGACATGAG GTGGCGGAAC AACACGCCGT ACGGCGCGCT CGTGCAGGGT TTCGTGGCCG ACGGTCGCGC GCACGTGCGC ATCTGGAGCA CCAAGCACTT CACGGTCGAG ACCGAGAAGA GCGGTCGCTC GGGCGTGGTG GCCCCGACCA CGGTCTACTC GCAGTCGCCG ACGTGCGAGC CGCAGAGCGC GGGCAACCCG GGTTTCACGG TGACCAACAC GCGCAAGGTG TACCTCAACG GTGAGCTCGT CGCGACCGAG CCGTTCACGT GGCGCTACAA GCCGCAGAAC AAGGTCATCT GCGGCACCGC GCCGGCGCCG GGAGCGTCCC CCACGCCCTG A
|
Protein sequence | MHGTDRDTPT GGADRDSADV PRGGVPQPHD APEATVPEPP TSDATTDGST GTDAPAADAD GTAEATDAPA ADAATQDAPA GADAVDAPVT TGEDGPRGST PPPAVPPPVT RARPLAPEPA APLAAWPSWD DVAVTAPPPE PAPPAAGSAE PAAPEPAAEP SATQASDAGQ EPTAQAPARP GRADERADEP EAPAAGPRSG SVPTGTASAG TASAAPAPVP GAPGPLPPLD LGSLSALAFT ALNAASVGVP AHALPKDAPA RPGDADADGP DGAAAAGPEA SGPAADTEGS ATENEAAAAP DDATDASAQA AGEQGLAGDA APGAGSATGA PTPVDVERVA AEPVDAEPVA AEPVDAEPAA ERADEVEAVP SVPRTDDTAV VPPTPRPDET TVLPAAGTDV ATRATPAAPT VPPRRTSALA APQAPAEPVA PAGPEPDDAS PLAVFEPEES DRRWPRALAI TGGALALLAA VYVGSSFALA DRVPRGATVA GVEIGGLSSA QAEQHLRDEL AERTTSPVAV VAQEVQAEVD PVAAGLELDA AATVARLTGV DLAQPARVWR HVVGVGEQRP VTVADESALD TALTQLSGSL VLAPVDGTVV FADGAAHSTD AVDGWELDTA GAAAVLEDGW LTAEQPVELP TSAVPPAVTQ EETDRALAEL AQPLAAAPVT VQVADRQAVL DVATLTAHAA VVPVDGQLQL QLDGEALSQS VLAQLPDLLT SASDARFEFQ GGAPVIVPGT PGTTLDPATL SASVAQAATA GEGRLAAVDL VESDPAETTA ALEALGVKEI VSEFSTPLTS EPRRTSNIAT GLRNITGTLV RPGEVFSLTE ALGPVDAAHG FVQAGAIVNG EHTDAWGGGL SQVSTTAFNA GYFAGYEDVE HKPHSEWFQR YPEGREATIF TGVLDMRWRN NTPYGALVQG FVADGRAHVR IWSTKHFTVE TEKSGRSGVV APTTVYSQSP TCEPQSAGNP GFTVTNTRKV YLNGELVATE PFTWRYKPQN KVICGTAPAP GASPTP
|
| |