Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0958 |
Symbol | |
ID | 9144833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 1058917 |
End bp | 1062276 |
Gene Length | 3360 bp |
Protein Length | 1119 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003636064 |
Protein GI | 296128814 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.502077 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00295901 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTGTGGC GGACGCAGGT CCTGCTCGGA CGCCTGCGTG ACCAGGCGAC CGTCCTGGCG ACGGTCGCGC TCGTGACGTT CGTCGCCACG ACGCTGCTCG GCACGTTCGC GCTGCTGCTG GACGCCACCG GCGACGACGC CGTCGATGCC GCTCTCGGCC GGCTCCCGGA CTCCGCGATC ACGCTCGAGG CGACGATCCG GGTCAACAAC AAGGACACGC AGACGGCGCT CGACGCCGCG GGCGACACCC TCGCCGCGAT GCTGGGCGAC GTCCCCACCG AGCGCACCGC GTGGCTGACC GGCCGCACCT GGTCGCTGCC GCGCGTCGAG GGCGCACCCG TGGCACCCCT CGCCTACCCG GCGAGCACAC CGCTCGTCCC CGACCAGACC GAGCTGCTCA GCGGTACGTG GCCGGACGCC GCGCGCGACG ACGCCGGACG CCTCCTGGTC AACGTGCCGG GCGTCGCCGC CGAACGCTAC GGCTGGGCCG TCGGCACCGA GGTCCCCGTG CGGACGCTCG GCGGGCAGGC GGAGGACACC TGGCTCGTCG TCGGCACGCA CGAGATCACG GGCCCACCCG CGTCGTGGTC GCGCGATCCC CTGGGCGGCG CGGGGCACGA CGCCGCGTAC CCGGTGCCCG GCACGCTGGG CAAGCTCGTC ACGGACCTCT GGGGACCCGT GGTCGTCGCT CCCGAGGCAC TGCTGGGCCC CGGCGTCACC GAGCGAGCGC ACCTGCTCGT GCTCCCCGAC CTGACGGGTG CGCCCCGCGG CGCGCTCGCC ACCGCGCGCG ACTCGCTGAC GTCGGGCCAG GTCCGGCTGT CCGCCGCGCT CACCGACGTC GGCGTCAGCG GGTCGATCCG CACCGACCTC GGGACCACGA TCGACGCCGC CTGGCGCGAG CTGACCGTCA CGCGCGTCGG CGTGGTCGTC GTCGGCCTGC TGCTGGCCGT GCTGGCGACC ACCGTGATGC TGCAGGCCGC GCGCCTGCTC GGCGAACGGC GCGCCGCCGA GGGCGAGCTC GTGGCCGCGC GCGGCGCGTC GCCCGCGCAG CTGCGCTCGC TGGCCGTGCT CGAGGCGGCC CTGCTCGCCG TGCTCGTCAC GGGCACCGCG CCGTGGGCGG CGCGCGCGCT GTTCGCGCGG CTCGCCGACA CCGGGGGGAT GAGCGCGGCC GGCCTGACGG CACCGCCTGG GGTGCCACCG GCGGTGTGGT TCGCGTGCGC CGGGGTCGCG ACCGTGCTGG CCGTCGCACT GGTGGTGCCG TCCTGGCACG TCAGCGGCTC CTCCCACGCG AGCGCGCACG CGCACCTCGT GCGCACGGGC GCCGACGTCG CGCTCGTGGC GCTCGGCGGC GTCGCCCTGT GGCAGCTCCT CGACTACGGC GCGCCCCTGA CCCGCGGTGC CGACGGCCCC CGCCTCGACC CCGTGCTCGT CCTGGGCCCC GCGCTCGTCA CGCTCGCCGC GGCCGTCCTG GCGCTGCGCC TCGTGGGCCC CGTCGGCCGC GGCGCCGACG CGCTCGCGCG CCGGGGCACC ACGCTGGTCG TGCCGCTCGC CGCATGGCAG GTGGCGCGGC GCCCCGCCGC CGCGACGGGC ACGATCCTCG TGGTGGTGCT CGCGGTGGCC GCGGCCACGT TCTCCCACGC GTTCCTCGCG ACGTGGCGCC TGTCGCAGCT CGAGCAGGTC GACCTCGCGC TGGGCACCGA CGCGCGCATC GAGGGCGCGC GCGGCGAGCC GCTCGTGGTC TCGGCCGACG TCCGCGGCGC GCTGGCGGAC GCTCCCGGTG ACGCGGTGCT GCAGCCCGTC GTCGTGCGGA ACGTCGGCGT CGGCCGTGCG CTCGGTGCGG ACCGCGGCTC GTCCGCGATC GACGCCCGGG TCATCGGCGT CGACGCGAGC ACGCCCGACC TGCTGCGCGG GCGCGGCCCG GAGCCCTGGG AGGACGTCGT GCGGGACCTG CCGCACCCTC CCGGCTCGGC ACGGGCCCCG GAGCGCACCG CCACCGGCAC CGAGCTCCCC GGCGATCCGC AGTGGCTGCT CGCGCGCGTG ACCCCCGGCT CCGCGCCCGA GGCGAGCGGC CGGGCGTACC TGCGGATCGC GGTCGAGGAC GAGGCCGGCG CGCGTGCCTG GCTGGCCACG CCGGAGCTGC TCCTCGGCGA GCCGGTCGAC ATCGCCCTCG AGGTGCCCCG CGCGCGCGGT CCCCTGCGCG TCGTCGCCGC GTCGTTCGTC GTCGCGCTCG ACGGCACCCC GTTCGAGGTC GCCGTGGCGA CGAGCCCCCG CGACAAGCTC GGGCAGATCG GTCTCGCGGT GCACGACGTG CGCGTGCTGG ACCGCTCGGT GGACGCCGGG ACGCCCGACG AGGCGACGCT CGCGGCCGCG CAGGGCACCC CGGTCGACCT CTCCGGGGCC CCGTGGGAGG GTTCCGCGAC GAGCGGTGGG GTCGTCCGCG ACGTGCTCGT CGGGAGTGCG GCGACGGCCC CGGCGCCGTC GGGGCTGCCC GCCGACGCGC TCGTGCTCGA CGGCCTGTTC GACATGAGCA CGCTGGACGC CTCGACGGGC CGGCTCGTCG CGCACGCCTG GCCCGCGCAG GAGCGGGTCC GCGCGGTCGT GACCGAGTCG CTCGCGGAGC GCGCGGACCT CCGGGACGGC GCCGGGTTCG TGATGCGCAT CGGCGACGCG CAGGTGGACG TGTACGTCGA GGCGGTCGTC CCGTACGTCC CCGGGGCCCC GCGCGGCCCG GCGCTCCTGG TGGACCGCAC CGCGCTGGGC CGCGTCGTGA CCGAGGCGGC CGGTACGGAC CCGCTCCTGG ACGCCTGGTG GCTGGCCGCA CCCCCGGCGC AGACCGCGGA CCTCGCCGGT GCGGTCGCGC ACGCCACCGG GGGCCACGCC ACCGTGCGCA GCACCGAGCG CGCCGCCGCC GTGGCCGGTC CGCTGCGCGT GACGGTGCCC GCAGCGCTGT CGCTCGTGAC CGCGGCGACC GCGATGCTGG TGCTCGTCGG GCTGGGGTCG AGCGCGGCGG CCGTGGTCCG GTCGCGCCGG CTCGAGCTCG CCCGGCTGCA GGCGCTCGGC GCGTCGCGCC GCTCGCTGGT CGGCGGGCTG CTCGGCGAGC ACGCGCTGCT CGTGCTCCTC GGGGCCGGCA CGGGTGCGCT GATCGGGTAC GGGCTGTCGC GCGTCGTCGC GCCCGTCCTC ACGGTGTCCG GGGACGGCCG CAGACCCGTG CCCGCGCCCG TGGTCGACTG GCAGGCCGAG GAGACCTTCG CGATCACCGC GGGCCTCGCG CTGGCCGGGT GCGCGGTGGT CGCGCTGCTC GCCACCGTGC TGGTACGACG CGCGTCCGGC GCGCTGCTGC GACTGGGGGA CGACCGATGA
|
Protein sequence | MVWRTQVLLG RLRDQATVLA TVALVTFVAT TLLGTFALLL DATGDDAVDA ALGRLPDSAI TLEATIRVNN KDTQTALDAA GDTLAAMLGD VPTERTAWLT GRTWSLPRVE GAPVAPLAYP ASTPLVPDQT ELLSGTWPDA ARDDAGRLLV NVPGVAAERY GWAVGTEVPV RTLGGQAEDT WLVVGTHEIT GPPASWSRDP LGGAGHDAAY PVPGTLGKLV TDLWGPVVVA PEALLGPGVT ERAHLLVLPD LTGAPRGALA TARDSLTSGQ VRLSAALTDV GVSGSIRTDL GTTIDAAWRE LTVTRVGVVV VGLLLAVLAT TVMLQAARLL GERRAAEGEL VAARGASPAQ LRSLAVLEAA LLAVLVTGTA PWAARALFAR LADTGGMSAA GLTAPPGVPP AVWFACAGVA TVLAVALVVP SWHVSGSSHA SAHAHLVRTG ADVALVALGG VALWQLLDYG APLTRGADGP RLDPVLVLGP ALVTLAAAVL ALRLVGPVGR GADALARRGT TLVVPLAAWQ VARRPAAATG TILVVVLAVA AATFSHAFLA TWRLSQLEQV DLALGTDARI EGARGEPLVV SADVRGALAD APGDAVLQPV VVRNVGVGRA LGADRGSSAI DARVIGVDAS TPDLLRGRGP EPWEDVVRDL PHPPGSARAP ERTATGTELP GDPQWLLARV TPGSAPEASG RAYLRIAVED EAGARAWLAT PELLLGEPVD IALEVPRARG PLRVVAASFV VALDGTPFEV AVATSPRDKL GQIGLAVHDV RVLDRSVDAG TPDEATLAAA QGTPVDLSGA PWEGSATSGG VVRDVLVGSA ATAPAPSGLP ADALVLDGLF DMSTLDASTG RLVAHAWPAQ ERVRAVVTES LAERADLRDG AGFVMRIGDA QVDVYVEAVV PYVPGAPRGP ALLVDRTALG RVVTEAAGTD PLLDAWWLAA PPAQTADLAG AVAHATGGHA TVRSTERAAA VAGPLRVTVP AALSLVTAAT AMLVLVGLGS SAAAVVRSRR LELARLQALG ASRRSLVGGL LGEHALLVLL GAGTGALIGY GLSRVVAPVL TVSGDGRRPV PAPVVDWQAE ETFAITAGLA LAGCAVVALL ATVLVRRASG ALLRLGDDR
|
| |