Gene Cfla_0958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0958 
Symbol 
ID9144833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1058917 
End bp1062276 
Gene Length3360 bp 
Protein Length1119 aa 
Translation table11 
GC content78% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003636064 
Protein GI296128814 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.502077 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00295901 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTGTGGC GGACGCAGGT CCTGCTCGGA CGCCTGCGTG ACCAGGCGAC CGTCCTGGCG 
ACGGTCGCGC TCGTGACGTT CGTCGCCACG ACGCTGCTCG GCACGTTCGC GCTGCTGCTG
GACGCCACCG GCGACGACGC CGTCGATGCC GCTCTCGGCC GGCTCCCGGA CTCCGCGATC
ACGCTCGAGG CGACGATCCG GGTCAACAAC AAGGACACGC AGACGGCGCT CGACGCCGCG
GGCGACACCC TCGCCGCGAT GCTGGGCGAC GTCCCCACCG AGCGCACCGC GTGGCTGACC
GGCCGCACCT GGTCGCTGCC GCGCGTCGAG GGCGCACCCG TGGCACCCCT CGCCTACCCG
GCGAGCACAC CGCTCGTCCC CGACCAGACC GAGCTGCTCA GCGGTACGTG GCCGGACGCC
GCGCGCGACG ACGCCGGACG CCTCCTGGTC AACGTGCCGG GCGTCGCCGC CGAACGCTAC
GGCTGGGCCG TCGGCACCGA GGTCCCCGTG CGGACGCTCG GCGGGCAGGC GGAGGACACC
TGGCTCGTCG TCGGCACGCA CGAGATCACG GGCCCACCCG CGTCGTGGTC GCGCGATCCC
CTGGGCGGCG CGGGGCACGA CGCCGCGTAC CCGGTGCCCG GCACGCTGGG CAAGCTCGTC
ACGGACCTCT GGGGACCCGT GGTCGTCGCT CCCGAGGCAC TGCTGGGCCC CGGCGTCACC
GAGCGAGCGC ACCTGCTCGT GCTCCCCGAC CTGACGGGTG CGCCCCGCGG CGCGCTCGCC
ACCGCGCGCG ACTCGCTGAC GTCGGGCCAG GTCCGGCTGT CCGCCGCGCT CACCGACGTC
GGCGTCAGCG GGTCGATCCG CACCGACCTC GGGACCACGA TCGACGCCGC CTGGCGCGAG
CTGACCGTCA CGCGCGTCGG CGTGGTCGTC GTCGGCCTGC TGCTGGCCGT GCTGGCGACC
ACCGTGATGC TGCAGGCCGC GCGCCTGCTC GGCGAACGGC GCGCCGCCGA GGGCGAGCTC
GTGGCCGCGC GCGGCGCGTC GCCCGCGCAG CTGCGCTCGC TGGCCGTGCT CGAGGCGGCC
CTGCTCGCCG TGCTCGTCAC GGGCACCGCG CCGTGGGCGG CGCGCGCGCT GTTCGCGCGG
CTCGCCGACA CCGGGGGGAT GAGCGCGGCC GGCCTGACGG CACCGCCTGG GGTGCCACCG
GCGGTGTGGT TCGCGTGCGC CGGGGTCGCG ACCGTGCTGG CCGTCGCACT GGTGGTGCCG
TCCTGGCACG TCAGCGGCTC CTCCCACGCG AGCGCGCACG CGCACCTCGT GCGCACGGGC
GCCGACGTCG CGCTCGTGGC GCTCGGCGGC GTCGCCCTGT GGCAGCTCCT CGACTACGGC
GCGCCCCTGA CCCGCGGTGC CGACGGCCCC CGCCTCGACC CCGTGCTCGT CCTGGGCCCC
GCGCTCGTCA CGCTCGCCGC GGCCGTCCTG GCGCTGCGCC TCGTGGGCCC CGTCGGCCGC
GGCGCCGACG CGCTCGCGCG CCGGGGCACC ACGCTGGTCG TGCCGCTCGC CGCATGGCAG
GTGGCGCGGC GCCCCGCCGC CGCGACGGGC ACGATCCTCG TGGTGGTGCT CGCGGTGGCC
GCGGCCACGT TCTCCCACGC GTTCCTCGCG ACGTGGCGCC TGTCGCAGCT CGAGCAGGTC
GACCTCGCGC TGGGCACCGA CGCGCGCATC GAGGGCGCGC GCGGCGAGCC GCTCGTGGTC
TCGGCCGACG TCCGCGGCGC GCTGGCGGAC GCTCCCGGTG ACGCGGTGCT GCAGCCCGTC
GTCGTGCGGA ACGTCGGCGT CGGCCGTGCG CTCGGTGCGG ACCGCGGCTC GTCCGCGATC
GACGCCCGGG TCATCGGCGT CGACGCGAGC ACGCCCGACC TGCTGCGCGG GCGCGGCCCG
GAGCCCTGGG AGGACGTCGT GCGGGACCTG CCGCACCCTC CCGGCTCGGC ACGGGCCCCG
GAGCGCACCG CCACCGGCAC CGAGCTCCCC GGCGATCCGC AGTGGCTGCT CGCGCGCGTG
ACCCCCGGCT CCGCGCCCGA GGCGAGCGGC CGGGCGTACC TGCGGATCGC GGTCGAGGAC
GAGGCCGGCG CGCGTGCCTG GCTGGCCACG CCGGAGCTGC TCCTCGGCGA GCCGGTCGAC
ATCGCCCTCG AGGTGCCCCG CGCGCGCGGT CCCCTGCGCG TCGTCGCCGC GTCGTTCGTC
GTCGCGCTCG ACGGCACCCC GTTCGAGGTC GCCGTGGCGA CGAGCCCCCG CGACAAGCTC
GGGCAGATCG GTCTCGCGGT GCACGACGTG CGCGTGCTGG ACCGCTCGGT GGACGCCGGG
ACGCCCGACG AGGCGACGCT CGCGGCCGCG CAGGGCACCC CGGTCGACCT CTCCGGGGCC
CCGTGGGAGG GTTCCGCGAC GAGCGGTGGG GTCGTCCGCG ACGTGCTCGT CGGGAGTGCG
GCGACGGCCC CGGCGCCGTC GGGGCTGCCC GCCGACGCGC TCGTGCTCGA CGGCCTGTTC
GACATGAGCA CGCTGGACGC CTCGACGGGC CGGCTCGTCG CGCACGCCTG GCCCGCGCAG
GAGCGGGTCC GCGCGGTCGT GACCGAGTCG CTCGCGGAGC GCGCGGACCT CCGGGACGGC
GCCGGGTTCG TGATGCGCAT CGGCGACGCG CAGGTGGACG TGTACGTCGA GGCGGTCGTC
CCGTACGTCC CCGGGGCCCC GCGCGGCCCG GCGCTCCTGG TGGACCGCAC CGCGCTGGGC
CGCGTCGTGA CCGAGGCGGC CGGTACGGAC CCGCTCCTGG ACGCCTGGTG GCTGGCCGCA
CCCCCGGCGC AGACCGCGGA CCTCGCCGGT GCGGTCGCGC ACGCCACCGG GGGCCACGCC
ACCGTGCGCA GCACCGAGCG CGCCGCCGCC GTGGCCGGTC CGCTGCGCGT GACGGTGCCC
GCAGCGCTGT CGCTCGTGAC CGCGGCGACC GCGATGCTGG TGCTCGTCGG GCTGGGGTCG
AGCGCGGCGG CCGTGGTCCG GTCGCGCCGG CTCGAGCTCG CCCGGCTGCA GGCGCTCGGC
GCGTCGCGCC GCTCGCTGGT CGGCGGGCTG CTCGGCGAGC ACGCGCTGCT CGTGCTCCTC
GGGGCCGGCA CGGGTGCGCT GATCGGGTAC GGGCTGTCGC GCGTCGTCGC GCCCGTCCTC
ACGGTGTCCG GGGACGGCCG CAGACCCGTG CCCGCGCCCG TGGTCGACTG GCAGGCCGAG
GAGACCTTCG CGATCACCGC GGGCCTCGCG CTGGCCGGGT GCGCGGTGGT CGCGCTGCTC
GCCACCGTGC TGGTACGACG CGCGTCCGGC GCGCTGCTGC GACTGGGGGA CGACCGATGA
 
Protein sequence
MVWRTQVLLG RLRDQATVLA TVALVTFVAT TLLGTFALLL DATGDDAVDA ALGRLPDSAI 
TLEATIRVNN KDTQTALDAA GDTLAAMLGD VPTERTAWLT GRTWSLPRVE GAPVAPLAYP
ASTPLVPDQT ELLSGTWPDA ARDDAGRLLV NVPGVAAERY GWAVGTEVPV RTLGGQAEDT
WLVVGTHEIT GPPASWSRDP LGGAGHDAAY PVPGTLGKLV TDLWGPVVVA PEALLGPGVT
ERAHLLVLPD LTGAPRGALA TARDSLTSGQ VRLSAALTDV GVSGSIRTDL GTTIDAAWRE
LTVTRVGVVV VGLLLAVLAT TVMLQAARLL GERRAAEGEL VAARGASPAQ LRSLAVLEAA
LLAVLVTGTA PWAARALFAR LADTGGMSAA GLTAPPGVPP AVWFACAGVA TVLAVALVVP
SWHVSGSSHA SAHAHLVRTG ADVALVALGG VALWQLLDYG APLTRGADGP RLDPVLVLGP
ALVTLAAAVL ALRLVGPVGR GADALARRGT TLVVPLAAWQ VARRPAAATG TILVVVLAVA
AATFSHAFLA TWRLSQLEQV DLALGTDARI EGARGEPLVV SADVRGALAD APGDAVLQPV
VVRNVGVGRA LGADRGSSAI DARVIGVDAS TPDLLRGRGP EPWEDVVRDL PHPPGSARAP
ERTATGTELP GDPQWLLARV TPGSAPEASG RAYLRIAVED EAGARAWLAT PELLLGEPVD
IALEVPRARG PLRVVAASFV VALDGTPFEV AVATSPRDKL GQIGLAVHDV RVLDRSVDAG
TPDEATLAAA QGTPVDLSGA PWEGSATSGG VVRDVLVGSA ATAPAPSGLP ADALVLDGLF
DMSTLDASTG RLVAHAWPAQ ERVRAVVTES LAERADLRDG AGFVMRIGDA QVDVYVEAVV
PYVPGAPRGP ALLVDRTALG RVVTEAAGTD PLLDAWWLAA PPAQTADLAG AVAHATGGHA
TVRSTERAAA VAGPLRVTVP AALSLVTAAT AMLVLVGLGS SAAAVVRSRR LELARLQALG
ASRRSLVGGL LGEHALLVLL GAGTGALIGY GLSRVVAPVL TVSGDGRRPV PAPVVDWQAE
ETFAITAGLA LAGCAVVALL ATVLVRRASG ALLRLGDDR