Gene Cfla_2198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2198 
Symbol 
ID9146098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2447547 
End bp2453690 
Gene Length6144 bp 
Protein Length2047 aa 
Translation table11 
GC content74% 
IMG OID 
ProductFibronectin type III domain protein 
Protein accessionYP_003637288 
Protein GI296130038 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0185742 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCTGGT GGGACGCGCG CACCTGGCGC CGAGGGCCGG CGAGCAGGGC GGCCGCGATC 
GTGACGGTTC CTGCGGTCGT GCTGTCGCTC GCGCTCCTCG ACCAGGGCTT CCCGCTGGCC
CGCGTCGACC TGAACGACGG TGGCGTGTGG CTCACGGCCA CGAGCAAGAT GAGCCTGGGG
CGCTACAACG TGCCCGTCGA GGAGCTCGAC GGTGGCCTGG TCACCACCGG CGGCACGTTC
GACGTGCTGC AGGACGAGGG CTGGGTGCTG CTGCGGGAGC AGAGCACCGT GTCGGTCGTC
GACCCGGCGA GCGTGGCGAC GACGACCCAG CTGGCGACGC CGGGCACCGA GGTGTCCATG
GCCGCGGGGC GCGTCGCGTT CGTCGACAGT GACGGCGACG CGTGGGTCCG CCAGGTCGCC
GCGCTCGACA CGCTCGACCT CACGCAGGAC ACGCCCGACC TCCGGCTCGG TGCGGGCGGG
GTGGCGGTCG TGGCGCGCAG CGGCGCGGTG CTCGCGCTGG CGCCGGAGAC CGGGACGGTC
ACCCGCGCAC CCGCACAGGA GCCGGTCGCG CCGGAGGAGG TCGGCCGGCT CGGGGCCGTC
GAGGCGGATG CCGCGACGAC CGTGGGCGAC GAGCTGGTCG TGCTGTCGGG CAGCACGGTG
CGGACCCTCG CGGGCACGGT GGAGGTCGAC GACGACGACC TGGTGCTGCA GCAGCCGGGA
CCGGCGGCCT CGAGCGTGCT GGTCGCCGGG CGATCGGCGC TGTGGGAGGT GCCGCTCGAC
GGCGGCGCAC CGCGGGAGCA CCCCACGACG GGCTCCGGCA GGCCCGCGAA GCCCGTGCGT
GTGGGGGACT GCGCGTACGC CGCGTGGTCC TCGTCCGTCG GCGGATACCT CGAGCTGTGC
GCCGGCCGGG ACGCGGTGGT CAAGGACCTC GAGGGGCTGA GCACGGGCGA CCAGCTGGAG
TTCCGCGTCA ACCGCGGTCT CGTCGTCCTC AACGACACGG TCGGGGGCCG GGTCTGGCTG
CCGCAGGAGG ACACCGAGGT GCGCGTCCCG AACTGGGACG ACATCGTGCC CGAGGAGGAG
CCGGAGGAGT CCGAGGAGGA CTCCGACAGC GGCGAGGTCA TGCAGGAGCC GGTCACCGAG
TGCAGCGACC AGGGCGCGCC GCCGGTCGCC GCCGACGACG CGTTCGGCGT GCGCGCCGGA
CGCACGCGCC TGCTGCCGGT GATCGACAAC GACTCGTCGG CGGACTGCAA GATCCTGGTG
ATCACGCAGG TCGACGCCCC GCCGCCGGAG TTCGGCACCG TGGAGCCCGT GCGCGGCGGC
CGCGCGCTGC AGGTGCACGT GGCGGAGGGC GCGAGCGGCT CGGTCCAGTT CAAGTACTCC
GTCAACGACG GTGGCGGCGT GAACGCTCCC GCGACGGGCG TGGTGACCCT CACGGTGAGC
GACGGCGACA CCGCGCCGAC CCAGGCGCGC ACGTCGACCC TGACGGTCGA GACCGGCGGA
CAGCTCGAGC ACGGCGTCCT CGCGGACTTC CACGACGCCG ACGGCGACGA CCTGCTGCTC
GTCGGTGCGA CCGCGGACCC CTCGGTGGGG ACCGCCCGGT TCCGGCAGGA CGGCGTGCTG
ACGTTCACCG CCGACGGTGG CGGGCTCGGC CGCACGACCG TGCAGGTGCA GGTCTCCGAC
GGCACGAACG TCGTCGACGG CGAGGTCGTC GTGGACGTGC GCGCCGCGGG CTCCGTCGCG
CCGCAGATCG ACCCCGTGCA CGCGGTGACC TACGTCGGGC AGGAGGTCGT GGTCAGCCCG
CTCGACGCGG TCCGCTCGGC GTCCAGCGAG CCGCCGCGCC TGGCCGCGGT GTCCGACGTG
GTGGGCGCGA CGATCGTCCC CGACCTGCGT GCCGGGACGT TCACGTTCAA GGCGCCGCGG
GCCCAGGTGT ACTACGTGCA GTTCGTCGTC ACCGCGGCCC CGCAGCAGGC GACCGGCCTC
GCGCGGATCG ACGTGCGGGA GTGGCCGGAG CAGGCCCAGC CGCCGATCGC GGTGCGTGAC
CTCGCGTTGC TCCCGGCGGG TGGCGAGGTC ACCGTCGACC CGCTCGCCAA CGACGAGGAC
CCCGCCGACA ACGTGCTCGT GCTCCAGACC GTCACCGCGC CCGAGGGATC GGGGCTGCAG
GTCGCCGTCA TCGACCACCG GTACGTGCGG ATCCGCGCCG AGCGGACGCC GGACGGGCCG
GTACCGCTCG TGTACGAGGT CTCGAACGGC TCGGCGTCGG CGCGCGGCGA GATCGTCGTG
CACCCCATCC CGCCGTCCGC GTCGTCGCAG GCGCCTGTCG TACCGCCCGT GGAGGCGAGC
GTGCGCGCCG GCGGCGTCGT GACGATCCCG GTGCTGGCCG GGGCCTCCGA CCCGGACGGC
GACCCGCTGA CCGTGCTGCG CGAGCTGTCG GAGCCGCTGC CGGAGGGCGA CGGCCTGCTG
TTCGTCTCGG GCGACGTGCT GCGCTACCAG GCGCCGAACC GGGCGCTCAC GGCGCGTGCG
ACCTTCCAGG TGCAGGACAC CGCGGGGAAC GTCACGGGGG CCACCGTGAC CGTGCGCGTC
CACGAGTCCG ACCCGCAGAC GAAGTCGCCG CCGCGGCCGA AGGACGTCGA GGCGCGCGTC
TTCGAGGGCG ACGTCGTGCG CATCCCCGTG CCGCTCGTCG GCATCGACGA CGACGGGGAC
GGCGTGACCC TGCTCGGTCC CGCGTCGGCG CCGGCCCTGG GACGCATCAC CGAGGTCGGC
CCCGACTGGC TGGAGTACGA GGCGGACCGC GGCTCGCGCG GCACCGAGAA GTTCACGTAC
GCCGTCGAGG ACTGGGTGGG CCAGCGGGCG GTCGCGTCCG TGCGCGTGGG CATCGCGCCC
CGTCCCTCCG GTGCCGCGGG GCTCGTCGCG GTCGACGACG CGGTGACGCT CCGGCCCGGC
CAGCGCGTGG AGGTGCGCGT CCTGGCGAAC GACATCGACT CGACCGGGCG TGAGCTGAGC
CTCGAGGCGA TCGAGCCGCC CGAGGGCCTC GACGCGCAGA TCCAGGGCCG GCGGATCGTC
GTGACGGCCC CGCCCGCCCC GGGCGTCGTG CAGATCCCGT ACCTCGCGAC GAACGACGGG
GGCGGCAGCG ACTACGGCGT GCTCACGGTC ACCGTCACGC CGGACGCACC CGTGCAGCCG
CCGCGTGCGC GCGACGTCCT CGTGCCGGCG ATCGACACGC TGGGCAAGAC CGAGGTGTCG
GTCGACGTGC TCGCGGTCGC ACAGAACCCC TCGGGTCCGC TGTCCGACCT CGAGGTGTCG
GTGCCCGCGT CCCACGCCGA CGTCGCGCGC GTCACCGAGG AGGGCGACGT GGTCGTCACC
CTGGTCGACC ACGCGCAGAC GGTGCCGTAC CGGCTGGTGA ACACCACCGA GCCGACCGCC
GACGCGTACG CGTTCATCAC CGTGCCCGCG CTCGGGTTCT TCCCGCCCCA GCTGCGCCCA
CGCGCGCCCG AGCTACGGGT GGCCAGCGGG GAGCAGATCG AGATCCCGTT GGCCGAGCAC
GTCCAGGTCG CTCCCGGCCG CAAGCCCACC ATCGCGGACC CCACGCAGGT GAGCGCGCCG
CGCTCGAACG GCGCCGAGCT CGTCAAGGAC CCCTCGACGA TCGTGTTCAC GTCCGCCGAG
GGGTACGCGG GGCCGGCGTC GGTCACGGTG CCCGTCACCG ACGCCACGGG TCCGTCCGAC
ACCACCGCGC GCACCGCGCT GCTCACGCTG CAGATCACGG TCTACGCGGT CGACGACGTC
GCACCGACGT TCGACGGCGC GACCATCGAG GTGGAGCCCG GCGAGGCACC CACGCGTATC
GACCTGCTCG CGCTGACGCA CGGCCCGGAG CAGGCCGACG GCGCCGTGGC GTCCGCCGAC
CGCTACGGCT ACTCGCTGAC GTCCGTCGTG CCCGCCGGGT TCGTGGTGGA CTCCGAAGGC
TCCGAGCTGT GGGTGTCGGC CGACCCGACG ACCCCGAAGG GCACGACGGG GCGCCTCGAC
CTGCGGCTCA CCTACGGGCG CAGCGGTGCC ATGGAGATCG CGGTCGACCT GCGCGTCGTG
GCGAGCAAGC GGCGCACCGC GACCGTCCAG AACTTCACGG TCGACGACGG CGCCCAGGGC
CGGGAGTCGA CCGTGGACGT GCTCGAGGGG GCGTTCAACC CGTTCCCCGA CACCGGGCCG
CTGAAGGTCG TCGGCGCGGT CGTCGAGACC GTCGGGGCGG GCACGGCGTC GGCGTCGTCG
AGCAGCGTCA CGGTGCGTCC CGGCGACGAC TTCATCGGCC TGATGGTCGT GCGCTTCCGT
GTGCGGGACG TCACGAGCGA CCCCGGGCGT GAAGTCGAGG GCCGTGTGAC CGTCCGCGTC
AGCGGCGTCC CGCTCGCCCC CGTCCCGCCG CGCATCGGCG AGGTGCGCGA CCGCACCGTC
GTGCTGTCGT GGACGGCGCC GGACAACCGC GGCGAGCCGA TCACGGAGTA CCGCGTGACC
GCGCAGCCCG GCGGCGCGAC GCGCTCGTGC GCGAGCACCA CGTGCACGAT CGACAACCTG
ACGAACGACG TCGAGTACAC GTTCACCGTC GAGGCGCGCA ACCGCGTCGG CTGGTCGGAG
CCGTCGCCGG CGTCCGCCCC CGCGCGCCCC GACGCGGTGC CGGACGCTCC CGGCGCCCCG
CGGCTCGTGT TCGGTGACGG CTCGGTCACG GCCACGTGGG ACGCACCGGT CTCCAAGGGC
TCGCCGATCA CCAGCTACTC GTTGGAGATC AGCCCGGCGC CGGACTCGGG CGCGGCGACG
CGCACGACGA CGTCGACGAG CTACACGTTC GACCGGCTGC GCAACGGCGT CGCCTACACC
GTCCGTGTCC GGGCCCACAA CAAGGCACCC GAGCCGAGCG CGTGGAGCCT GTGGTCGGGT
ACGGAGATCC CCGCCGGCCC GCCCGGCGCG CCGACGGGAC TGGCGGCCAC GCGCGTCGAC
GTGCCCTACG GCGGCCAGAT CACCGTCACC TGGGGCGACA CGGCCCCGAA CGGCGACGCG
ATCCAGGGCT ACGAGCTCGT CGTCGGCGGC AGCAACGGCG GCACCTTCCC CCTCAACGCC
GACGCGCGCT CCTACGCGTT CGCCCAGGCG AGGAACGGCG AGACGTACAC GTTCTCGCTG
CGCGCCCGGA ACAAGGCCGG GTGGGGGAGC GCGGCGAGCA CGGACGCGTC GACGTACGGC
CAGCCGGGCG CCCCCGGGGC GCCGAGCGCG GAGGCGCTCG TCGGCCAGGG CGCCGCGCGG
CTCACGTGGG CCGACGCCGA CGGCAACGGC GCGCCCATCG AGCGGTACGT CGTGACGGTG
TCGGACGGCC GACGCCTCGA CGTGAGCGGC ACGTCGACCA CCGTCACCGG GCTCACCGGA
GGGTCGTCCT ACACGTTCAC CGTCACCGCC GTGAACGCCC AGGGCGAGGG CCCGGCGTCG
GCGGCGGCGA CCGTGCAGGC CTCGACACCG CCAGGTACGC CCACGGTCGA GGCCCCCGTG
GTGTCCGCCA CGGGTGACTT CGGGCGCGCC ACACAGCTGC TGGTCTCGTG GAGTGCGGTG
TCCGCGAACG CGCTGGACGC CGAGGGACGA CCCTCGACGG TCTCGTACGA GTACCAGGTC
ACCAGCAACT GGGGCAGCAC CCAGCGACTG TCGACCACCG CGACGTCGGC GACCGTCGAC
GTCAGCGGCT GGCGCTTCCC GGTGGGCGGT GGCGATGTCA CCGTCACGGT GTGGCCGCAC
ACGACCGTGT CCGGGCAGCG GCTCGACGGC TCGTCGGGCG CGCGCACCGC ATCCGTCGGC
CGCTGGGGAT CACCGCCCTC GGCTCCCGGC ACGCCGACCC TCGTGGTGGA CGGCACCGCG
CTGACCGCCA GCTGGACGGC CCCCGCGGTC AACGGCGGCA GCGACATCGA GCGCTACCAG
GTGACCTGGA CGATCCCCGG GCGGCGCGAC CGCGTCGAGA ACACCGGACC CGGCACGCTC
CAGCACACGA ACGAGGTCGG CGACGTGCCG CCCGGCACGG TCGTCACGGT CACGGTGCGT
GCCGAGAACG CCAGCGGCCG CGGCGATCCC GCGTCCGCCA CCTGGACCGT GCCCGAGCCC
GCCGCACCGG AAGGTGACGG GTGA
 
Protein sequence
MTWWDARTWR RGPASRAAAI VTVPAVVLSL ALLDQGFPLA RVDLNDGGVW LTATSKMSLG 
RYNVPVEELD GGLVTTGGTF DVLQDEGWVL LREQSTVSVV DPASVATTTQ LATPGTEVSM
AAGRVAFVDS DGDAWVRQVA ALDTLDLTQD TPDLRLGAGG VAVVARSGAV LALAPETGTV
TRAPAQEPVA PEEVGRLGAV EADAATTVGD ELVVLSGSTV RTLAGTVEVD DDDLVLQQPG
PAASSVLVAG RSALWEVPLD GGAPREHPTT GSGRPAKPVR VGDCAYAAWS SSVGGYLELC
AGRDAVVKDL EGLSTGDQLE FRVNRGLVVL NDTVGGRVWL PQEDTEVRVP NWDDIVPEEE
PEESEEDSDS GEVMQEPVTE CSDQGAPPVA ADDAFGVRAG RTRLLPVIDN DSSADCKILV
ITQVDAPPPE FGTVEPVRGG RALQVHVAEG ASGSVQFKYS VNDGGGVNAP ATGVVTLTVS
DGDTAPTQAR TSTLTVETGG QLEHGVLADF HDADGDDLLL VGATADPSVG TARFRQDGVL
TFTADGGGLG RTTVQVQVSD GTNVVDGEVV VDVRAAGSVA PQIDPVHAVT YVGQEVVVSP
LDAVRSASSE PPRLAAVSDV VGATIVPDLR AGTFTFKAPR AQVYYVQFVV TAAPQQATGL
ARIDVREWPE QAQPPIAVRD LALLPAGGEV TVDPLANDED PADNVLVLQT VTAPEGSGLQ
VAVIDHRYVR IRAERTPDGP VPLVYEVSNG SASARGEIVV HPIPPSASSQ APVVPPVEAS
VRAGGVVTIP VLAGASDPDG DPLTVLRELS EPLPEGDGLL FVSGDVLRYQ APNRALTARA
TFQVQDTAGN VTGATVTVRV HESDPQTKSP PRPKDVEARV FEGDVVRIPV PLVGIDDDGD
GVTLLGPASA PALGRITEVG PDWLEYEADR GSRGTEKFTY AVEDWVGQRA VASVRVGIAP
RPSGAAGLVA VDDAVTLRPG QRVEVRVLAN DIDSTGRELS LEAIEPPEGL DAQIQGRRIV
VTAPPAPGVV QIPYLATNDG GGSDYGVLTV TVTPDAPVQP PRARDVLVPA IDTLGKTEVS
VDVLAVAQNP SGPLSDLEVS VPASHADVAR VTEEGDVVVT LVDHAQTVPY RLVNTTEPTA
DAYAFITVPA LGFFPPQLRP RAPELRVASG EQIEIPLAEH VQVAPGRKPT IADPTQVSAP
RSNGAELVKD PSTIVFTSAE GYAGPASVTV PVTDATGPSD TTARTALLTL QITVYAVDDV
APTFDGATIE VEPGEAPTRI DLLALTHGPE QADGAVASAD RYGYSLTSVV PAGFVVDSEG
SELWVSADPT TPKGTTGRLD LRLTYGRSGA MEIAVDLRVV ASKRRTATVQ NFTVDDGAQG
RESTVDVLEG AFNPFPDTGP LKVVGAVVET VGAGTASASS SSVTVRPGDD FIGLMVVRFR
VRDVTSDPGR EVEGRVTVRV SGVPLAPVPP RIGEVRDRTV VLSWTAPDNR GEPITEYRVT
AQPGGATRSC ASTTCTIDNL TNDVEYTFTV EARNRVGWSE PSPASAPARP DAVPDAPGAP
RLVFGDGSVT ATWDAPVSKG SPITSYSLEI SPAPDSGAAT RTTTSTSYTF DRLRNGVAYT
VRVRAHNKAP EPSAWSLWSG TEIPAGPPGA PTGLAATRVD VPYGGQITVT WGDTAPNGDA
IQGYELVVGG SNGGTFPLNA DARSYAFAQA RNGETYTFSL RARNKAGWGS AASTDASTYG
QPGAPGAPSA EALVGQGAAR LTWADADGNG APIERYVVTV SDGRRLDVSG TSTTVTGLTG
GSSYTFTVTA VNAQGEGPAS AAATVQASTP PGTPTVEAPV VSATGDFGRA TQLLVSWSAV
SANALDAEGR PSTVSYEYQV TSNWGSTQRL STTATSATVD VSGWRFPVGG GDVTVTVWPH
TTVSGQRLDG SSGARTASVG RWGSPPSAPG TPTLVVDGTA LTASWTAPAV NGGSDIERYQ
VTWTIPGRRD RVENTGPGTL QHTNEVGDVP PGTVVTVTVR AENASGRGDP ASATWTVPEP
AAPEGDG