Gene Cfla_3298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3298 
Symbol 
ID9147214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3666135 
End bp3670103 
Gene Length3969 bp 
Protein Length1322 aa 
Translation table11 
GC content70% 
IMG OID 
Productputative type II DNA modification enzyme 
Protein accessionYP_003638376 
Protein GI296131126 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones90 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGGG ACCTGACCGC GGTGCGTTCC GTCGGGGCGC TGCTCCCGTC CGACCTGCTG 
TCGCGGGTCG CCGCGGGTGA CCCCGACCTG GGCGGGTTCT CCGGCCGCGA CTACCACCTG
GCGGCGGGTG AGTCGCCGCG TGAGGCGGCG AACCGCGCGT GGGCGTACCT CGTGGGGGTG
TGGCCGTCGT TCCAGGACGC GCTGGCGCGC CTGCCGGAGG GTGACGCGGC CGTCGGCCTG
ACGCGGGAGC GGTGGCTCCA GGTGCTGTTC CGCGAGCTGG GCTACGGGCG TCTGCAGACG
ACACCCGCAG GCGGCATCTC CGTCGCGGGC AAGGCGTTCC CGGTGTCGCA TGCGTGGGGC
GCTGTCCCGA TCCACCTGCT GGGGTGGGGC GTCGATCTCG ACAAGCGCAC CAAGGGGGTG
CCGGGTGCGG CCGAGCGGGC CCCGCACGCG ATGGTCCAGG AGCTGCTGAA CCGCTCGGAC
GACCACCTGT GGGCGGTCGT CTCCAACGGG CGTGTGCTGC GGCTGCTGCG CGACTCGACG
TCGCTGTCGA CGCAGTCGTA CGTGGAGTTC GACCTCGAGG CGATCTTCAC CGGCGAGCTG
TTCGCCGACT TCGTCGTGCT GTTCCTGCTG CTGCACCAGT CGCGCGTCGA GGTGCAGAGC
GAGGGCGCCC CGCCGAGCGA CTGCTGGCTG GAGAAGTGGC GGTCGACGGC GATCGCGTCA
GGTGCACGCG CGCTGACGCT GCTCTCGGAC GGGGTGCAGC GTGCGATCGC CGACCTGGGG
ACCGGCTTCC TGCGTGCGCC CCAGAACACC GCGCTGCGGG CGCGGCTCGA CGACGAGTCG
CTGCACCTGG CGGACTACCA CCAGGCCCTG CTGCGGCTCG TGTACCGGCT GCTGTTCCTG
TTCGTCGCCG AGGACCGGCA GGCGCTCCAC CCCGCCGACG CCGACCCGGT GGCCCGCGCG
CGCTACGTCG AGTACTTCTC GACCACGCGG CTGCGACGGC TCGCGCTGCG CCGGCGCGGG
ACGCGGCACG GCGACCTGTG GCAGGCGCAG CGGCTCGTGC TGCGGCGTCT GGGCCAGGAC
GACGGCTGCC CCGAGCTCGC ACTGCCCGGG CTGGGCGGCA TCTTCGACGA CGACGGCACC
GAGCTCTTCA CCGACGCCGA GCTGCCCAAC GACGCGCTGC TCTCCGCCGT GCGGCACCTG
TCGACAGTGC GGCCCAAGGG CCAGCCACTG CGCACGGTCG ACTACAAGAA CCTCGGCGCG
GAGGAGCTCG GCTCCATCTA CGAGTCGCTG CTCGAGCTCG TCCCCCGGTA CCAGCGCACC
GAGCAGACCT TCTCGCTGGA GAACCTGGCC GGCAACGACC GCAAGACCAC GGGCTCCTAC
TACACGCCGT CCTCGCTCAT CGACCTCGTG CTCGACGAGA CCCTCACGCC CCTGCTCGAC
GAGGCGGAGA GGAAGCCCGA CCCCGAGGCC GCGCTCCTCG CGATGAGCGT GTGCGATCCG
GCGTGCGGGT CCGGGCACTT CCTGGTCGCG GCGGCCCGCC GCATCGCCGA ACGCCTCGCG
ATCGTCCGCT CCGGGGAGAT CGACCCGACG CCGACGCACC TGCAGGACGC GCTCTACGAC
GTGGTGGGCT CATGCATCTA CGGCGTGGAC CTCAACCCAC TGGCCGCCGA GCTCGCGAAG
GTCTCGCTGT GGCTCGAGTC CATGCGCCCC GGGCGTCCGC TGTCGTTCCT CGACGCCCAC
ATCAAGGTCG GCAACGCGCT GCTCGGCACG ACGCCCGCGC TGCTCGCCGA CGGGATCCCC
GACGACGCCT ATGTCGCCCT GACGGGGGAT GACAAGCCGT TCACCACCGC GCTGAAGAAG
CGCAACAAGG CCGAGCGGGA GTCCAGCGGC GACGGCTCAC TCTTCGACCT GGGGGACGTC
GGGACCAGCA CGATCGACCT GCGCGGTCGG CTCTCGCGAG CGGTCGCGCC CGCGGTCGGC
GCGCCGCGCC TGCGGGAGGT CCACGCCGCG CAGCGTGCCT ACAGCCAGTT CCAGGCGTCG
CCCGAGCTGG CACGCGAGCG TCTGCACGCC GACGCCTGGT GCACGGCCTT CGTTCAGACC
AAGAGGCCGG GCACGACGGC GATCACCAGC GCGACACTCG ACGCCGTGCG CGACGGCACC
GCGCCCGACG ATGTCGTGCG CACCGTCCAG CGGCTCACCG CGCAGTTCCG GTTCTTCCAC
TGGCACCTGG AGTTCCCGGA CGTGTTCGAC ACGTCCGCAC CGGTCGGTGA ACACGGGTGG
GCAGGCGGGT TCACCGTCAT GGTGGGCAAC CCGCCGTGGG AGCGCATCAA GCTCCAGGAG
CAGGAGTTCT TCGCGCAGCG TGACGCCCTG ATCGCCGGTG CGGCGAACTC GGCCGCGCGG
AAGAAGCTCA TCGCAGCGCT GACGGCCGAC AACCCCGGCC TCGCTGACGA GTGGGCCGCC
GCGAGCCGGG CGGCAGAGGC GGCAAGCCAC TACCTGCGCA AGTCTGGTCG CTACCCGCTG
TGCGGTGTCG GCGACGTCAA CACCTACAGC GTCTTCGCCG AACTGTTCCG GTCGTCGATC
GCTCACCACG GGCGCATGGG CATCATCACC CCGACCGGTC TCGCCACGGA TGCGACGACC
GCCGCCTTCT TCGCCGACAC CGTATCGTCC GGGCGACTCG CCGCGTTCTA CGATTTCGAG
AACGAGGCGA AGATCTTTGA GAACGTGCAT CACGCGTTCC GCTTCGCGGT CGCCTGCCTC
ACCGGTGGTG TCCGAGCGGA CCAGGCTCGA CTCTCCTTCC TGACCAGGCA CGTCGCGGAC
GCCGTCTCCA CACGGTTTGG GCTTGACCCC GACGAGATCC TGGCACTCAA CCCCAACACG
GGGACCCTCC CGATGTTCAG ATCACGCCGC GACGCCGAGA TCACCTTGGG CATCTACCGG
CGTCATCCCG TCCTGGTCAA CGACGCCTCG GGCAGCAATC CGTGGGGTCT GCGATTCTCC
ACGATGTTCC ACATGTCCAA CGACTCGGGG TTATTCGAGA CGGCCGACGA CCTGCGCGCC
CGGGGCGCCG AGTTCGACGG CTGGGCCTGG ACACTCGGCA CGCAACGGTG GCTGCCGCTC
TACGAGGCCA AGATGCTCTC GCACTACGAC CACCGCTACT CGACCTACGC GAACGCCACC
CAGGCGCAGC TCAACATGGG CACCCTCCCC CGCCTGACCG ACGCCCAGCA CGACGACCCG
CACGTCGAGC CCCTCGCCCG CTACTGGGTC GCCGAGAAGG ACGTCGAGAC CGCCATCGCG
GGTCGCTGGG ACCGCGACTG GTTCCTCGGC TGGCGCGACA TCGCGAGATC GAGCGACGCC
CGCACCTTCG TCCCCAGCGT CCTGCCTAGA TCGGCAGTGG GTGACAAGTT CCTTCTCGCA
TGGCCCTCAG GGCCCGCCCA CTCGCTGCCA TTGCAGGCAA TATGGTCGAG CCACTGCTTC
GACTACATCG CACGGCAAAA ATTGAGCGGC ACGGGCATGA AGTACTTCCT CACCAAACAG
CTCGCCGCGC CACTCCCGGA AGATTTCGAT CAAACACTAC GCGGAGTAGC GGACCAACCA
CTCCTTAAGT GGATCACACC GCGAGTAGTT GCACTTTGCC ATACCTCGAG CCACCTGGCG
GCCTACGCGA ATGATTTCTC CGAATCGGTG TTCCCGTTCC GCTGGGTGCC TGGCCGCCGC
GAGCAGCTTC GCGCCGAGCT CGACGCCGCG ATGTTCCTCC TCTACGGCCT TGACCGGGGC
GAGGTCGAGC ACGTGATGGA CTCGTTCTTC GTCGTCCGCA AGTACGAGGA ACGCGACCAC
GGCGAGTTCC GCACCAAGCG CCTGATCCTC GAGGCCTACG ACGCGATGAC CGCGGCCGCG
CAAGCGGGCA CCGTCTACGT CAGCCCCCTC GACCCGCCAC CGGGTCACGG CCCCCGCCAC
GACGCATGA
 
Protein sequence
MSRDLTAVRS VGALLPSDLL SRVAAGDPDL GGFSGRDYHL AAGESPREAA NRAWAYLVGV 
WPSFQDALAR LPEGDAAVGL TRERWLQVLF RELGYGRLQT TPAGGISVAG KAFPVSHAWG
AVPIHLLGWG VDLDKRTKGV PGAAERAPHA MVQELLNRSD DHLWAVVSNG RVLRLLRDST
SLSTQSYVEF DLEAIFTGEL FADFVVLFLL LHQSRVEVQS EGAPPSDCWL EKWRSTAIAS
GARALTLLSD GVQRAIADLG TGFLRAPQNT ALRARLDDES LHLADYHQAL LRLVYRLLFL
FVAEDRQALH PADADPVARA RYVEYFSTTR LRRLALRRRG TRHGDLWQAQ RLVLRRLGQD
DGCPELALPG LGGIFDDDGT ELFTDAELPN DALLSAVRHL STVRPKGQPL RTVDYKNLGA
EELGSIYESL LELVPRYQRT EQTFSLENLA GNDRKTTGSY YTPSSLIDLV LDETLTPLLD
EAERKPDPEA ALLAMSVCDP ACGSGHFLVA AARRIAERLA IVRSGEIDPT PTHLQDALYD
VVGSCIYGVD LNPLAAELAK VSLWLESMRP GRPLSFLDAH IKVGNALLGT TPALLADGIP
DDAYVALTGD DKPFTTALKK RNKAERESSG DGSLFDLGDV GTSTIDLRGR LSRAVAPAVG
APRLREVHAA QRAYSQFQAS PELARERLHA DAWCTAFVQT KRPGTTAITS ATLDAVRDGT
APDDVVRTVQ RLTAQFRFFH WHLEFPDVFD TSAPVGEHGW AGGFTVMVGN PPWERIKLQE
QEFFAQRDAL IAGAANSAAR KKLIAALTAD NPGLADEWAA ASRAAEAASH YLRKSGRYPL
CGVGDVNTYS VFAELFRSSI AHHGRMGIIT PTGLATDATT AAFFADTVSS GRLAAFYDFE
NEAKIFENVH HAFRFAVACL TGGVRADQAR LSFLTRHVAD AVSTRFGLDP DEILALNPNT
GTLPMFRSRR DAEITLGIYR RHPVLVNDAS GSNPWGLRFS TMFHMSNDSG LFETADDLRA
RGAEFDGWAW TLGTQRWLPL YEAKMLSHYD HRYSTYANAT QAQLNMGTLP RLTDAQHDDP
HVEPLARYWV AEKDVETAIA GRWDRDWFLG WRDIARSSDA RTFVPSVLPR SAVGDKFLLA
WPSGPAHSLP LQAIWSSHCF DYIARQKLSG TGMKYFLTKQ LAAPLPEDFD QTLRGVADQP
LLKWITPRVV ALCHTSSHLA AYANDFSESV FPFRWVPGRR EQLRAELDAA MFLLYGLDRG
EVEHVMDSFF VVRKYEERDH GEFRTKRLIL EAYDAMTAAA QAGTVYVSPL DPPPGHGPRH
DA