Gene Cfla_3281 details

Gene Information

Locus tag: Cfla_3281
Symbol:
ID: 9147197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3640692
End bp: 3643835
Gene Length: 3144 bp
Protein Length: 1047 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: SMC domain-containing protein
Protein accession: YP_003638361
Protein GI: 296131111
COG category:
COG ID:
TIGRFAM ID:
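
The coordinates and lengths in the Gene Information table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3640692
END_BP = 3643835
GENE_LENGTH_BP = 3144
PROTEIN_LENGTH_AA = 1047


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions, 3643835 - 3640692 + 1 = 3144 bp, and 3144 / 3 - 1 = 1047 aa, matching the listed values.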


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 90
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATATTG AGTCCGTCAC TGCACACGCA TTCGGTCCTC TGCAGTCCGG CGACCTTCGG 
TTCGCCCCCG GGATGACGGT CGTTACGGGC GTCAACGAGT CCGCGAAGTC TTCTTGGCAC
GCCGCTGTGT ACGCCGCTGT GACGGGACGC CGACGCGGCA AGGGTGCGCC GACCCGTGAG
GAGCGCCGCT TTGCTGAGCT CCACAAGCCC TGGGACGACG ATCAGTGGCG TGTGTCCGCA
GTCCTCGTCC TCGATGACGG CCGACGCATC GAGCTAGCCC ACGACCTGAA CGGCAAGGTC
GACTGCCGTG CGACCGACCT TGCCCTCGGT ACGGACGTCT CGGCGGAGAT CATGTTCGAA
GGCGCTCCGG ACGCCTCGCG GTACCTCGGT CTCGACCGCA AGAGCTTCGC GGCCATCGCC
GTCGTCAATC AGGCGGAGCT CCTCGGGGTA CTGAACGCCG CCAACGGCCT GCAGGAGCAC
CTGCAGCGCG CCGCGGCCAC CGCCGGCGCG GACGCGACCG CCGCCGCCGC ACTGGCCGCC
CTGGAGACGT TCGCGCGCGA CAACGTCGGA CTCGACCGGG CGAACTCCTC CAAGCCGTTG
CGCGCAGCAA AGAATGCGCT CGAGAACGCA AGAGCCGACC TCGACGCCGC CTTCGCCGAG
CACGCGCGCT ACCTGGAGCT CACGGCGGTA GCCGAGACAC ATCGCGCGAG GGCGGACAAG
GCCGCGCAAC GGACTCTCGC CGCGCAGGAG AAGGCCGCGG CACTCGAGCT TCTGGTGCAT
GCCCTGCAGG TCGTGGCCGA ACGGCAAGGC GATGCCGCGC GCGTCGACGA TGTCAGCAAG
GCGGCTGTGA CGCGCAGAGA CGCACTTGCT CAGCGTGTGG CGAAGGCTCG CTCACTGAGT
GCCGCTTCCA CCGACGACGC CGTCCCCGCG GGAGCGCCAG CCGCAGAAGC AGTCGCTCGC
ATTGTCGCAG CCGCACTTGC ACGATGGTCC TCAGTTCCCG ACCTGCGGAT GCCGGCGGGG
TCCACCGCCG CCGACCTCGC TCAACAGCTC GATGCCCTAC CACCACCACC CGATGGCGAC
ACTCAGGTGG CGGCGGCGGT CCGGGACGCC TATCAGGGGT GGACCCGTGC GGTCGCAGTG
GTCACCGCTC ACGATGGCCG TCGCCCACCT GACCCGGGAA CTCCTTCGCA GGACCTCGAG
CCCGCCGTCG AAGCTGGCCC GTCGATGCTG CGCCAACTCG CAGCCCAGCT CGGCGCGACC
GGCAATGGCA ACTCGGAGCA GGTGCAGTAC CTGACCGATG CGCGCGACCA GGCTCGCTCA
GAACAGGCAG CGGCTCAGGC GTCGCTCGCC GACGCGACGG CCCGGGCGAA CGCTGCTCGA
GCCGCCTTCA CGACAGCCAT GGCGGCGCCT CCGTCGACGA CGACCGGCCG GTCCCGCGTG
CCGCGCTACG GCCTTGTCGC CACAGCGGCC GCCGCCGCAG TCGCGGCGGT GACCACGGCC
ATCGCCTTGT CGAACGTGAC GGTGGCACTT GTGCTGGGCG CGCTCTGCAT CGCCGCTGCC
GTCGGCGCTT TCGTCGTCGC CCGCCCGGAG CCCCGCCACC CGACCAGCGG CGCGGTGAAC
ATCCCAGCTC TGAGTGCCGC GGGCCACGCA GCGGAAGAGG CTCTGGCCTC CGCGCAGGAG
CGCCTGTACG CAGCGGTAAC TGAGGTGGCA TCCGCGGACG CGCAACTGCG CGCAGCTGCC
GCCTCCTCGT CCGCAGCGGC CGAGGTCGCT GCGCGCTGTG CCGCGCGGTC CCTGCCTGCC
GACCCCGGTG CGCTGCAACA GCTTGCTGCC CGCGCTGAGC AGCACCTGGA AGCACGGGAG
GCCTTCTTTC GTTGGGAGTC TGAGTCGCAG CGGGAGGCCA GGACCCTCGC GCAGGCGGAG
AGTGGGCTTC TCCGAGCTCT CGCCGACCGA GGCGTCGCCG TCGGAGCCAC GTCCACCGAG
GACGCGTTCT CCGCGTACGA GCGCGCGTGT GCGGAGCGCG CAGAGCAGGC CGCCCGGGCT
GCCCGGCGTC CAGAGGTCGA GAGGGCGCTT GCTGCTCGCG AAGCCGCCGA CAGGGACGCC
GCCGACGTCC TGGCGCAGCG GGAGTCAGCT CTGGACGCTC TGCGAGCTGC CGTGAGCGTC
GCGAGTGTAG GCGTTGATCC TGAAGCTGAT CCGGAGGCCC TGCGCGATGC GCTGATCCAG
TGGCAGGCTC ATTACGACGA GCTGCTGATC GCGACCGAAG CTCGTCAGCG CGATCTTTCC
CAGCTCGATG TCGTACTGGA CGGCTCGACG CTGGACGAAC TCGAGTCGTC GCTCATGGCG
GCAGAGGAGG CGGTCGCCGA GGCCGCTAAG GCCTCGGACG CGGCCCGGCG AGCTCTGCAA
GAGGCGGTCG TGCATGGTGA GGAACTCGCA GCCGATGCCG GCGCACCGAT CAGCGCAGGT
GTGGACGCTG CGCTCAGCGA CCTGACCCGG GCTCGACAGG CGCGGGCCGC AGCCCAGGAG
CAGGAGATGG AGCTGGCAGC AGTGGCGGAG AACGCCGCCG GGGTCGCAGC TGAGCGAGGT
AGGACACTCC GCAGCGTCGC CGAGGCCGAG GAGTGCCTGG TCGCTGCCGA AGCCGAGCTT
GCCCGGGTCA GCGAGCTCTC CGAGACGCTG CGGCTCACCA GCCATTTCCT CACGGACGCT
CAGGAACAGG TCCACCGGAC GATCGCGCCG GTACTGGCCG ACACGCTGAG TTCCTGGCTA
CCGCTGGTCA CAGGGGGGCG GTACACGGAC GCGACGGTGA ACCCGGCGAC GTTGGAGGTC
AAGGTGTGCG GCCCCCAGCG CAAGTGGCGG AATGCCGACC GTCTATCGAT CGGTACAGCA
GAACAGGTCT ACCTGCTCCT GCGGGTGGCA CTGGCCCAGC ACTTGAGCAC CACGGGCGAG
TCCTGCCCTC TCCTGCTCGA CGATGTCACC GTTCAGGCAG ACGCAGAACG GACCGTCGGC
ATCCTCGACC TGTTGCTCGC ACTGGCCTCG GATCGTCAGG TGATCCTCTT CGCGCAAGGG
CAAGAGGTTG CTGAATGGGC ACGAGTGCAC CTCATTGACC CCCGGCACTC GAGTGTGGAA
CTTACGCGAG TGGCCGTCGA GTGA
 
Protein sequence
MNIESVTAHA FGPLQSGDLR FAPGMTVVTG VNESAKSSWH AAVYAAVTGR RRGKGAPTRE 
ERRFAELHKP WDDDQWRVSA VLVLDDGRRI ELAHDLNGKV DCRATDLALG TDVSAEIMFE
GAPDASRYLG LDRKSFAAIA VVNQAELLGV LNAANGLQEH LQRAAATAGA DATAAAALAA
LETFARDNVG LDRANSSKPL RAAKNALENA RADLDAAFAE HARYLELTAV AETHRARADK
AAQRTLAAQE KAAALELLVH ALQVVAERQG DAARVDDVSK AAVTRRDALA QRVAKARSLS
AASTDDAVPA GAPAAEAVAR IVAAALARWS SVPDLRMPAG STAADLAQQL DALPPPPDGD
TQVAAAVRDA YQGWTRAVAV VTAHDGRRPP DPGTPSQDLE PAVEAGPSML RQLAAQLGAT
GNGNSEQVQY LTDARDQARS EQAAAQASLA DATARANAAR AAFTTAMAAP PSTTTGRSRV
PRYGLVATAA AAAVAAVTTA IALSNVTVAL VLGALCIAAA VGAFVVARPE PRHPTSGAVN
IPALSAAGHA AEEALASAQE RLYAAVTEVA SADAQLRAAA ASSSAAAEVA ARCAARSLPA
DPGALQQLAA RAEQHLEARE AFFRWESESQ REARTLAQAE SGLLRALADR GVAVGATSTE
DAFSAYERAC AERAEQAARA ARRPEVERAL AAREAADRDA ADVLAQRESA LDALRAAVSV
ASVGVDPEAD PEALRDALIQ WQAHYDELLI ATEARQRDLS QLDVVLDGST LDELESSLMA
AEEAVAEAAK ASDAARRALQ EAVVHGEELA ADAGAPISAG VDAALSDLTR ARQARAAAQE
QEMELAAVAE NAAGVAAERG RTLRSVAEAE ECLVAAEAEL ARVSELSETL RLTSHFLTDA
QEQVHRTIAP VLADTLSSWL PLVTGGRYTD ATVNPATLEV KVCGPQRKWR NADRLSIGTA
EQVYLLLRVA LAQHLSTTGE SCPLLLDDVT VQADAERTVG ILDLLLALAS DRQVILFAQG
QEVAEWARVH LIDPRHSSVE LTRVAVE
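
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial code), GTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below is a minimal sanity check for a CDS pasted in the spaced 10-base block layout used above. The start-codon set is an assumed subset of the initiators listed for table 11, and `describe_cds` is a hypothetical helper name. The excerpt joins only the first and last lines of the gene sequence for illustration; it is not the full gene.

```python
# Assumed subset of table 11 start codons; the full table lists more initiators.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def describe_cds(seq_text: str) -> dict:
    """Report length, reading-frame, start-codon and stop-codon checks."""
    seq = clean(seq_text)
    return {
        "length": len(seq),
        "in_frame": len(seq) % 3 == 0,
        "start_codon": seq[:3],
        "start_ok": seq[:3] in START_CODONS,
        "stop_codon": seq[-3:],
        "stop_ok": seq[-3:] in STOP_CODONS,
    }


# First and last lines of the gene sequence, joined for illustration only.
EXCERPT = """
GTGAATATTG AGTCCGTCAC TGCACACGCA TTCGGTCCTC TGCAGTCCGG CGACCTTCGG
CTTACGCGAG TGGCCGTCGA GTGA
"""

print(describe_cds(EXCERPT))
```

Running the same function on the full gene sequence should report a length of 3144 if the listing is complete.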