Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3281 |
Symbol | |
ID | 9147197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 3640692 |
End bp | 3643835 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | SMC domain-containing protein |
Protein accession | YP_003638361 |
Protein GI | 296131111 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATATTG AGTCCGTCAC TGCACACGCA TTCGGTCCTC TGCAGTCCGG CGACCTTCGG TTCGCCCCCG GGATGACGGT CGTTACGGGC GTCAACGAGT CCGCGAAGTC TTCTTGGCAC GCCGCTGTGT ACGCCGCTGT GACGGGACGC CGACGCGGCA AGGGTGCGCC GACCCGTGAG GAGCGCCGCT TTGCTGAGCT CCACAAGCCC TGGGACGACG ATCAGTGGCG TGTGTCCGCA GTCCTCGTCC TCGATGACGG CCGACGCATC GAGCTAGCCC ACGACCTGAA CGGCAAGGTC GACTGCCGTG CGACCGACCT TGCCCTCGGT ACGGACGTCT CGGCGGAGAT CATGTTCGAA GGCGCTCCGG ACGCCTCGCG GTACCTCGGT CTCGACCGCA AGAGCTTCGC GGCCATCGCC GTCGTCAATC AGGCGGAGCT CCTCGGGGTA CTGAACGCCG CCAACGGCCT GCAGGAGCAC CTGCAGCGCG CCGCGGCCAC CGCCGGCGCG GACGCGACCG CCGCCGCCGC ACTGGCCGCC CTGGAGACGT TCGCGCGCGA CAACGTCGGA CTCGACCGGG CGAACTCCTC CAAGCCGTTG CGCGCAGCAA AGAATGCGCT CGAGAACGCA AGAGCCGACC TCGACGCCGC CTTCGCCGAG CACGCGCGCT ACCTGGAGCT CACGGCGGTA GCCGAGACAC ATCGCGCGAG GGCGGACAAG GCCGCGCAAC GGACTCTCGC CGCGCAGGAG AAGGCCGCGG CACTCGAGCT TCTGGTGCAT GCCCTGCAGG TCGTGGCCGA ACGGCAAGGC GATGCCGCGC GCGTCGACGA TGTCAGCAAG GCGGCTGTGA CGCGCAGAGA CGCACTTGCT CAGCGTGTGG CGAAGGCTCG CTCACTGAGT GCCGCTTCCA CCGACGACGC CGTCCCCGCG GGAGCGCCAG CCGCAGAAGC AGTCGCTCGC ATTGTCGCAG CCGCACTTGC ACGATGGTCC TCAGTTCCCG ACCTGCGGAT GCCGGCGGGG TCCACCGCCG CCGACCTCGC TCAACAGCTC GATGCCCTAC CACCACCACC CGATGGCGAC ACTCAGGTGG CGGCGGCGGT CCGGGACGCC TATCAGGGGT GGACCCGTGC GGTCGCAGTG GTCACCGCTC ACGATGGCCG TCGCCCACCT GACCCGGGAA CTCCTTCGCA GGACCTCGAG CCCGCCGTCG AAGCTGGCCC GTCGATGCTG CGCCAACTCG CAGCCCAGCT CGGCGCGACC GGCAATGGCA ACTCGGAGCA GGTGCAGTAC CTGACCGATG CGCGCGACCA GGCTCGCTCA GAACAGGCAG CGGCTCAGGC GTCGCTCGCC GACGCGACGG CCCGGGCGAA CGCTGCTCGA GCCGCCTTCA CGACAGCCAT GGCGGCGCCT CCGTCGACGA CGACCGGCCG GTCCCGCGTG CCGCGCTACG GCCTTGTCGC CACAGCGGCC GCCGCCGCAG TCGCGGCGGT GACCACGGCC ATCGCCTTGT CGAACGTGAC GGTGGCACTT GTGCTGGGCG CGCTCTGCAT CGCCGCTGCC GTCGGCGCTT TCGTCGTCGC CCGCCCGGAG CCCCGCCACC CGACCAGCGG CGCGGTGAAC ATCCCAGCTC TGAGTGCCGC GGGCCACGCA GCGGAAGAGG CTCTGGCCTC CGCGCAGGAG CGCCTGTACG CAGCGGTAAC TGAGGTGGCA TCCGCGGACG CGCAACTGCG CGCAGCTGCC GCCTCCTCGT CCGCAGCGGC CGAGGTCGCT GCGCGCTGTG CCGCGCGGTC CCTGCCTGCC GACCCCGGTG CGCTGCAACA GCTTGCTGCC CGCGCTGAGC AGCACCTGGA AGCACGGGAG GCCTTCTTTC GTTGGGAGTC TGAGTCGCAG CGGGAGGCCA GGACCCTCGC GCAGGCGGAG AGTGGGCTTC TCCGAGCTCT CGCCGACCGA GGCGTCGCCG TCGGAGCCAC GTCCACCGAG GACGCGTTCT CCGCGTACGA GCGCGCGTGT GCGGAGCGCG CAGAGCAGGC CGCCCGGGCT GCCCGGCGTC CAGAGGTCGA GAGGGCGCTT GCTGCTCGCG AAGCCGCCGA CAGGGACGCC GCCGACGTCC TGGCGCAGCG GGAGTCAGCT CTGGACGCTC TGCGAGCTGC CGTGAGCGTC GCGAGTGTAG GCGTTGATCC TGAAGCTGAT CCGGAGGCCC TGCGCGATGC GCTGATCCAG TGGCAGGCTC ATTACGACGA GCTGCTGATC GCGACCGAAG CTCGTCAGCG CGATCTTTCC CAGCTCGATG TCGTACTGGA CGGCTCGACG CTGGACGAAC TCGAGTCGTC GCTCATGGCG GCAGAGGAGG CGGTCGCCGA GGCCGCTAAG GCCTCGGACG CGGCCCGGCG AGCTCTGCAA GAGGCGGTCG TGCATGGTGA GGAACTCGCA GCCGATGCCG GCGCACCGAT CAGCGCAGGT GTGGACGCTG CGCTCAGCGA CCTGACCCGG GCTCGACAGG CGCGGGCCGC AGCCCAGGAG CAGGAGATGG AGCTGGCAGC AGTGGCGGAG AACGCCGCCG GGGTCGCAGC TGAGCGAGGT AGGACACTCC GCAGCGTCGC CGAGGCCGAG GAGTGCCTGG TCGCTGCCGA AGCCGAGCTT GCCCGGGTCA GCGAGCTCTC CGAGACGCTG CGGCTCACCA GCCATTTCCT CACGGACGCT CAGGAACAGG TCCACCGGAC GATCGCGCCG GTACTGGCCG ACACGCTGAG TTCCTGGCTA CCGCTGGTCA CAGGGGGGCG GTACACGGAC GCGACGGTGA ACCCGGCGAC GTTGGAGGTC AAGGTGTGCG GCCCCCAGCG CAAGTGGCGG AATGCCGACC GTCTATCGAT CGGTACAGCA GAACAGGTCT ACCTGCTCCT GCGGGTGGCA CTGGCCCAGC ACTTGAGCAC CACGGGCGAG TCCTGCCCTC TCCTGCTCGA CGATGTCACC GTTCAGGCAG ACGCAGAACG GACCGTCGGC ATCCTCGACC TGTTGCTCGC ACTGGCCTCG GATCGTCAGG TGATCCTCTT CGCGCAAGGG CAAGAGGTTG CTGAATGGGC ACGAGTGCAC CTCATTGACC CCCGGCACTC GAGTGTGGAA CTTACGCGAG TGGCCGTCGA GTGA
|
Protein sequence | MNIESVTAHA FGPLQSGDLR FAPGMTVVTG VNESAKSSWH AAVYAAVTGR RRGKGAPTRE ERRFAELHKP WDDDQWRVSA VLVLDDGRRI ELAHDLNGKV DCRATDLALG TDVSAEIMFE GAPDASRYLG LDRKSFAAIA VVNQAELLGV LNAANGLQEH LQRAAATAGA DATAAAALAA LETFARDNVG LDRANSSKPL RAAKNALENA RADLDAAFAE HARYLELTAV AETHRARADK AAQRTLAAQE KAAALELLVH ALQVVAERQG DAARVDDVSK AAVTRRDALA QRVAKARSLS AASTDDAVPA GAPAAEAVAR IVAAALARWS SVPDLRMPAG STAADLAQQL DALPPPPDGD TQVAAAVRDA YQGWTRAVAV VTAHDGRRPP DPGTPSQDLE PAVEAGPSML RQLAAQLGAT GNGNSEQVQY LTDARDQARS EQAAAQASLA DATARANAAR AAFTTAMAAP PSTTTGRSRV PRYGLVATAA AAAVAAVTTA IALSNVTVAL VLGALCIAAA VGAFVVARPE PRHPTSGAVN IPALSAAGHA AEEALASAQE RLYAAVTEVA SADAQLRAAA ASSSAAAEVA ARCAARSLPA DPGALQQLAA RAEQHLEARE AFFRWESESQ REARTLAQAE SGLLRALADR GVAVGATSTE DAFSAYERAC AERAEQAARA ARRPEVERAL AAREAADRDA ADVLAQRESA LDALRAAVSV ASVGVDPEAD PEALRDALIQ WQAHYDELLI ATEARQRDLS QLDVVLDGST LDELESSLMA AEEAVAEAAK ASDAARRALQ EAVVHGEELA ADAGAPISAG VDAALSDLTR ARQARAAAQE QEMELAAVAE NAAGVAAERG RTLRSVAEAE ECLVAAEAEL ARVSELSETL RLTSHFLTDA QEQVHRTIAP VLADTLSSWL PLVTGGRYTD ATVNPATLEV KVCGPQRKWR NADRLSIGTA EQVYLLLRVA LAQHLSTTGE SCPLLLDDVT VQADAERTVG ILDLLLALAS DRQVILFAQG QEVAEWARVH LIDPRHSSVE LTRVAVE
|
| |