Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_2024 |
Symbol | |
ID | 9145919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 2252741 |
End bp | 2255011 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | glycoside hydrolase family 10 |
Protein accession | YP_003637118 |
Protein GI | 296129868 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGCA CCATCGGACG GGCGGCGCTG GTCGCCGCGG TGTCGCTCGG CGCACTGGTC GGTGCCGCCG CTCTGACGCC TGCGGCCGCA GCGCCGAGCG GGTCGGGCCC GAACCCCGCC GACTACTCCA CCACCGGCGC CCTGCCCAAC CACACGATCT ACAAGCCGGT GAACCTTCCG TCGCAGCGCA TGCCCATCGT GGTGTGGTCC AACGGTGCGT GCTCGGCGGA CGGCACGTCG GCGCAGAACT TCCTCAAGGA GATCGCCTCC TGGGGCTTCC TCGTCGTCTC CAACGGCCGC CCCAACGGAA GTGGCAGCTC GAACTCCACG TGGCTGACGC AGGCCATGGA CTGGGCCGTG GCCCAGAACT CGAACAGCAG CAGCGACCTG TACAACAGGC TCGACACCAG CAAGATCGGT GTCGCCGGCT TCTCGTGCGG CGGCATCGAG GCCTACGCGG TCTCCGGTGA CCCGCGCGTC ACGACGACCG GCATCTTCAG CAGCGGTCTG CTGAACGACG CGGACGACTA CCAGCTGCGC CGTCTCGACC ACCCGATCGC CTACATCATC GGTGGCCAGA GCGACATCGC GTACCCCAAC GCGATGGACG ACTGGGGCAA GCTGCCGCAG GGCCTGCCCG CCTTCATGGG CAACCTCAAC GTCGGTCACG GCGGTACGTA CCACGAGACC AACGGCGGGG CCTTCGGCTT CGCGGCCCAG CAGTGGTTCC GGTGGCAGCT CAAGGGCGAC ACGACGGCGG CGCAGACGTT CGTCGGTCAG AACTGCGGCC TGTGCCGCAA CGGCTGGCAG GTTCAGCAGA AGAACCTCAC GGTGACCCAG CCGACGCAGT CGCCGACGCC GTCGCCGACC CCCTCGCCGA CGCAGAGCAC CCCGCCGGTC CAGACGCCCA CCCCGACGCC GACGCAGACG ATCCCGTCGT CCCAGCCCGG CGCGACGCTG CAGGCCGCGG CGGCCCGCAC GGGCCGGTAC TTCGGTGTGG CCCTGGCGGC GGGCAAGCTC AACGACTCGA CCTACACGAC CATCGCGAAC CGTGAGTTCA ACATGGTGAC GGCCGAGAAC GAGATGAAGA TGGACGCCAC GGAGCCGAAC CAGAACCAGT TCAACTTCTC CCAGGGCGAC CGGATCCTGA ACTGGGCGAC GCAGAACGGC AAGCAGGTGC GTGGGCACGC GCTGGCGTGG CACTCCCAGC AGCCGGGCTG GATGCAGAAC ATGTCCGGCA CGCAGCTGCG CAACGCGATG CTCAACCACG TCACGCGGGT CGCGACGTAC TACAAGGGCA AGATCCACAG CTGGGACGTG GTGAACGAGG CGTTCGCCGA CGGCAACGGC GGTGCGCGTC GTGACTCGAA CCTGCAGCGC ACCGGTGACG ACTGGATCGA GGCCGCGTTC CGCGCCGCGC GCGCCGCGGA CCCGGGCGCC AAGCTCTGCT ACAACGACTA CAACACCGAC AACTGGACGT GGGACAAGAC CCAGGCCGTC TACCGCATGG TCCGTGACTT CAAGTCGCGC GGCGTGCCGA TCGACTGCGT GGGTTTCCAG TCGCACTTCA ACGCCCAGTC GGCGTACAAC AGCAACTACC GCACGACGCT GTCGAGCTTC GCCGCCCTGG GTGTCGAGGT GCAGATCACC GAGCTGGACA TCGAGGGCTC GGGTCAGCAG CAGGCGCAGA CGTACGCGAA CGTCGTGAAC GACTGCCTCG CCGTGCCGGC CTGCAAGGGC ATCACGGTCT GGGGTGTGCG TGACTCCGAC TCGTGGCGCT CGTACGGCAC CCCGCTGCTG TTCGACAACT CGGGCAACAA GAAGGAGGCC TACACGGCCA CCCTCAACGC CCTGAACGCC GCGGCGCCGG TCCCGACGCA GGAGCCGACG CAGACGCCGA CGCAGGAGCC GACGCAGACG CAGGAGCCGA CGCAGACGCC GACGGTCACC CCGACGCAGG ACCCGGGCCA GGGCTCGGGT GCGTGCACCG CGACCTACAC GGTCGCGAAC CAGTGGGGTG AGGGCTTCGT CGCCGACGTC ACGGTCACGG CCAACCAGGA CCTCACCGGA TGGAAGGTGA GCATCCAGCT CCCCGGCGGT GCCGGGGTCT CGCAGACCTG GAACGGGACG CGAGGGAGCG CCAGCACCGG CCTCGTCACT GTGGCGAACG CCGGCTGGAA CGGTCGTGTC GCAGCTGGTC AGAGCACCAG CTTCGGCTTC CAGGGGACCG GCAACGGGGC GGGCGCGACC GTCTCGTGCG AGGCCGCCTG A
|
Protein sequence | MRRTIGRAAL VAAVSLGALV GAAALTPAAA APSGSGPNPA DYSTTGALPN HTIYKPVNLP SQRMPIVVWS NGACSADGTS AQNFLKEIAS WGFLVVSNGR PNGSGSSNST WLTQAMDWAV AQNSNSSSDL YNRLDTSKIG VAGFSCGGIE AYAVSGDPRV TTTGIFSSGL LNDADDYQLR RLDHPIAYII GGQSDIAYPN AMDDWGKLPQ GLPAFMGNLN VGHGGTYHET NGGAFGFAAQ QWFRWQLKGD TTAAQTFVGQ NCGLCRNGWQ VQQKNLTVTQ PTQSPTPSPT PSPTQSTPPV QTPTPTPTQT IPSSQPGATL QAAAARTGRY FGVALAAGKL NDSTYTTIAN REFNMVTAEN EMKMDATEPN QNQFNFSQGD RILNWATQNG KQVRGHALAW HSQQPGWMQN MSGTQLRNAM LNHVTRVATY YKGKIHSWDV VNEAFADGNG GARRDSNLQR TGDDWIEAAF RAARAADPGA KLCYNDYNTD NWTWDKTQAV YRMVRDFKSR GVPIDCVGFQ SHFNAQSAYN SNYRTTLSSF AALGVEVQIT ELDIEGSGQQ QAQTYANVVN DCLAVPACKG ITVWGVRDSD SWRSYGTPLL FDNSGNKKEA YTATLNALNA AAPVPTQEPT QTPTQEPTQT QEPTQTPTVT PTQDPGQGSG ACTATYTVAN QWGEGFVADV TVTANQDLTG WKVSIQLPGG AGVSQTWNGT RGSASTGLVT VANAGWNGRV AAGQSTSFGF QGTGNGAGAT VSCEAA
|
| |