Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_1280 |
Symbol | |
ID | 9145160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1429982 |
End bp | 1432057 |
Gene Length | 2076 bp |
Protein Length | 691 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | glycoside hydrolase family 43 |
Protein accession | YP_003636379 |
Protein GI | 296129129 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.286369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00183728 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCTCTCCG CGGCCTGCGC CGGCCTGGTC GCCGTCGCCC AGGCCCCTGA CGCCGCGGCC GCCACGGTCG ACACCGGCAC GTCGTACGTG CTGGTGAACC GCAACAGCGG CAAGGCGCTC GACGTCTACA ACTTCGCCAC CGGTGACGGC GCACGACTCA CGCAGTGGAC GCGCGGGTCC CAGGCCAACC AGCAGTGGCG CTTCCTCGAC TCCGGCGGCG GCTGGTACCG CGTGCGGTCG CAGCACTCGG GCAAGGTGCT GGACGTGCAG GGCTGGTCGA CCGCGGACGC CGCCAAGGTG GTGCAGTGGT CGGACACCGG CGGCGCCAAC CAGCAGTGGC GCCTGCAGGA CACGTCCGAC GGCTACGTGA TGCTCGTCAG CCGGCACTCG GGCAAGGCGC TGGAGGTGCA GAACGCGTCG CGTGACGACA ATGCGGAGAT CGTCCAGTAC AACCCGTGGG GAGGAACCAA CCAGCAGTGG CAGCTCGTGC CCATAGGGAG CCCCGTGACA CCGACGTCCA GCCCGACCTC CAGCCCGACC CCCGGCCCGA CCTCCGGTCC GACGACGCCG CCGAGCACCG GGACATTCAC CAACCCCGTC GTCTGGCAGG ACTTCGCCGA CGGCGACATC ATCCGCGTCG GTGACGCCTA CTACTACTCG GCCTCGACGA TGCACTACTC GCCGGGCGCG CCGATCCTGC GGTCCTACGA CCTGGTGAAC TGGGAGTACG CCGGTCACTC GGTGCCGCGC CTCGACTTCG ACGACCAGAA GTACGACCTC GTCGGCGGGC ACCGGTACGT CAAGGGCATC TGGGCGTCGA CCCTCAACTA CCGTCCCAGC AACCGTACCT ACTACTGGTA CGGCTGCACG GAGTTCAACC GCACCTACGT GTACACGGCG TCGGCCGTCG ACGGCACCTG GACCAAGAAG GCGCGGATCA ACAACTGCTA CTACGACGCC GGCCTGCTCA TCGACGACGA CGACACGATG TACCTCGCGT ACGGCAACGG CACGATCAGC GTCGCCCAGC TCAACGCGGA CGGCACGCAG CAGGTGCGGG CGCAGCAGGT GTACCAGACG CCGTCGAACA TCGGCACGCT CGAGGGCGCG CGGTTCTACA AGAAGGACGG CTACTACTAC ATCTGGCTGA CCCGCCCGGC CAACGGGCAG TACGTCCTGC GCTCGCGCTC ACCGTTCGGC CCCTACGAGC AGCGTCAGGT GCTGCTCGAC CTCCCCGGGC CGATCAGCGG CGGCGGGGTG CCCCACCAGG GCGGCATCGT CCAGACCGCG GCCGGCGACT GGTGGTACAT GGCGTTCACC GACGCCTACC CGGGCGGCCG CATGCCGACG CTCGCGCCGC TGACGTGGCG CGACGGCTGG CCCTCCGTCC AGACCGTCAA CGGCCGCTGG GGCACCACCT ACCCGCGCCC GGCCGTCAGC ACGACGCGCA CGGTCCAGCC GATGACCGGC AGGGACTCGT TCACCGGCTC GCGGCTGGGC CCGCGCTGGG AGTGGAACCA CAACCCGGAC ACCTCGCGCT TCTCGGTGGG CGACGGGCTG CGGCTGCAGA CGGCGAGCGT CACGAACGAC CTCTACGCCG CGCGCAACAC GCTGACCCAC CGCATCCAGG GGCCCACGTC GACGGCGACC ATCGAGCTCG ACTACTCCTC GATGGCCGCC GGTGACCGGG CCGGGCTCGC GATGCTGCGG CAGACGTCCG CGTGGATCGG GGTGGTCAAG GGCGGCGACG GCCGGACGCG GGTCGTCATG ACCGACGGCC TGACGATGGA CTCGTCGTGG TCGACCACGG GCCGTGGCAC GGAGCGCGCG AGCGCCGACG TCAGCGGGGG CCGGATCTGG CTGCGGACGT CCGCCGACAT CCGTCCCGGC GCGAACCGCC AGGCGACGTT CTCCTACAGC ACGGACGGCA CGACGTTCCG CAACCTCGGC CCGGCGTTCA CGCTCGACAA CGCGTGGCAG TTCTTCATGG GCTACCGGTT CGGCATCTTC AACCACGCCA CCACGTCCCT CGGGGGCGCC GTCACGGTCA GCAGCTTCGA CCTCACGACG CCCTGA
|
Protein sequence | MLSAACAGLV AVAQAPDAAA ATVDTGTSYV LVNRNSGKAL DVYNFATGDG ARLTQWTRGS QANQQWRFLD SGGGWYRVRS QHSGKVLDVQ GWSTADAAKV VQWSDTGGAN QQWRLQDTSD GYVMLVSRHS GKALEVQNAS RDDNAEIVQY NPWGGTNQQW QLVPIGSPVT PTSSPTSSPT PGPTSGPTTP PSTGTFTNPV VWQDFADGDI IRVGDAYYYS ASTMHYSPGA PILRSYDLVN WEYAGHSVPR LDFDDQKYDL VGGHRYVKGI WASTLNYRPS NRTYYWYGCT EFNRTYVYTA SAVDGTWTKK ARINNCYYDA GLLIDDDDTM YLAYGNGTIS VAQLNADGTQ QVRAQQVYQT PSNIGTLEGA RFYKKDGYYY IWLTRPANGQ YVLRSRSPFG PYEQRQVLLD LPGPISGGGV PHQGGIVQTA AGDWWYMAFT DAYPGGRMPT LAPLTWRDGW PSVQTVNGRW GTTYPRPAVS TTRTVQPMTG RDSFTGSRLG PRWEWNHNPD TSRFSVGDGL RLQTASVTND LYAARNTLTH RIQGPTSTAT IELDYSSMAA GDRAGLAMLR QTSAWIGVVK GGDGRTRVVM TDGLTMDSSW STTGRGTERA SADVSGGRIW LRTSADIRPG ANRQATFSYS TDGTTFRNLG PAFTLDNAWQ FFMGYRFGIF NHATTSLGGA VTVSSFDLTT P
|
| |