Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3563 |
Symbol | |
ID | 9147479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3952946 |
End bp | 3955273 |
Gene Length | 2328 bp |
Protein Length | 775 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | glycoside hydrolase family 9 |
Protein accession | YP_003638634 |
Protein GI | 296131384 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACCCA GCACCCGCCG GCGCGCCGCC GGCCTGGCAG CCTCGGCCGC GGTCGCGGTC GCCGCCGCCC TCACCGTCAC GGGCGCCACC GCGCCCGCGG CCGTCGCCGC CGCCGACCCG AGCACCCCGG TCAAGGTCAA CCAGGTCGCC TACGTCCCCG GTGTCGCCAA GGTCGCGACG CTCGTCAGCT CCGCGACGTC CCCCGTACCG TGGACGCTGC GCGACGCGTC CGGCCGCACG GTCGCCTCGG GCAGCACCAC CGTCAAGGGC GCGGACTCCC TGTCCGGCGA CCGCACCCAC CTCGTCGACT TCTCGTCGTA CGACACCCCG GGCACCGGGT ACGTGCTGGT CGCGGGCGGC TCCGAGAGCC TGCCGTTCGA CATCTCCGCC GACCCGCTGA AGCAGCTCCG GTACGACGCG CTCGCGTTCT TCTACCACCA GCGCTCGGGC ATCGCGATCG AGTCGCAGTA CGTCGGAGCC GCGTACGCGC GCGCCGCCGG GCACCTGGGT GTCGCCCCCA ACCAGGGCGA CACGAGCGTG CCGTGCCGGC AGTCCTGCGG GTACAGCCTC GACGTGCGCG GCGGCTGGTA CGACGCGGGC GACCACGGCA AGTACGTCGT CAACGGCGGC ATCGCGACGT GGCAGCTGCA GAACGCGTAC GAGCGGACGC TGCACGTCGA CGGCGCGGAC CGCGCGGCGC TCGGTGACGG GAAGCTGCGC ATCCCCGAGC GCGCCAACGG GGTGCCGGAC GTGCTCGACG AGGCGCGCTG GGAGGTCGAG TTCCTGCTGC GCATGCAGGT GCCGGCCGGG CGCACCGACG CCGGGATGGT CCACCACAAG ATGCACGACG AGAACTGGAC GGGGATGCCC ACGATCCCGT CGCAGGACTC GTCGCGCCGC ATCCTCGCGC CCGTCAGCAC GGCCGCGACG CTCAACATGG CGGCCGTCGC CGCACAGGCA GCGCGCCTGT GGGAGCCGTA CGACGCGACG TTCGCCGCGA AGGCGCTGAG CGCCGCGAGG ACCGCGTACG CCGCGGCGAA GGCCAACCCG AACCGCATCG CGTCGGCAAG CGACGGCACC GGCGGCGGGG CGTACGGCGA CACCTCCGTG ACCGACGAGT TCTACTGGGC GGCCGCCGAG CTGTACGCGA CGACGGGCGA GTCGGCCTAC CGCGCCGATG TCACGGGCTC GTCATTCTAC AAGGGCACGA GCTTCGCGCA GCGCGGGTAC GACTGGGGCT GGACCGGCGG GCTCGGTGAC ACGACGCTCG CACTCGTCCC GACCGACCTG CCCGCGTCGG ACGTCGCGGC GACGCGCTCG GCGATCGTGT CCTTCGCGGA CGCGGCGCTC TCCCGCCTGG CGGGCCAGGC GTACCCGGCA CCGAACAACG CCGGCAGCGT CTACTACTGG GGCTCCAACG GCCAGGTGAC CAACAACGCC AACGCGCTGG CCCTCGCGTA CGACTTCACG GGCCAGGCGA AGTACCGGAC CGCCGTGTTC GGCGCGCTCG ACTACCTGCA GGGCCGCAAC CCGCTGAACC AGTCGTACGT CGCCGGCTAC GGCGAGAAGC CCGTGCGCAA CGTGCACCAC CGGTTCTGGG CGAACCAGCT CGACGCGTCC CTGCCGACCG CGCCCCCCGG GTCGCTGTCC GGCGGCCCGA ACAGCGAGCT GCAGGACCCG TACGCGGCGG CGCAGCTCGC CGGGTGCGCG GCGCAGAAGT GCTTCGTCGA CCACATCGAC GCGTACTCGG TCAACGAGGT CGCGATCAAC TGGAACTCGG CGTTCGCGTG GCTGACGGCG TGGGCGGCGG AGCAGGCGGG CGGCGCGACG CCGACGCCGA CCCCGACGCC CACCCCGACC CCCACGCCGA CGCCGTCGGC GACGCCGAGC CCCACCCCGT CGCCGACCCC GTCGGTGACG CCGACCCCCA CCCCGTCGCC GACCCCGTCC GCGACGCCGA GCCCCACCCC GTCGCCGACC CCCTCCGCGA CGCCGAGCCC GACGCCCCAG CCGGCTGCGG CCTGCCGGGT GACGTACTCG GCGAACTCAT GGGGCTCGGG CTTCACGGCG GCGGTCGAGG TGACCAACAC CGGCTCCACC GCGTGGTCGT CGTGGCGCCT CGGCTTCACG TTCGCGGGTG ACCAGAAGGT CACCCAGGGC TGGAGCGCGA CGTGGGACCA GTCGGGCTCG ACGGTGACCG CGACCAACGC CGCGTGGAAC GGCCAGCTCG CGCCCGGCGC GACGACGAGC ATCGGGTTCA ACGGCTCGTA CAGCGGCACG AACGCGGCAC CGACCGCGTT CACGGTCAAC GGGGCCGCCT GCTCCTGA
|
Protein sequence | MPPSTRRRAA GLAASAAVAV AAALTVTGAT APAAVAAADP STPVKVNQVA YVPGVAKVAT LVSSATSPVP WTLRDASGRT VASGSTTVKG ADSLSGDRTH LVDFSSYDTP GTGYVLVAGG SESLPFDISA DPLKQLRYDA LAFFYHQRSG IAIESQYVGA AYARAAGHLG VAPNQGDTSV PCRQSCGYSL DVRGGWYDAG DHGKYVVNGG IATWQLQNAY ERTLHVDGAD RAALGDGKLR IPERANGVPD VLDEARWEVE FLLRMQVPAG RTDAGMVHHK MHDENWTGMP TIPSQDSSRR ILAPVSTAAT LNMAAVAAQA ARLWEPYDAT FAAKALSAAR TAYAAAKANP NRIASASDGT GGGAYGDTSV TDEFYWAAAE LYATTGESAY RADVTGSSFY KGTSFAQRGY DWGWTGGLGD TTLALVPTDL PASDVAATRS AIVSFADAAL SRLAGQAYPA PNNAGSVYYW GSNGQVTNNA NALALAYDFT GQAKYRTAVF GALDYLQGRN PLNQSYVAGY GEKPVRNVHH RFWANQLDAS LPTAPPGSLS GGPNSELQDP YAAAQLAGCA AQKCFVDHID AYSVNEVAIN WNSAFAWLTA WAAEQAGGAT PTPTPTPTPT PTPTPSATPS PTPSPTPSVT PTPTPSPTPS ATPSPTPSPT PSATPSPTPQ PAAACRVTYS ANSWGSGFTA AVEVTNTGST AWSSWRLGFT FAGDQKVTQG WSATWDQSGS TVTATNAAWN GQLAPGATTS IGFNGSYSGT NAAPTAFTVN GAACS
|
| |