Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3401 |
Symbol | |
ID | 9147317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 3782367 |
End bp | 3784922 |
Gene Length | 2556 bp |
Protein Length | 851 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | Glycosyl hydrolase family 32 domain protein |
Protein accession | YP_003638477 |
Protein GI | 296131227 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.18387 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGACGAC GCATCCGCGC CCTGGGCGCG ACGGCCGCGG TGGCGCTCAT GACGGGGCTG GCGGCCCCGG TGCAGGGCCA GGACGCAGCC GACCCGCACC GGCCGGTGGT CCACTTCGCC CCGCAGCGCC ACTGGGTCAA CGACCCCAAC GGTCCCGTGT GGTACGACGG GCAGTACCAC CTGTTCTTCC AGCACAACCC GCTCGGGGAC ACCTGGGGCC ATATGTCCTG GGGCCACGCC GTCAGCACGG ACCTCGTGAC GTGGGAGGAG CGGCCCCTCG CGATCCCGTG GTCCGAGCGG GAGCACATCT TCTCCGGCAC CGTCGTCGTG GACGAGGGGA ACACGAGCGG CCTCGGCACG CCCGGCACCA CCCCGCTCGT GGCCGCCTAC ACCTCGTGGG ACCCGCTCAC GGGCATCCAG TCGCAGTCGG TCGCCTCCAG CCTCGACGCC GGGGAGACGT GGACCGCGTA CGAGGGCAAC CCGGTCCTCG ACATCGGGTC GCGCGAGTTC CGCGACCCCA AGGTGTTCCG GTACGAGGCC GGCGGGTACT GGGTGATGGC GGTGGCGCTC GCGGAGGAGC GGATCATCCG GTTCTACCGC TCGCACGACC TGATCCGCTG GACGCACCTC AGCGACTTCG GGCCGGCCGG CGCCGTCGGC GGCGTCTGGG AGATGCCCGA CCTGTTCGAG CTGCCGGTCG ACGGCGACCC GGCACGCACC CGGTGGGTGC TGGTCGTGAG CCTCAACCCC GGAGCCGTCG CCGGGGGGTC GGGCGCCCAG TACTTCGTCG GCGAGTTCGA CGGCACCCGC TTCGTCGCCG ACCCGCCCCC CGCGCCGGGT CCGGACGGCG ACGTCCTCGC CGACTTCGAG GGCGGCACGT ACGGCGAGGG CTGGACCACC ACCGGCGACG CCTTCGGCGA CGGACCGGTG TCCGGCACCC TGCCGGGTCA GCACCCCGTC ACGGACTTCC GGGGCGAGGG ACTCGTCAAC AGCTTCCGTG GCGGAGACGC CGCCACCGGC ACCCTCACGT CACCGCCGTT CACCGTCGAG CGCCGTCACC TCACGATGCT CGTCGGCGGC GGCCGGCACC CCCACCGCCC GGGCACCGGC GACGGCGCAG CGCCCGCGGG CACGGTGCTC GCGGACTTCG AGGGGACCGA CTTCGGCGAC TGGACGGTCG AGGGGACCGC CTTCGGGCCC GGTCCGCTCG CCGGCGACGC CCCCTGCCAG ACGGGTGTGC GCGGATACCT CGGCACGCGC CTGGCCAACT CCTACCAGAA CGGCCGAAGC GACCCGTGCA CGCCTCCGCC CGACTCCGCC ACCGGCCGGC TCGTGTCCCC GACGTTCACC GTCGACCGCC CGTGGATCAG CATGCTCGTC GGTGGTGGTG CGGGGCCCGG CACCGCCGTG CGGCTCGTCG TCGACGGGCA GGTCGTGCGC ACCGCCTCCG GACGGGAGAG CGGGACGCTC GACTGGGCGA CCTGGGACGT CGCCGAGCTC GCCGGGCGGC GTGCCCACCT CGAGGTCGTC GACGAGGTCA CCGGGGGGTG GGGCCACATC ATGGTCGACC ACATCGTGCT CTCGGACGAG CCGGCGCGTC CGCGGTCGGA CGAGGCGACG GTCAACCTCG TGGTGGACGG GCGGGTCGTG CGCACTGCCA CCGGCCGCGA CTCCGAGCAA CTCGCCCCCG TCACCTGGGA CGTCGGTGAC CTCGTCGGGC GCACCGCGCA GCTCGTGGTG ACGGACACCA GCACGGGCGG GTGGGGTCAC ATCCTGCTCG ACCACGTCGT CGCCACGGAC GCGCCGGTGC CCACCCCGCT CGAGCGCCAC GCCTGGGTCG ACCACGGCAC GGACTTCTAC GCGCCGCTGA CCTTCGAGAA CACCCCGGAC GGCGAGCGGG TCGCCATCGC GTGGATGAAC AACTGGGAGT ACGCCACGTC GACGCCGACC ACCGGGTGGC GGGGATCGAT GACGTTCCCG CGCACCCTCG CGCTGCGCAC GGTCGACGGC CGGGTCGTGC TGACCTTCCA CCCCGTCGAC GTGCCCGGTG CGGAACCCAT CCGCCTGCGG GACCATGAGG GGGTCCTCGA CGAGGGCACC GTGCGCGTCC CCGAGGCGAC GCACGACGGC GCCGCGATCG TCACGGTCGA GCTCGAGGTC GGCGACGCCG AGCGGCTCGG GCTGCACGTG CGCGTCGGGG ACGACGAGCG CACGGTCGTC GGCTACGACG TCGCGGCCGG ACGGATGTAC GTCGACCGGA CACGCTCCGG CACGGTCGAC TTCCATCCGG GCTTCGCCGG CGTGCACACC GCGCTGCTGC CCACCCGCGA CGGACGCGTG CGGCTGCAGG TCGTCGTCGA CCGGTCGTCC GTCGAGGTCT TCGGCAACGA CGGGGAGGCC ACGATCACGG ACCTCGTCTA CCCGGGCGGG GGTAGCAACG GGGTGGCACT GTTCGCCGAG GGCGGCCGTG CGACCGTGAC CTCGCTCGAG GTGCGTCCGC TCGGTCCGGG GACGTCGTGG GCCGTCGTGG CACGCGGGCA CGCCGCCCGC CCGTGA
|
Protein sequence | MRRRIRALGA TAAVALMTGL AAPVQGQDAA DPHRPVVHFA PQRHWVNDPN GPVWYDGQYH LFFQHNPLGD TWGHMSWGHA VSTDLVTWEE RPLAIPWSER EHIFSGTVVV DEGNTSGLGT PGTTPLVAAY TSWDPLTGIQ SQSVASSLDA GETWTAYEGN PVLDIGSREF RDPKVFRYEA GGYWVMAVAL AEERIIRFYR SHDLIRWTHL SDFGPAGAVG GVWEMPDLFE LPVDGDPART RWVLVVSLNP GAVAGGSGAQ YFVGEFDGTR FVADPPPAPG PDGDVLADFE GGTYGEGWTT TGDAFGDGPV SGTLPGQHPV TDFRGEGLVN SFRGGDAATG TLTSPPFTVE RRHLTMLVGG GRHPHRPGTG DGAAPAGTVL ADFEGTDFGD WTVEGTAFGP GPLAGDAPCQ TGVRGYLGTR LANSYQNGRS DPCTPPPDSA TGRLVSPTFT VDRPWISMLV GGGAGPGTAV RLVVDGQVVR TASGRESGTL DWATWDVAEL AGRRAHLEVV DEVTGGWGHI MVDHIVLSDE PARPRSDEAT VNLVVDGRVV RTATGRDSEQ LAPVTWDVGD LVGRTAQLVV TDTSTGGWGH ILLDHVVATD APVPTPLERH AWVDHGTDFY APLTFENTPD GERVAIAWMN NWEYATSTPT TGWRGSMTFP RTLALRTVDG RVVLTFHPVD VPGAEPIRLR DHEGVLDEGT VRVPEATHDG AAIVTVELEV GDAERLGLHV RVGDDERTVV GYDVAAGRMY VDRTRSGTVD FHPGFAGVHT ALLPTRDGRV RLQVVVDRSS VEVFGNDGEA TITDLVYPGG GSNGVALFAE GGRATVTSLE VRPLGPGTSW AVVARGHAAR P
|
| |