Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3120 |
Symbol | |
ID | 9147033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3466869 |
End bp | 3468188 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | cellulose-binding family II |
Protein accession | YP_003638201 |
Protein GI | 296130951 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0893071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00025616 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCACCA CCCGATCCAC CCTGCGCGCC CGCAGTGCGC TCGCCGCGCT CGTCGCGGGC GTCCTGACGG TCGGCGCCGT CGCCGTCGCG TCCGCCAGCG AGCCGCAGGT GCGCACCGCA CGCTGCGAGG TCACCTTCAC CACCAACTCC TGGCCCGGCG GGTTCGTCAC CGAGGCGCGC TTCTCCACGG CGGAGGCGCT CTCCGGGTGG ACCCTGTCGT TCCTGCTGCC CGACGGGGCA CGCGTCAGCA ACGCCTGGAG CGCCGACGTC GCGCAGTCCG GCCAGAGCGT GACGGTCCGC AACGCCGCCT GGAACGGCCA GCTGTCCGCC GGGGCGACGG TCTCGTTCGG CTACCAGGGG ACGACCACGG GCAGTGCCGT CGAGCCGCCC GAGGTGTACC TCGACGGCGC GCGGTGCGTG GGCCTCACCG GCCCGACCAC GCCGCCGACG CCGACCGTGA GCCCCACACC CACGCCCGTC GTCGACCCGA CGCCCAGCCC CACACCGAGC CCCACACCGT CGCCGACGCC GAGCCCCACA CCGAGCGCGA CGCCCAGCCC CACGCCGAGC CCGACGCCCA GCCCCACACC GAGCCCGACG CCCAGCCCCA CGCCGAGCCC CACACCGAGG CCTGACCCGG GCGGCTGCAG CGGCGCGTTC TGCGACGGCT TCGAGGCCCA GACCGGCACG TCGCCGGCCG CGCCGTGGAG CGTGGTGCAC CCCGACTGCT CCGGCACGGG CACCGCGAGC ATCGACTCCT CCGTGGCCCG CTCCGGCAGC CGGTCGCTGC GGATCAACGG GGCCGTGGGC TACTGCAACC ACGTGTTCGT GCAGGCCGAC GGGGCCGTGA CCGGCTCGGG GGCGACGTAC CTGCGCTTCT GGGTGCGGCA CACCACCGCG CTGCCGACGT CGCACGTCAC GTTCCTCGCG ATGAAGGACG CGAACGACAA CGGCAAGGAC CTGCGCATGG GCGGCCAGAA CGGTGCCCTG CAGTGGAACC GCGCGTCGGA CGACGCGACG CTCCCCGAGC AGAGCCCCAA CGGCGTCGCG CTGAGCAAGC CGCTGCCCAC GAACCAGTGG TCGTGCGTCG AGGCGCTCGT GGACCCGGCA GGACGCCTGA CCACGTGGCT CGACGGCTCC GAGGTCACCG GCCTGGTCGC CGACGGCACG CCGACGCACG ACGTGGACAG CCAGTGGCTC AACAAGGCGT GGAGCCCGCG GCTCGTCGAC CTCAAGCTGG GCTGGGAGAG CTACGGCGAC GGCGCGGACA CGCTCTGGTA CGACGACGTG GTGGTGAGCA GCAGCCGCAC CGGCTGCTGA
|
Protein sequence | MSTTRSTLRA RSALAALVAG VLTVGAVAVA SASEPQVRTA RCEVTFTTNS WPGGFVTEAR FSTAEALSGW TLSFLLPDGA RVSNAWSADV AQSGQSVTVR NAAWNGQLSA GATVSFGYQG TTTGSAVEPP EVYLDGARCV GLTGPTTPPT PTVSPTPTPV VDPTPSPTPS PTPSPTPSPT PSATPSPTPS PTPSPTPSPT PSPTPSPTPR PDPGGCSGAF CDGFEAQTGT SPAAPWSVVH PDCSGTGTAS IDSSVARSGS RSLRINGAVG YCNHVFVQAD GAVTGSGATY LRFWVRHTTA LPTSHVTFLA MKDANDNGKD LRMGGQNGAL QWNRASDDAT LPEQSPNGVA LSKPLPTNQW SCVEALVDPA GRLTTWLDGS EVTGLVADGT PTHDVDSQWL NKAWSPRLVD LKLGWESYGD GADTLWYDDV VVSSSRTGC
|
| |