Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0068 |
Symbol | |
ID | 9143933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 86232 |
End bp | 87386 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | extracellular repeat protein, HAF family |
Protein accession | YP_003635187 |
Protein GI | 296127937 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00675708 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCTCACCA GAACTGCGCC GAGGCTGCTG CTCGTCGGCG TGCTCGCCTG CACGCTCGCG GCGGTGCCCG TCACCGCGGA TCCACCCGCC GCGGCCCGCG CGGGCTCGGG GATCACCGTC CGTGACCTGG GGACCCTCCC CGGTGACAGC TGGAGCGTCG CGGTGGACGT CAACGAGCAC GGCCAGGTCA TCGGGTTCAG CCAGGCGGGG AACGGTGACA CGCGCGGGTT CCTCTGGGAC CGCGGGGTGA TGCGTGACCT GGGGACGTTC TCCGCGGTGG CGATCAACGA CAGCGGGCAG GTCGTGGGGA CGGCCCTCAA CGGCTCCGGC GCCCAGCAGG CGGCCATGTG GGACGGTGGC CGGCTCCGGT ACCTCACCAC CCCGGACGGC GCCCCCAGCC GGGCGGTCCT CCTCAACGAG CACGGCCAGG TCGTCGTGCA GACACGGGAG TACGGCGGCG ACGAGCCGGA CCGGCTGCGC AACTACGTCT GGGACGACGG GGCCGTGACC GAGATCCCGC CCCTGCCGGG CAGCCCCTAC ATGCACCCGT TGGACATCAA CGACCAGGGC TGGGTGACCG GCTACAGCCC CGGCCCCGGG TCCCTCCGCC ACGGGTTCCT GTGGCGGGAC GGCGTCGTCA CCGACCTCGG CTCGGCCGCG TCCGGGGACG TCGCCTCCAC GATGGGCCTG GCCGTGAACG AGGCCGGTCA GGTGGCGGGC CAGGCGAGCG CATCGGACAC CGAGCACGCC GCCGCAGTCT GGCAGGACGG CGAGTGGATG CGGCTCGGCC ACCGCGAGGG CTGGAGCGGC GCGACCGACA TCAACGAGCA CGGCACCGTC GTGGGGTGGG CGAGCGACGG CGGCCCGCAC GAGCACGCCG TGCTCTACCG CGACGGCGAG TGGACCGACC TCGCCCCGGC CGGCTCGCGT GCGATCGAGC TGAACGACCG GGACCAGGTC ATCGGCTCCG TCGACGGCTA CACCCTCACC GCCGTGCTGT GGCAGGACGG CGAGACGCAC CTCCTGCCGC CCCTCTACCC CGGCAGCGCG ACCACGGCGT ACGACATCAG CGAGCGCGGC CAGGTCGCCG GCTCCGCCCG CGTGCTGTCC GGCGTGGAGC ACGCCGTGCT CTGGACGACG GCGCGGGGCT CCTGA
|
Protein sequence | MLTRTAPRLL LVGVLACTLA AVPVTADPPA AARAGSGITV RDLGTLPGDS WSVAVDVNEH GQVIGFSQAG NGDTRGFLWD RGVMRDLGTF SAVAINDSGQ VVGTALNGSG AQQAAMWDGG RLRYLTTPDG APSRAVLLNE HGQVVVQTRE YGGDEPDRLR NYVWDDGAVT EIPPLPGSPY MHPLDINDQG WVTGYSPGPG SLRHGFLWRD GVVTDLGSAA SGDVASTMGL AVNEAGQVAG QASASDTEHA AAVWQDGEWM RLGHREGWSG ATDINEHGTV VGWASDGGPH EHAVLYRDGE WTDLAPAGSR AIELNDRDQV IGSVDGYTLT AVLWQDGETH LLPPLYPGSA TTAYDISERG QVAGSARVLS GVEHAVLWTT ARGS
|
| |