Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_2196 |
Symbol | |
ID | 9146096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 2445392 |
End bp | 2446540 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | protein of unknown function DUF58 |
Protein accession | YP_003637286 |
Protein GI | 296130036 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.309415 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00320399 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGGGTCTGC GCGTCGCGCC CCTGGGGTGG GGGACCGCGG TCGTGGCCGT CTTCGCCACG GCGGCGGGAC GCGTGCTGGG CTGGGGCGAG CTCGCGGCGC TCGGCGTCGC GCTGCTGGCC GTCCTCGTGG TCGCCCTCCT CATGACGGTC GGGCGCACCC GCTACCGCGT CGTGCTCGAC CTGGCGGACC ACCGCGTGCG CATCGGCCAG CGGGCCGTGG GGCGCGTCGA CGTCCGCAAC GCCGCCAGGC GCCGCTCCCT GCCCTCGCAG GTCGACCTGC CCGTCGGGGA GCGGGTCGTC GAGCTGTCGG TCCCGAGCCT GCCGCCCGGC GGCACGCACG ACGACCTGTT CGCGGTGCCG ACGGAGCGGC GTGCCGTGAT CGTCGTGGGC CCCGTGGTGT CGCGGCGGGG CGACCCCTTC GGACTGCTGC AGCGCCGTCT GCGCTGGACC GAGCCCGCCG AGCTCTTCGT GCACCCCGAG GTGATCGGCC TGGGCGGCGC CAACGCCGGC CTGCTGCGCG ACCTCGAGGG GCAGGCGACG CGTGACCTGT CGGACTCGGA CCTCAACTTC CACGCGCTGC GCGACTACGT CGCCGGCGAC GACCGCCGCT ACATCCACTG GCGCACGACC GCGCGCCGTG GCCGCCTCAT GGTCAAGCAG TTCGAGGACA CGCGGCGCAC GCTGACGTCC ATCGCGCTCG CGACCGCGGT CGGCGACTAC GCGCACCCCG ACGAGCTCGA GCTGGCGGTG TCGGTCGCCG CGTCGATCGC GGTGCAGGCG ATCCGCGACG AGCGCGACGT CGAGGTGCTC GCCGGGGCGG GCCACCTGCG GACCGCCACG CCCCCGCTGC TCCTCGACGA CTGCTCGCGG CTGTCGTGGT CGCCCACGGG GCCCGGTGTC GTCCTGCTGG GCCGCCGCGT GGTGCGCGAG ACACCGGACG CGTCGGTGGC CTTCCTCGTC ACGGGCGGCG CGCCCACCGA CGCCGACCTG CGTCTGGGTG CGCGCGCACT GCCCGCCGGC ACGCGCGCCG TCGTCCTGCG GTGCGCGCTC GGCGAGGACG TCGCGGTGCG CACGCAGGGC TCGCTCACGC TGGGCACGCT CGGCCGGCTC GACGACCTGC CGCGCGCACT GCGCCGGGTG GTGGGATGA
|
Protein sequence | MGLRVAPLGW GTAVVAVFAT AAGRVLGWGE LAALGVALLA VLVVALLMTV GRTRYRVVLD LADHRVRIGQ RAVGRVDVRN AARRRSLPSQ VDLPVGERVV ELSVPSLPPG GTHDDLFAVP TERRAVIVVG PVVSRRGDPF GLLQRRLRWT EPAELFVHPE VIGLGGANAG LLRDLEGQAT RDLSDSDLNF HALRDYVAGD DRRYIHWRTT ARRGRLMVKQ FEDTRRTLTS IALATAVGDY AHPDELELAV SVAASIAVQA IRDERDVEVL AGAGHLRTAT PPLLLDDCSR LSWSPTGPGV VLLGRRVVRE TPDASVAFLV TGGAPTDADL RLGARALPAG TRAVVLRCAL GEDVAVRTQG SLTLGTLGRL DDLPRALRRV VG
|
| |