Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3001 |
Symbol | |
ID | 9146913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3327559 |
End bp | 3328590 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003638083 |
Protein GI | 296130833 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCACGT CCAGCGACAA CCGACGGACG GTGACGATCG CCGAGATCGC CGCGCTCGCC GGCGTCTCCG TCCCCACCGT GTCCAAGGTC CTCAACGGCC GCGCCGACGT CGCCGCGACG ACGCGCGCCC GTGTCGAGGC GATCCTCGAG GAGCACAGCT ACCGCCGCCG GCGCGGCCGC GGCTCGGGCG ACCCGAACCT CATCGACCTC GTCTTCCACC ACATCGACAA CGCGTGGGCG CAGGAGGTCA TCAAGGGCGT CGAGGACGCC GCCGCCGCCC ACCGCGTCGG CGTCGTGCTG TCCGAGCTCG GCGGCGCGCA CCGCCCGCAG CAGGAGCTCA TCGACGACAT CCTCGCGCGC CGCCCGCTCG GCGTGCTGCT CGTGCTGTCG AGCCTCGACG CGACGCAGCG CCACCAGCTC GAGTCCCGCT CCATCCCGTT CGTCGTCGTC GACACCTGGG GCGAGCCGCC GGCGGGCGTC CCGACGGTCG GCTCCAACAA CTGGAACGGC GGTCTCATCG CCACGCGCCA CCTGCTCTCG CTCGGGCACC GCCGCATCGC GGTCATCTCG GGCCCGTCCG ACGTGCTGTG CTCCCGCGCC CGCGTCGACG GCTACCGCAG CGCGCTCGAG GAGGCCGGCA TCCGCTCCGA CCCGTCGTAC GTCCGCTGGG GCGACTTCCA CGTCGACGGC GGCTACCGCC ACGGCCTGGA GCTGCTGTCG CGCCCCGACC GCCCCACCGC GATCTTCGCC GGCTCGGACT ACCAGTGCCT CGGCGTCATG CGGGCCGTGC GCGAGCTCGG CATGTCGATC CCCGAGGACG TGTCGGTCGT CGGGTACGAC GACATCCCGC TCGCCCAGTG GCTCGGCCCG TCGCTCACCA CCGTGCGCCA GCCGCTGCGC GAGATGGCCG GCACCGCGAC CGAGATGGTG CTGAGCATCG CCTCCGGCGA GCGACCGTCC AACCTGCGGA TCGACCTCGC GACCGAGCTC GTGGTCCGCG AGTCGACGGC GCCGGCACCC GCCGGCGTCT GA
|
Protein sequence | MATSSDNRRT VTIAEIAALA GVSVPTVSKV LNGRADVAAT TRARVEAILE EHSYRRRRGR GSGDPNLIDL VFHHIDNAWA QEVIKGVEDA AAAHRVGVVL SELGGAHRPQ QELIDDILAR RPLGVLLVLS SLDATQRHQL ESRSIPFVVV DTWGEPPAGV PTVGSNNWNG GLIATRHLLS LGHRRIAVIS GPSDVLCSRA RVDGYRSALE EAGIRSDPSY VRWGDFHVDG GYRHGLELLS RPDRPTAIFA GSDYQCLGVM RAVRELGMSI PEDVSVVGYD DIPLAQWLGP SLTTVRQPLR EMAGTATEMV LSIASGERPS NLRIDLATEL VVRESTAPAP AGV
|
| |