Gene Information |
Locus tag | Cfla_2887 |
Symbol | |
ID | 9146796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 3198207 |
End bp | 3201200 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 80% |
IMG OID | |
Product | transcriptional regulator domain protein |
Protein accession | YP_003637970 |
Protein GI | 296130720 |
COG category | |
COG ID | |
TIGRFAM ID | |
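The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3198207
end_bp = 3201200
gene_length_bp = 2994
protein_length_aa = 997

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions all three checks print True for the values listed in this record.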
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0904649 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.154883 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGAGCGG CCGACGACGT GCCCGCGAGC ACCGGAGCAG TCCCGGTGCT GCGCGCCCGC CTGGCGCGCC CGGAACCGAC CGGGCTGCGC CGGCCGCGGC TGCTGGCCCG GCTCGTCGGC GCCGCGGCGC CGCCGCTCGT CGCCGTGGTC GCCGGCCCGG GGTGCGGCAA GACGACGCTG CTGGCGCACG TCGCGGAGGC CGCGGGCCTG CCCGTCGCGT GGGTGACCCT CGACCCCGCG CTCGACGACC CGCAGGTGCT GCTGCGGCAC CTGCACGTCG CGTGCGCGGC CCTGCCGTGG CGCGGACCGC ACGAGCCGTG GCCCACCGCG GACGCCGCGC TCGCCGCGCT CGGTGACGGC CTCGCGGGCC CGGCGCTGCT CGTGCTCGAC GACGTGCACA CGGCCGAGCG GCGCCGTGCC GCGCAGGTCG TCGACCTGCT CGTGCGCCAC CAGCCCGCCG GTCTGCAGCT CGCGCTCGGC AGCCGCGGCC TGCACCCCGG CGTGCCGAGC AGGGGGCTCG CGGGCACGGC GCGCGTCGTC GACGGCGAGG CGCTGCGCTT CCGCACGTGG GAGGTCGACG AGTTGTTCCG CGGGCACCAC GCGACGCCGC TGCCCGCGGC GGAGGTCGCC GAGCTCACGC AGCGCACGCA CGGCTGGGCG GCAGGTCTCG AGCTGTTCCA CCTGGCCACG CGTGACCGGC CGGCGAGCTC GCGCACCCGC CTGCTGGAGC GTGCGGGACG CGGTCCCGCG GTGGCCGACT ACCTCGCGGG CCACGTGCTC GACCCGCTGC CGGTCTCGCT GCGCGAGTTC CTCGTGCGCA CGTCCGTCCT GGGGCGCCTG TCCGCCGAGC GGTGCGACGC GCTGCTCGGC ACCGACGGCG CCGCCCGGAT GCTCGCCGCG GCGCACCGCC ACGGGTTGCT CACGGCGGTC GACGACGCCG GCGGCACGCG CTACGCCTGC CACGAGGTGG TGCGCGGGTA CCTGCTCGAC ACGCTCGCCG AGCGGGCGGG CGGGGCCGTC GCGCGCGAGC TGCACCGACG GGCGGCAGCT CTCGCGCTCG CCGCCGGCGA GCACGACGAG GCCGTGCGCT CGTCGTGCCG TGCGCAGGAC GAGGACGCCG TGCGCCGCGT CCTGCACCTG GCCGGGGAGG ACCTCGCGCG CCGTCCCGGG CCGTGGCTCG ACCTCCTGCC GGACGCGGTC CGCGACGACG ACCCGTGGGT CGCGGTCGCG GTCGCGCGCC GTCACCTCGC CGAGGGCGTG CCGGAGCGGG CCCTGGCGGC CTACGGGGAC GCCGCGGAGC GCGCGGCGAC CGCTGCCGAC CGGACGCTCG TGATCCGGGA GTGCCACCTG GTCCGCACGT GGGCGGAGCC GTCGCCGGGT GCGCCGACGT ACTGGGTCGC GCTGGTGCGC CGCGCACTGG CCCGGCCGCG CGACTGCCTC GCGGCGGACG TCGCCGCGCA CGACGTGGGT CGTGCACTCG CGCCCGCCGT CGCCGCGCTC GTGGGCGGCG ACGTACGGGA CGCGGCGCGC CGGTTCGCCG TGGTGCACAG GCGTGCCGGG ACCCCGCCGG TCGTGGAGGC GCTCGCGCTG CTGGGCGAGG CCGCGGCCCT CACGCTCGTG CACGACCGTG GCGCGCGTGA CGCGCGGGAG CGCGCGCACA CCGCCGCCGG GCTGCTCGGT GCGCCCGCGC TGGAGCGGCT GGCGGACGGC CTCGACCTCG CGGCCGACCC GGCGGTCGGC GACGGGCTGC GTCACCTGCG CGAGCGTTGC ACGGGCGTCG GCGACACGTG GGGTGCGGCC 
GTCCTCGGGC TGCTCGACGC CCTGCGGGCC GTCACCGCGG TCCCGTCGCG CCGCCGCGCC GTGGTGGTCG CGCGGGACGC GGCCGACGAG CTCGAGGCGC TGGGAGCGCC CGCGCTGGCG GCGTGGGCCC GCGCGGCGGG TGCCGTGGTC CGTGCCCGCT GCGGCGACGG TCCGCCGCCC GACGGGCTCG AGGCCGACGC GGCGCGCAGC GGTCCGGTGC CGCTCGCGCT CGTCCTCGCC GTGCTCGACG CGCCGCGGGA CCGCTGGGGT CGGGCGGCAC GCGCCGCGGG GGTCCCCGGG TGGGTGCGGC TCGTCGCGCA CCGTGCGGCC ACCCGCACGG CACCTGCGCC CCCGATCCCC GTCGCCGGTA CGCCCGCGGC CGCCGCCGCG ACCGCAGCGG GCGGTACGGC CGTGCTGCGG GCGCTCGACG TGCAGTGCCT GGGGACGTTC CGGCTGACCG TCGCCGGGGT CGCTGTGCCG ACCGCCGGAC TGCGTCCGCA GCACGCGGCA CTGCTGCGCG CGCTGGCGCT GGCCGCGCCC GCTCCCGTGC ACCGCGACCG CCTCGTCGAG TGGTTCTGGC CCGGTCGGCT GCCCTCGCGG GGGCAGCACT CCCTGCAGGT CGCCGTCAGC GACGTGCGGC GCGCGCTCGA CGCGGCGAGC CCGGGTGCCG GGGCGGTGCT GCGTCGCGTC GACCAGGGGT ACGCGCTCGA GGGGGGCGGG ACCACCGTGC ACGACGTGCG GGCGCTCGAG GACGCGCTGC GCGCGGCGGA CGTCGCGCTC GACAGGGGTG ACGAGGCCGC GGCGCTCGTC GCCCTGCGTC GCGTCGTGGC GGACGGTGCG GCTGAGCTGC TGCCGCAGGA CGGGACGGCC GAGTGGGTGC TCGGGCCGCG CGAGCGGCTG CGCGCCGCGG TGGTGCGGGC GTGCGCGGCG CTCGCGGACC TGCTGGCGGC GCGGGGCGAC GCGTCGGGTG CCCTCGCGGC CGTGCGCCGC GGCCTCGCGC TCGACCGGTA CCAGGACGAC CTGTGGCGCC GCCTCGTCAG AGGGCTCGCC GCCGACGGCC GCCCGGCGGC GGCAGCTGCG GCCCGCCGCG AGTACGCCGC GGTGCTCAGC GAGCTGGGTG TCGCGTCCGG CACGCCGCCC GCGCTGCGGG CCGGTGCTGC GACGCACGTG CTGCCCACCC TCACGGGGCG CTGA
Protein sequence | MRAADDVPAS TGAVPVLRAR LARPEPTGLR RPRLLARLVG AAAPPLVAVV AGPGCGKTTL LAHVAEAAGL PVAWVTLDPA LDDPQVLLRH LHVACAALPW RGPHEPWPTA DAALAALGDG LAGPALLVLD DVHTAERRRA AQVVDLLVRH QPAGLQLALG SRGLHPGVPS RGLAGTARVV DGEALRFRTW EVDELFRGHH ATPLPAAEVA ELTQRTHGWA AGLELFHLAT RDRPASSRTR LLERAGRGPA VADYLAGHVL DPLPVSLREF LVRTSVLGRL SAERCDALLG TDGAARMLAA AHRHGLLTAV DDAGGTRYAC HEVVRGYLLD TLAERAGGAV ARELHRRAAA LALAAGEHDE AVRSSCRAQD EDAVRRVLHL AGEDLARRPG PWLDLLPDAV RDDDPWVAVA VARRHLAEGV PERALAAYGD AAERAATAAD RTLVIRECHL VRTWAEPSPG APTYWVALVR RALARPRDCL AADVAAHDVG RALAPAVAAL VGGDVRDAAR RFAVVHRRAG TPPVVEALAL LGEAAALTLV HDRGARDARE RAHTAAGLLG APALERLADG LDLAADPAVG DGLRHLRERC TGVGDTWGAA VLGLLDALRA VTAVPSRRRA VVVARDAADE LEALGAPALA AWARAAGAVV RARCGDGPPP DGLEADAARS GPVPLALVLA VLDAPRDRWG RAARAAGVPG WVRLVAHRAA TRTAPAPPIP VAGTPAAAAA TAAGGTAVLR ALDVQCLGTF RLTVAGVAVP TAGLRPQHAA LLRALALAAP APVHRDRLVE WFWPGRLPSR GQHSLQVAVS DVRRALDAAS PGAGAVLRRV DQGYALEGGG TTVHDVRALE DALRAADVAL DRGDEAAALV ALRRVVADGA AELLPQDGTA EWVLGPRERL RAAVVRACAA LADLLAARGD ASGALAAVRR GLALDRYQDD LWRRLVRGLA ADGRPAAAAA ARREYAAVLS ELGVASGTPP ALRAGAATHV LPTLTGR
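The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The sketch below is a minimal illustration with hypothetical helper names (`clean`, `gc_fraction`); it uses only the first two and last two blocks of the gene sequence as sample input, so the GC value it computes describes that short sample and not the whole gene.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


# Sample input only: the first two and last two blocks of the gene sequence.
first_blocks = "ATGCGAGCGG CCGACGACGT"
last_blocks = "TCACGGGGCG CTGA"

start_codon = clean(first_blocks)[:3]
stop_codon = clean(last_blocks)[-3:]

# TAA, TAG and TGA are the stop codons of translation table 11.
print(start_codon == "ATG")
print(stop_codon in {"TAA", "TAG", "TGA"})
print(gc_fraction(first_blocks))
```

Passing the full gene sequence to `gc_fraction` instead of the sample gives a value that can be compared with the GC content field of the record.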