Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0502 |
Symbol | |
ID | 9144369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 534730 |
End bp | 536565 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003635615 |
Protein GI | 296128365 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000614607 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000229515 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCGCATCG GGGTGCTCGG TCCCGTCGTC GCGCACGCCG CCGGCGACGC GTTGACGCTG CCCCGGCCGC GCTCGCGCGA GGTGCTGGCG GTGCTCGTCG CGGCGGGCGG GCGCACGGTC CGCACGGACG CGCTGGTCGA CGACCTGTGG GACGGCACCC CTCCGCCGGG TGCGGTCGGG GCGGTGCGCA CGTTCGTCGC CGAGCTGCGC CGCGCGCTGG AGCCCGACCG GCCCCCGCGC ACACCGCCGC GCGTCGTCGT CACCCGCGGC CCGGGCTACG CGCTCGACGT CCCGCCGGAC GCCGTCGACG CGTGGCGCGT GGCGCGGCTG GCGGCCGAGG CCCGCACCGC TCCCCCGGAC ACCGCGGTGC GTCTGCTCGC CCACGCCCTC GCCGCGTGGC GCGGTGAGCC GTACGAGGAG CTCGCCGACC GCCCCTGGGT GCAGCCCGAG CGCACCCGCC TGGTGACGCT GCGCGCCGAC GTCACCGAGC AGCTGGCGGA TGCGCTGCTG GCGACGGGCC GCGCGGTCGA CGTGGTCCCG CTGCTCGACG CGCACGTCGG CGCGCACCCG TGGCGCGAGG ACGGCTGGCG GCTGCTGGCC ACGGCGCTGC ACCGCTTGCA CCGGCCGGCG GACGCGCTCG ACGTGCTGCG CCGCGCCCGA CGCACCCTCG CCGAGGACCT CGGCCTCGAC CCCGGCCCAG CCCTGCGGGA CCTGGAGCAG CAGGTCCTGG AGCGTCGCGA CGACGACGCG TGGCGCGACG ACAGCCTGTC CGCGCTCGAC CGCCGCGGCG GGCGCGCGCG GCTGGAGGCG TCGGGCGCGG TCCTCACGAG CCTGGCCGTG TCGGGTGACC TCGCGACGGT CCGCGCGCAG CGGCTCGCGT CCATCGCGGC GGCCGAGAGG CTGGGCGACC CGTTGCTGAC GGCGCGGGTG GTCGGCGGCC TGGAGGCGCC GGGGGTGTGG ACGCGGTCGG ACGACGACGA GCTCGCGGCG GCGGTCGTCG CGGCGGCCGT CCGGACGCTG CCGCGGGTGA CGGGCAGCCC GGTGACGCGC GGGCGACTGC TCGCGACGAT CGCGATCGAG GACCGCGGCA CGGCGGCGCG CGAGACCGAG GCGCTCGAGG CCGAACGGAT CGCGCGCGAC CTGGACGACC GGCACCTGCT GTGCCTGGCT CTCAGCGGGC GCGCGATGCA GCGGTTCGGG AGCACGGGGC TGGCGGCGGA CCGCGAGCGG ATCGGCGCCG AGCTCGTCGC GACGGCGGTG TGCGCGGAGT CCACGACGTT CCGGATCGCC GGGCGCATCG TGCGGATGCA GGCGCTGTGC GCGCTGGGGC GGCTCGACGA GGCGGCGGCG GAGGCCGACG AGGTCGACGC GCTGGCCGCG TCCGCCGAGC GCCCGCTGGC GACGACGTTC ACCGCGTGGT TCCGCCACAC CTTCGCCGAC GGGCCGGAGC CGGCGGCGCC CGACGAGATG CCGGGGTTCT CGCACGGGAT CGTCGCGCTG GCCCGGGTGA CGCGGCTGGT CCGCGACGGC GGCACGCTCC CCGGGCCGGA CGCCGCGGGC GACCTGGGCC CGTACGCGCC GTGGGTCCGG CCGCTCCTGC TGGTGCGCGC GGGCGACGTC GACGGCGCCC GCCGGGCCGT GCGTGGGGCA CCGGCGCCGC CGCACGACCT GCTGCAGGAG GTCGCCTGGG GCCTGCTGCT GACGGCGGCC CGCGAGGCCG GCGCACCCGA CGTCGTCGAC CGTGCCCGTG ACGCCCTGGC ACCGGCCGTC GACGAGCGTG CGGCGGGCAG CGGCGTCGTC GACGCGGGTC CCGTCCGCGC GTTGCTGCGG GGCTGA
|
Protein sequence | MRIGVLGPVV AHAAGDALTL PRPRSREVLA VLVAAGGRTV RTDALVDDLW DGTPPPGAVG AVRTFVAELR RALEPDRPPR TPPRVVVTRG PGYALDVPPD AVDAWRVARL AAEARTAPPD TAVRLLAHAL AAWRGEPYEE LADRPWVQPE RTRLVTLRAD VTEQLADALL ATGRAVDVVP LLDAHVGAHP WREDGWRLLA TALHRLHRPA DALDVLRRAR RTLAEDLGLD PGPALRDLEQ QVLERRDDDA WRDDSLSALD RRGGRARLEA SGAVLTSLAV SGDLATVRAQ RLASIAAAER LGDPLLTARV VGGLEAPGVW TRSDDDELAA AVVAAAVRTL PRVTGSPVTR GRLLATIAIE DRGTAARETE ALEAERIARD LDDRHLLCLA LSGRAMQRFG STGLAADRER IGAELVATAV CAESTTFRIA GRIVRMQALC ALGRLDEAAA EADEVDALAA SAERPLATTF TAWFRHTFAD GPEPAAPDEM PGFSHGIVAL ARVTRLVRDG GTLPGPDAAG DLGPYAPWVR PLLLVRAGDV DGARRAVRGA PAPPHDLLQE VAWGLLLTAA REAGAPDVVD RARDALAPAV DERAAGSGVV DAGPVRALLR G
|
| |