Gene Information |
Locus tag | Cfla_1318 |
Symbol | |
ID | 9145198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1465459 |
End bp | 1467264 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003636416 |
Protein GI | 296129166 |
COG category | |
COG ID | |
TIGRFAM ID | |
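The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected protein length in aa for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the final
    codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1465459, 1467264)
    print(length, protein_length(length))
```

With the values in this record, the start and end positions give 1806 bp, and 1806 / 3 = 602 codons, of which one is the stop codon, leaving the listed 601 aa.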
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0818356 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00409906 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGAGCGCGG CCCGTCGGTC CGCAGCCCCT GTCCACACCC CCCAGCACCC CTCGACGTCG GCAGCGCAGC AGCTGCCGCC GGCGTTCCCG GCGCGTGCGC CGTGGGGTGC CGCCGGGAGC CTGCGGGCGT GGCAGGCGGA GGCGATCGAG CTCTACCGGC AGCGCGGCCC GCGCGACTTC CTCGCCGTCG CGACGCCGGG CGCCGGCAAG ACGACGTTCG CGTTGCGGAT CGCCACCGAG CTGCTCGAGG CCAAGGTGGT GCGCCGTGTC ACCGTCGTCG CGCCCACCGA GCACCTCAAG AAGCAGTGGG CCGACGCGGC CGCGCGTGTC GGCATCCGGC TCGACCCGCG CTTCAGCAAC GCGCAGGGGC GGCACGGGGC CGGGTACGAC GGTGTCGCGG TGACGTACGC GCAGGTCGCG AGCAAGCCGG CGCTGCACGC CGCGCGCACC ACGGCCGAGC GCACCCTCGT CATCCTCGAC GAGGTCCACC ACGGTGGCGA CGCGCTGTCG TGGGGCGACG CGGTCCGCGA GGCCTTCGAG GGGGCCACGC GCCGGCTCGC GCTCACGGGG ACGCCGTTCC GCTCGGACAC GGCGCCGATC CCGTTCGTCA CCTACGCGCC CGACGCGCAG GGCATCCGGC GCTCGGTGGC CGACTACACC TACGCGTACG GCGACGCGCT GCGCGACCAC GTCGTGCGGC CCGTCATCTT CCTGTCCTAC TCCGGGTCGA TGAGGTGGCG CACCAAGGCG GGCGACGAGG TCGCGGCACG CCTGGGCGAG CCGCTGACCA AGGACATGAC GTCGCAGGCG TGGCGGACGG CCCTGGACCC GAACGGTGAC TGGATCCCGT CGGTCATCGC CGCCGCGGAC CGCCGCCTCA CCGAGGTCCG GCGCACGGTG CCCGACGCGG GCGCGATGAT CATCGCCACC GACCAGACCG ACGCGCGCGC GTACGCGGGG CACATCGCGC GGCTCACGGG CGAGTCGCCG ACGGTCGTGC TGTCCGACGA CGACGGCGCG AGCGACCGCA TCGAGGAGTA CGCGGCGAGC GACTCGCGCT GGCTCGTCGC CGTGCGCATG GTGTCCGAGG GGGTCGACGT GCCGCGCCTG GCCGTGGGCG TCTACGCGAC GAGCACGGCG ACGCCGTTGT TCTTCGCCCA GGCCGTCGGG CGGTTCGTGC GCGCGCGGCG TCGCGGCGAG ACCGCGTCGG TCTTCCTGCC CAGCGTGCCG CAGCTCCTCG AGCTCGCCGC GTCGCTCGAG GTCGAGCGCG ACCACGCGCT GGACAAGCCG ACCGGCTCGG AGGACCCCGA GGCCGATCTG CTGGCGCTGG CCGAGCGGGA GCAGAAGAGC GAGGACGCGG TCGGCTCCGA CGGGGTCGTC GGCACGTTCG AGGCGCTCGA GGCCCAGGCG TCGTTCGACC GCGTGCTGTT CGACGGCGGC GAGTTCGGGA CGGGGGCCGA GGTGGGCTCC GACGAGGAGC TCGACTTCCT CGGGCTGCCG GGGCTGCTCG ACGCCGACCA GGTGACGACC CTGCTGCGCC AGCGGCAGGC CGACCAGCAG GGCGCGCGTC GCCGTCGGGG CGAGGCCGAG CAGGTCGAGG TCGTCGACCA CCGCAAGCAG GCGGAGCTGC GCAAGGAGCT CGCCCAGCTC GTGGGCGCGT GGGCGCGTCG CAGCGGCCAG CCGCACGCCA CCGTGCACGC CGAGCTGCGC CGGCGGTGCG GTGGTCCGGA GGTCGCGGTC GCCGCACCGG AGCAGCTCGA GGCCCGGATC GCGATGCTGC GCGGGTGGTT CGTCGGCAGG 
CGCTAG
|
Protein sequence | MSAARRSAAP VHTPQHPSTS AAQQLPPAFP ARAPWGAAGS LRAWQAEAIE LYRQRGPRDF LAVATPGAGK TTFALRIATE LLEAKVVRRV TVVAPTEHLK KQWADAAARV GIRLDPRFSN AQGRHGAGYD GVAVTYAQVA SKPALHAART TAERTLVILD EVHHGGDALS WGDAVREAFE GATRRLALTG TPFRSDTAPI PFVTYAPDAQ GIRRSVADYT YAYGDALRDH VVRPVIFLSY SGSMRWRTKA GDEVAARLGE PLTKDMTSQA WRTALDPNGD WIPSVIAAAD RRLTEVRRTV PDAGAMIIAT DQTDARAYAG HIARLTGESP TVVLSDDDGA SDRIEEYAAS DSRWLVAVRM VSEGVDVPRL AVGVYATSTA TPLFFAQAVG RFVRARRRGE TASVFLPSVP QLLELAASLE VERDHALDKP TGSEDPEADL LALAEREQKS EDAVGSDGVV GTFEALEAQA SFDRVLFDGG EFGTGAEVGS DEELDFLGLP GLLDADQVTT LLRQRQADQQ GARRRRGEAE QVEVVDHRKQ AELRKELAQL VGAWARRSGQ PHATVHAELR RRCGGPEVAV AAPEQLEARI AMLRGWFVGR R
|
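The sequences above are printed in space-separated blocks of 10 with line breaks, so they need to be cleaned before any computation. The following is a rough sketch of how the block formatting could be stripped and how the GC content and CDS boundaries might be checked. The helper names are hypothetical. The stop codons are those of translation table 11; the start-codon set is an assumption limited to the three most common bacterial start codons (table 11 permits others). Note that the gene begins with GTG, an alternative start codon that is read as methionine at initiation, which is consistent with the protein starting with M.

```python
# Assumption: only the three most common bacterial start codons are accepted here.
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons of translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts
    with an accepted start codon and ends with a stop codon."""
    seq = clean_sequence(raw)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # Short toy input; paste the full "Gene sequence" field in its place.
    toy = "GTGAGCGCGG CCCGTCGGTC\nCGCTAG"
    print(len(clean_sequence(toy)), round(gc_fraction(toy), 2))
```

Applying `gc_fraction` to the full gene sequence field should give a value comparable to the GC content listed in the record, and `looks_like_complete_cds` offers a quick sanity check that the pasted text was not truncated.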