Gene Information |
Locus tag | Cfla_1660 |
Symbol | |
ID | 9145549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1851825 |
End bp | 1853012 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Thiamin pyrophosphokinase catalytic region |
Protein accession | YP_003636756 |
Protein GI | 296129506 |
COG category | |
COG ID | |
TIGRFAM ID | |
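The Start bp, End bp, Gene Length, and Protein Length fields above are tied together by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions match the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp < 6 or cds_length_bp % 3 != 0:
        raise ValueError("a complete CDS must be a multiple of 3 bp, at least 6 bp")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1851825, 1853012)
print(length)                     # 1188
print(protein_length_aa(length))  # 395
```

Both results agree with the Gene Length (1188 bp) and Protein Length (395 aa) fields listed for Cfla_1660.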
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.316441 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGTCT CCCTGCGCCG CCGTACGCCC GCGTCCGACG AGTCCGAGGT CGTCGGACCG GCCCGAGTCG ACCCCCGGAC CAAGGGGCTC ACCAAGCGTC TGAAGCCGGG CGACGTCGCG GTCATCGACC ACCTCGACAT CGACCGTGTG TCGGCCGAGG CGCTGGTCGC CTGCGCACCG GCAGCGGTCC TCAACGCCGC ACGCTCGACG TCGGGCCGCT ATCCGAACCT CGGGCCGGGC ATCCTCGTCG AGGCGGGCAT CCCGCTGGTG GACGACCTCG GTCCGGACGT CATGGCGCTC ACCGAGGGGC ACGTGCTGCG CGTCGTGGAC GGGGCCGTCT ACGACGGTGA CACGCTCGTG GCCGAGGGCG TCGAGCAGAC GGTCGAGACC GTCGCTGCCG CCATGGCCGA GGCCCGCGAG GGCCTGTCGG TGCAGCTCGA GTCGTTCGCG GCCAACACGA TGGACTACCT GCGGCGGGAG CGCGAGCTGC TCCTCGACGG TGTCGGCGTG CCCGACATCG ACACGAGGAT CGACGGGCGG CAGGTGCTCA TCGTCGTGCG CGGCTACCAC TACAAGGAGG ACCTGGTCAC GCTGCGCCCC TACATCCGGG AGTACCGACC GGTGCTCATC GGTGTGGACG GCGGCGCGGA CGCGATCCTC GACGCCGGGT GGCGCCCGGA CATGATCGTC GGCGACATGG ACTCGGTGTC CGACCGGGCC CTGCGCTGCG GCGCCGAGGT CGTCGTGCAC GCCTACCGCG ACGGCCGGGC GCCCGGCATC GCGCGCGTCG AGCAGCTCGG CGTGCCGCAC GTCGTGTTCC CCGCGACCGG CACCAGCGAG GACGTCGCGA TGCTCCTCGC CGACGACAAG GGCGCCGAGC TCATCGTCGC GGTCGGGACG CACGCCACCC TCGTGGAGTT CCTCGACAAG GGCCGCTCCG GCATGGCCAG CACGTTCCTC ACGCGCCTGC GCGTCGGCGG CAAGCTCGTC GACGCGAAGG GCGTGTCGCA GCTCTACCAG CACCGCATCT CCAACGTGCA GCTCACGCTG CTCGTCCTGG CGGGCCTCGC CGCACTCGGC GTCGCGCTCG CCTCGACGGC CGCCGGCCAG ACGCTCTTCG GCCTGGTCGG TGCGCGCGTC GACGACCTCG TCTCCTGGGT CGGTGCGCTG TTCGGCGGGT CCCCGTGA
Protein sequence | MRVSLRRRTP ASDESEVVGP ARVDPRTKGL TKRLKPGDVA VIDHLDIDRV SAEALVACAP AAVLNAARST SGRYPNLGPG ILVEAGIPLV DDLGPDVMAL TEGHVLRVVD GAVYDGDTLV AEGVEQTVET VAAAMAEARE GLSVQLESFA ANTMDYLRRE RELLLDGVGV PDIDTRIDGR QVLIVVRGYH YKEDLVTLRP YIREYRPVLI GVDGGADAIL DAGWRPDMIV GDMDSVSDRA LRCGAEVVVH AYRDGRAPGI ARVEQLGVPH VVFPATGTSE DVAMLLADDK GAELIVAVGT HATLVEFLDK GRSGMASTFL TRLRVGGKLV DAKGVSQLYQ HRISNVQLTL LVLAGLAALG VALASTAAGQ TLFGLVGARV DDLVSWVGAL FGGSP
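The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and checked against the GC content and CDS fields. The helper names are hypothetical. The start-codon set holds only the three most common bacterial start codons, which is a simplification (translation table 11 also permits a few rarer ones).

```python
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}  # simplification, see lead-in
STOP_CODONS = {"TAA", "TAG", "TGA"}          # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the first and last codons
    are a common start codon and a stop codon, respectively."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Usage with a short made-up fragment (not the full gene):
fragment = clean_sequence("ATGAGAGTCT CCCTGTGA")
print(len(fragment), looks_like_complete_cds(fragment))
```

Pasting the full Gene sequence field into `clean_sequence` should give a string whose length matches the Gene Length field, and `gc_fraction` can then be compared with the reported GC content.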