Gene Information |
Locus tag | Cfla_1806 |
Symbol | |
ID | 9145699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 2013117 |
End bp | 2014508 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | histidyl-tRNA synthetase |
Protein accession | YP_003636902 |
Protein GI | 296129652 |
COG category | |
COG ID | |
TIGRFAM ID | |
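The coordinate and length fields above are consistent with each other, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2013117
end_bp = 2014508

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so the protein is one
# residue shorter than the number of codons.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values match the Gene Length (1392 bp) and Protein Length (463 aa) fields.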
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.114177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.246433 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGAC CCACACCTCT GTCCGGATTC CCCGAGTGGC TGCCCGACGG GCGCATCGTC GAGCAGCACG TGCTCGACGT CCTGCGCCGC ACCTTCGAGC TGCACGGCTT CGCGGGCATC GAGACGCGGG CCGTCGAGCC GCTCGACCAG CTGCTGCGCA AGGGCGAGAC CTCCAAGGAG GTCTACGTGC TGCGCCGGCT GCAGGAGGAC CCCGAGACAC CCGCGGCCCG CGGGCACCGC GAGGCCGAGA AGGGCCTCGG CCTGCACTTC GACCTCACCG TGCCGTTCGC GCGGTACGTG CTGGAGAACG CCGGTCACCT CGCGTTCCCG TTCCGCCGCT ACCAGATCCA GAAGGTCTGG CGCGGCGAGC GACCCCAGGA CGGCCGGTTC CGCGAGTTCG TGCAGGCCGA CGTCGACGTG GTCGGGGCGG GGTCGCTGCC GTACCACTAC GAGGTCGAGC TGCCGCTGGT GATGGCCGAC GCGCTCGGTG CGTTGCGCGA CCTCGGCGTG CCGCCCGTGC GGATCCTCGT CAACAACCGC AAGGTCGCCG AGGGGTTCTA CCGCGGCCTG GGCCTCACCG ACGTCGAGGC GGTCCTGCGC GGGATCGACA AGCTCGACAA GATCGGCTCC GACGCGGTCG CCGAGGTGCT CGTCGCCGAA GCCGGCGCCA CGCCCGCCCA GGCCGCCGCG TGCCTCGAGC TCGCGGCGGT CAACGGCGAG GACACCGGCG TCGTCGACCG CGTGCGCGAG CTCGCCGCGC GGTACGACGC GTCGACGGAC CTGCTCGAGG AGGGCCTCGG CGAGCTCGGG GCGCTGGTCG CGGCGGCCGC GGAGCGCGCG CCCGGCGTGC TGGTGGCCGA CCTGAGGATC GCCCGCGGTC TCGACTACTA CACCGGCTCG GTGTACGAGA CGGTGCTCGT CGGGCACGAG GACCTCGGCT CGATCTGCTC CGGCGGGCGC TACGACACGC TCGCGTCCGA CGGTGCGACC ACCTACCCCG GCGTCGGGCT GTCCATCGGC GTCTCGCGCC TGGTCTCGCG CCTGCTGTCG GCAGGTCTGG TCCGGGCGAC GCGGTCCGTC CCCACGGCCG TGCTCGTCGC GGTGACGTCC GAGGAGCGCC GCGCCGCCTC CGACGCCGTC GCCGCCGCGC TGCGCTCCCG CGGCATTCCC ACCGAGGTCG CCCCGAGCGC GTCGAAGTTC GGCAAGCAGA TCAGGCACGC CGACCGGCGC GGCATCCCGT ACGTGTGGTT CGTCGGCGAC ACCTCGGACG ACGGTGCGCC TGCGCAGGAC GAGGTCAAGG ACATCCGGAC CGGCGCGCAG CAGCCTGCCG ACGCGGCGAC GTGGGCACCG CCGGGCGAGG ACCTGCTGCC CCGCGTCGTC GCCGCGGGCT GA
|
Protein sequence | MARPTPLSGF PEWLPDGRIV EQHVLDVLRR TFELHGFAGI ETRAVEPLDQ LLRKGETSKE VYVLRRLQED PETPAARGHR EAEKGLGLHF DLTVPFARYV LENAGHLAFP FRRYQIQKVW RGERPQDGRF REFVQADVDV VGAGSLPYHY EVELPLVMAD ALGALRDLGV PPVRILVNNR KVAEGFYRGL GLTDVEAVLR GIDKLDKIGS DAVAEVLVAE AGATPAQAAA CLELAAVNGE DTGVVDRVRE LAARYDASTD LLEEGLGELG ALVAAAAERA PGVLVADLRI ARGLDYYTGS VYETVLVGHE DLGSICSGGR YDTLASDGAT TYPGVGLSIG VSRLVSRLLS AGLVRATRSV PTAVLVAVTS EERRAASDAV AAALRSRGIP TEVAPSASKF GKQIRHADRR GIPYVWFVGD TSDDGAPAQD EVKDIRTGAQ QPADAATWAP PGEDLLPRVV AAG
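As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first 30 bases of the gene and computes the GC fraction of that prefix. The helper names `translate` and `gc_fraction` are hypothetical. The codon table is built from the standard NCBI ordering; translation table 11 is assumed here to share its codon-to-amino-acid assignments with the standard code (it differs in which start codons are permitted).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Assumption: table 11
# uses the same amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used for 10-base grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the Gene sequence field above.
prefix = "ATGGCCCGAC CCACACCTCT GTCCGGATTC"
print(translate(prefix))
print(gc_fraction(prefix))
```

The translated prefix corresponds to the first ten residues of the Protein sequence field. Passing the full Gene sequence field to `gc_fraction` would give the value to compare against the GC content field.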