Gene Information |
Locus tag | Cfla_2195 |
Symbol | |
ID | 9146095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 2443023 |
End bp | 2445395 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | transglutaminase domain protein |
Protein accession | YP_003637285 |
Protein GI | 296130035 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.330551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0122115 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCCCG GGCAGCGGGT CGGCGTGCGC GAGCCCGCGG ACGCCGTGCG CGACGTGGTC GTGCTCGTCC TCGCGACGGG GCTGGCGCTG ACGCCGCTCC TGCCGGCCTA CGGCACGGCC GCGGTGCTGC CGGCGCTCCT GGGCGGCGTC GTCCTGGGCG GTGGCGTCGT GGTCGTCACG GCCGTGCGGC GCTCGTCCGC GCTGGTGGTC GTCGCGTCCC TGCTCGGTGC GTACGCGCTC GCGGGCGGCG CGCTGGCCGC CCCGACGACG ACGGTCGCCG GCGTGGTGCC GACGCCCGCG ACGCTCGTCG CGCTCGCGCG CGGTGCCGCG TCGACGTGGA AGCAGGTGCT GACCCTGCAG CCGCCGGTCG GCACCGGCGG CACGCTGCTC GTGGCGGCGC TGCTGCTCGC GCTCGTCGGC ACGGCGGTCG CGTTGTCGCT GACCCTGCGG GTCCGTCGTC CGGCCGTGGC CGCGCTGGCG GCGGTCGTCC CGCTCGCCGT GCTCGTCGGG TCGATCGTGC TCGGCACGCG GCAGCCGCCC GTGCCGCCCG CCGTCGCCGG GACGGTGCTC GTCGCGACGT GCCTGGTCTG GGCCGCGTGG CGGGTCGGCT CCTGGCGGCC CCGCCGCGTG GTGGCGACCG CGGCGCTCGC GGCCGTCGCG GCGACCGGTG GCCTGCTCGG TGGTCCGCTC GTCGTCGCGG ACGAGCCGCG CCTCGTGGTC CGCGACGAGA TCGTGCCGCC GTTCGACCCG CGTGACTACC CGAGCCCCCT CGCGTCGTTC CGACGCCTGG TGAAGCTCGA CGACACGGTG CTGATGACGG TCGACGGCCT GCCCGCCGGG GCGCGTGTGC GGCTCGCGAC GTTCGACCGG TACGACGGTG TCGTGTTCAA CGTGGCGGGG GACGGCTCCG CGCAGGCGTC GGGGGAGTTC CGCCGGGTGG GCGACGAGAT CGACGTCCCC GTGCGCGGCA CGCCGGCCCG CGTCGAGGTG ACGGTCGGGG ACCTGCAGGG CGTGTGGCTG CCGACCGTCG GGTACGCACG GTCCGTCGAC CTGGGGCGGC AGGCCTCCGA CCTGCGCTAC AACGACGTCA CGGGAGCGGC CGTCGTCACG TCCGGCGTGC GCCCCGGCAT GACGTACACG ATGGAGGTCG TCGTGCCGGA CCGGCCGGAC GACGACGAGG TCGGCCGCGC GTCCGCGGCC GCCGTCTCGC CGCCGACGTC CGCCCGTCTG CAGGTCGTCG ACGCCGCGGC GTCCGACGTC GCACGCGACG CCGGGTCGCC CGTCCAGGTC GCACGTGCGA TCTCCCGGTA CCTGTCCGAG GACGGCTACT TCAGCCACGG CCTGACCGAC GCGGGCGACC ACCCGTCCCT GTCGGGCCAC GGCGCCGCGC GCGTCGCCGA GCTCCTCGCG GGCGAGATCA TGGTCGGTGA CGGCGAGCAG TACGCCGCTG CGGCCGCGCT GATGATCCAG GCGAGGGGTC TGCCGGCGCG GGTCGTCCTC GGGTTCGTCC CCGGGTCGGG CGACGACGAG GACGGGACCG GGGACGTGCC CCCGCCGACC GACGAGGACG GTGCCGTCGA GATCCGCGGC CGCGACGTCC AGGCGTGGGT CGAGGTGGCC TTCGCCGGGC ACGGCTGGGT GCCGTTCGAC GTGACGCCGC CGCGCTCGCG GACCCCCGAG CAGGAGCAGG AGGAGACGGA CACCGACCCG CAGCCGCAGG TGGTGCAGCC GCCCGCCCTG CCGCCGGACC GGGTCACGCC GCCCGAGGAC GACACCGAGC AGCCGCAGAC GGAGGACCCG 
CCCGCGGACG ACTCCGGGCT CGCGCTGTGG CTGCGGATCG CCGCCTGGGC GGGCGTCGGT CTCGCGGGTG TCCTCGTGCT GCTGTCGCCG CTGCTCGTCG TCGTCGCGCT CAAGGCGCGC CGGCGCCGGC GGCGGCGCCG CGCGGCGGAC CCCGTGCGAC GTGTCGCCGG AGGTTGGGAC GAGGTCGTCG ACACCGCCCG CGACATGGGC GGCGAGCCCC CGGCGGGCGG CACGCGACGC GAGACCGCGC GGCTGTGGGG GCGCACGCTC GTGGAGCGGC ACCCCGCGGT GGCCGCGCGC GTCGACGCTC TCGCGCGCCG CGCGGACCGT GCGGTGTTCG CACCGGGTGT GCCGGACCGC TCCGACGTCG AGGCGTACTG GGCGGACGTC GACGCGACCG TGAGTGCGCT GCGCCGGACC CTGCCCTGGC GTCGCCGGGT GCGCACGCGG GCGGCGCTCA CCTCGCTGCG CCGCCCGTCG GCCGCGCGTC CCCGCGAGCC CCGTCGGCAG CCGGCACCCA CGGCGCCGCC CGCCCCACCG ACGACCCGCC GAGCACGCCG AGGTGAGCGA TGA
|
Protein sequence | MRPGQRVGVR EPADAVRDVV VLVLATGLAL TPLLPAYGTA AVLPALLGGV VLGGGVVVVT AVRRSSALVV VASLLGAYAL AGGALAAPTT TVAGVVPTPA TLVALARGAA STWKQVLTLQ PPVGTGGTLL VAALLLALVG TAVALSLTLR VRRPAVAALA AVVPLAVLVG SIVLGTRQPP VPPAVAGTVL VATCLVWAAW RVGSWRPRRV VATAALAAVA ATGGLLGGPL VVADEPRLVV RDEIVPPFDP RDYPSPLASF RRLVKLDDTV LMTVDGLPAG ARVRLATFDR YDGVVFNVAG DGSAQASGEF RRVGDEIDVP VRGTPARVEV TVGDLQGVWL PTVGYARSVD LGRQASDLRY NDVTGAAVVT SGVRPGMTYT MEVVVPDRPD DDEVGRASAA AVSPPTSARL QVVDAAASDV ARDAGSPVQV ARAISRYLSE DGYFSHGLTD AGDHPSLSGH GAARVAELLA GEIMVGDGEQ YAAAAALMIQ ARGLPARVVL GFVPGSGDDE DGTGDVPPPT DEDGAVEIRG RDVQAWVEVA FAGHGWVPFD VTPPRSRTPE QEQEETDTDP QPQVVQPPAL PPDRVTPPED DTEQPQTEDP PADDSGLALW LRIAAWAGVG LAGVLVLLSP LLVVVALKAR RRRRRRRAAD PVRRVAGGWD EVVDTARDMG GEPPAGGTRR ETARLWGRTL VERHPAVAAR VDALARRADR AVFAPGVPDR SDVEAYWADV DATVSALRRT LPWRRRVRTR AALTSLRRPS AARPREPRRQ PAPTAPPAPP TTRRARRGER
|
| |
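The coordinate, length, and composition fields in this record are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon (both are consistent with the values above, but are assumptions about the database's conventions). The function names `gene_length`, `expected_protein_length`, and `gc_fraction` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace (e.g. 10-base grouping) is ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# Values taken from the record above.
length_bp = gene_length(2443023, 2445395)
print(length_bp)                           # 2373, matching "Gene Length"
print(expected_protein_length(length_bp))  # 790, matching "Protein Length"
```

Passing the full gene sequence (with its space-separated 10-base groups) to `gc_fraction` gives a value that can be compared against the reported GC content.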