Gene Information |
Locus tag | Cfla_1151 |
Symbol | |
ID | 9145030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1295024 |
End bp | 1297318 |
Gene Length | 2295 bp |
Protein Length | 764 aa |
Translation table | 11 |
GC content | 80% |
IMG OID | |
Product | transglutaminase domain protein |
Protein accession | YP_003636254 |
Protein GI | 296129004 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
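The coordinate and length fields in the record above are related by simple arithmetic, and a short sketch can check them. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1295024, 1297318)
print(length)                  # 2295
print(protein_length(length))  # 764
```

Both results agree with the Gene Length and Protein Length fields listed in the record.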
Plasmid Coverage Information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.301395 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000129974 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGCCCG CACCGGGCGC CCAGGGCGCC GACCGCGGCC CCGGCGCGGC CGTCACGCTC TGGGCCGTCG CGCTGCTCGC GCTGGCGTGC GTCCCCGTCG CGGCCGCGTT CACGTCGCCG GTCGTGCTGC TCCCGCTCGT CGGGGTCGTC GGCCTCGCGT TCGCCGTCGT GGCGCTCGCG CGGCGGCTCG CGGTCCCGCC GTGGGCCACC GGCGCCGCCG GTGTGACGCT CGTGCTCGCG TCCGCGACGT CCCTCGCGCG CCTCGGCGCC GACGCCGCAC CCGTGGCCGA CGCCGACCTC GACGGTGCCG GCTGGCGGAG CACCGGGGCA GCCCAGCTCC TCGGGCCGCT CGCCGACGCG GTCGCACGGC TCCTCACAGC GCCGCGGCCC GCCGCCCCCG CGCTCGCACC GCCGGTCGTC GCGCTCGTCG CACTCGTCGC CCTCACGGTG GCGCTCGCGC TCGGTCGCCG CTCGGCGGCC CGGGTCGCAC CGCTCACGGG CGCGGTCGTC GTGTACGGCG TCGCGCTGCT GCTCACCGCG GGCGCCGCCG ACCGCGGCGC GGTGGGGGCG GGGCTCGTCG CGCTCGCCGC CGCCGGCTGG ATGCTGCTGG ACGGCGAGGC GCTGGCACGC GACGGTGCGC GGCCCCGGCT GGGGCGTGCG CCGCGCCGGG CCCGCACCCG GTGGCGCAGC AGCGGCGTCG CGGGGCTGCT GGTCGTCGCG CTCGTCAGCG CCGGGGGTGC TGCGGCCGCC GCGGCCGTCA CCGGGGACGC CTTCGAGCCG CGTGACCACG TCGCACCGCC GCGTGTGCCG GCCGCCGTCG TGCACCCGCT CGCGGAGCTC TCGCGCTGGC AGTCCGACGC CGACGCCACC GTGCTGCGCG TGCGCGGCAC GCACCCCGGC TACCTCACGT GGGTCACGCT GCCCGACCAC GACGGCGCCG GCTGGCATGC CGACCTCGCC CTGCGCCCGC TCGGGACGGT CGTCGAGCCC TCGCTCCCGC CGGGCGCTGC GCGCGGTGAC GTGGACGTCG AGGTGCAGCT CGTGGAGATC GCCGGGCCGT CGGGCTCCCC GGGCTCGTGG CTGCCGGCGA CCGCCGAGGT CGTGGCCTCG GACGCGCCCG GCGTCCTGGC CGACGTGGAC GCGGGTGTCC TGGCAGTCCC GCCGACGCCC GAGGGGCGGC TGCCGGAGGG GCTCGTCTAC CGCGTGCGCG GGCACGTCGA CGTGCCCGAC CCCGCGCTCG CCGTGCGCGC CGGCGTGCCC GGCGGCGAGG AGGTCGAACG GTACCTGCGG CTCGAGCGGT TCCCCGCGGA CCTGCGTGCC TACGCGCAGG ACGTCGTCGC GGACGCGTCC TCGCGCCTCG ACCAGGCGGA GCGCCTGGCT GCGACGGTGC GCGCGGACCG CGAGCTGGCG GCCGACGCCG TGAGCGGCAC GTCCTACGCG CGCCTGCGCG AGTTCCTCTT CGCGGAGCGG GAGGCCGGGG GGCAGGTCGG TACCAGCGAG CAGTTCGCCG GTGCGTTCGC CGTGCTCGCG CGCGCCGTGG GGTTGCCGTC GCGTGTGGTC ATGGGCTTCG TGGTGCCCGG TGGCGCCGCG ACGGACGACG AGACGCTGCG CGACGTGCAC GGGTCCGACG TGCGTGCCTG GGCCGAGGTG TACCTCGCCG GGACGGGGTG GGTGAGGTTC GACCCGGCGC CGGACGCCGT GACGACGTCG GCCGTCGACC AGGTGCCGCC CGAGCAGCAG GAGCAGACCG CCACCCAGGA CGACCCGCCG CCGGTGCCGG AGGACGAGCA GCCGCAGGAA 
CCCGCGACGC CGCTGCCCGG AGGGCAGGAC CGGCGGGTGG ATCCCGCCGT CGTGATCGTG GCTGCCGCGG CGACGCTCCT GCTGGCCGCG CTCGGGGCGT GGCTCGCCGT GCTGGGTGCG CGCCTCCTGC GCCGTGGCCG GCTGCGCCGT GCCGGTGCCG TCGGGGCGTG GCAGCACGCC GCCGACGCGC TGCTGCTGCG CGCGGGTGCG CCGGCGCCGG GCGAGACAGC CGACGACCTC GCGCGGCGCA TGACCGAGCT GTGCGGCGTG CCCGCCCACG GGTTGGCCAC CGCCGCGCAG GCCGTCGCGT TCGGGGCCTC ACCCGTGCCG GCGCCGGGCG CGTGGCGCAC GGCGGTGCAG GTGCAGCGCC GGCTGCGGGC GGGTGCGCCT CTCGGCCGGC GGCTCACCTG GTGGGCGGAC CCCGCGCCGC TGCGACGGCG TGCGGGCACG GGCGGGCGAG GCGCAGGGCA CCGACCGGGG CCGTCACGGC GGTGA
|
Protein sequence | MTPAPGAQGA DRGPGAAVTL WAVALLALAC VPVAAAFTSP VVLLPLVGVV GLAFAVVALA RRLAVPPWAT GAAGVTLVLA SATSLARLGA DAAPVADADL DGAGWRSTGA AQLLGPLADA VARLLTAPRP AAPALAPPVV ALVALVALTV ALALGRRSAA RVAPLTGAVV VYGVALLLTA GAADRGAVGA GLVALAAAGW MLLDGEALAR DGARPRLGRA PRRARTRWRS SGVAGLLVVA LVSAGGAAAA AAVTGDAFEP RDHVAPPRVP AAVVHPLAEL SRWQSDADAT VLRVRGTHPG YLTWVTLPDH DGAGWHADLA LRPLGTVVEP SLPPGAARGD VDVEVQLVEI AGPSGSPGSW LPATAEVVAS DAPGVLADVD AGVLAVPPTP EGRLPEGLVY RVRGHVDVPD PALAVRAGVP GGEEVERYLR LERFPADLRA YAQDVVADAS SRLDQAERLA ATVRADRELA ADAVSGTSYA RLREFLFAER EAGGQVGTSE QFAGAFAVLA RAVGLPSRVV MGFVVPGGAA TDDETLRDVH GSDVRAWAEV YLAGTGWVRF DPAPDAVTTS AVDQVPPEQQ EQTATQDDPP PVPEDEQPQE PATPLPGGQD RRVDPAVVIV AAAATLLLAA LGAWLAVLGA RLLRRGRLRR AGAVGAWQHA ADALLLRAGA PAPGETADDL ARRMTELCGV PAHGLATAAQ AVAFGASPVP APGAWRTAVQ VQRRLRAGAP LGRRLTWWAD PAPLRRRAGT GGRGAGHRPG PSRR
|
| |
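The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any computation. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate a coding sequence. It makes two assumptions: internal codons under translation table 11 follow the standard codon assignments, and the first codon is a valid start read as methionine (here it is GTG). All function names are hypothetical helpers, not part of any library.

```python
# Standard codon assignments, built from the usual TCAG-ordered amino-acid string.
# Assumption: translation table 11 uses these assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is a valid start and is read as M.
    Trailing bases that do not fill a codon are ignored.
    """
    usable = len(seq) - len(seq) % 3
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


# First three blocks of the gene sequence in this record.
prefix = clean_sequence("GTGACGCCCG CACCGGGCGC CCAGGGCGCC")
print(translate_cds(prefix))  # MTPAPGAQGA
print(gc_fraction(prefix))
```

The translated prefix matches the first block of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence gives a value to compare against the GC content field.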