Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3804 |
Symbol | |
ID | 5672168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4513652 |
End bp | 4516114 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641242683 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001508103 |
Protein GI | 158315595 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.307631 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0951056 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCCCGC CGCCCGGTGC GGGTCCGCCG CCCGGGTCGG CGCCGGCGCC GCCGGCCCCG GATGGCCGGG CGGGCGGGCG GCCGGCGCGC CGGCGGATCC GCGCCGCGCC GGCGCCCACC CCCGTCCGAA CGCCCGGGCC GCGATCGTTC CGACCCGGGC AGGTAGCCCT CGCCGCGGTG CTGGTCGCGC TGGTCGCGGT CGCCTTCACC ACCTTCTTCA CCGGCGGCCC GACCGGGGGC GCGGGGGTCG TCCTGCTCCC GGCGGTGGTG TTGGCGTCCG CGCTCGGCTG CCTCGCCGGG GCCAGGCTGG GCGCCGGCTG GCTGGTCGGC CTGGTCGGCC TGATCGGTGC CCTGCTGTTC GGCGTGCTCG CGCTGTTCGC CGCCAGGTTC GGGGACGGGC TCTCGGCGCT GTCCTCCGAG TTCGGCTCGG CGGCGCGGGA CGGCTGGGCG CGCATGCTCA CCGTCGGGCT GCCCGCGCAC CCCGGCGCGG ACCTGTTGTT CATCCCGGTG TTGGTGCTGT GGCTCGCGGC GTTTGCCGCC GCCGTCCTCA CGGTGCGCAC CGATTCGGTG CTCGCGCCCG TGCTTCCGGC GATCGTCGGC TACGTCGTCG CGCTGCTGCT GGTCGCGGCC CGGGGCCGGT CGCTGCTCGT CCTCACCGGC CTGATCGCCC TCCTTGCGCT GGTGCTCGCG GTTGTCCGGG CGTCCCGGCT GGCAGCCGAA GGGCAGCTCT CCGCGGTGGC GGTGAGGCCG GAGGCGGCGC CCGCGGCCCA GCCCGACGCC GGGCGGGCCG ACGCGGATGG GACCAGCGGC GCCCGTGGCG GCGGAGCGGG CGGGCCGGCG CGGCCCCGGG TGGGTGCGGG TCGGCTCGCG CTCGGCCTGC CGGTCGCGGC TGTGACCGCG CTGCTCGGCA CGATCGGCGC GGCGTTCCTG CCGATCGCCG ACGGCACGGA CCGGTTCGAT CCCCGCGACC ACCGGCATCC CCCGGTCGAG ATCTCCACCT CGCTCAACCC GTTGGTGCAG GTGAAGGCCG CGCACAAGGC GACGGCGGCG CGGAACCTGT TCACGGTGCA GCTGTCCGCG GTCGGCGGCA AGGTCGCGAC CGACCGGCTG CGCACCGTCA CGCTGGCCGA CTTCGACGGT GCCAGCTGGC GCGAGGACGG CACCTTCGTC CGCAGCGGAA GCACGCTGCC GGACGGCGAC GGCCTGGCGC CCACGGTCGG CAACGAGACC CGCATGGAAG TGACCGTCGA CACGGCGAGC GGTCCGTTCC TGCCGTCACT CGGGCGGCCG GTGCGCATCT CCGGCGCCAG CCTCGAGTAC GCGTTCCAGC CCGACGCCGG TGTGCTCGCC GTCGCCGCTC CGGCCCGGAC CGGCGACCAC TACGTCCTCA CCGCGCGCGT GCCCGGCCCC ACCGACCAGC AGGTCCGCGG CGCCGTGCCC GCCTCCGGCC CGGCCGCCGC GCGGTACCTC GAGCTGCCGC CGGGCATGCC GGCGGAGCTG CAGGACCTGG CGTCGCGGGT GATGAGCGGG AAGTCGAGCC CGTACGAGAA GCTCACCGCC CTGGAGGACT TCCTGCGGGA CCAGGCCAAC TACCCGGTGG ACCTGAACGC CCGTCCCGGC CACTCGTACG GTGCGTTGAA GCGGTTCCTG ACCGGCTCCA AGGCCGACAA CCGTGGCTAC GTCGAGCAGT TCGCGACAGC GTTCGCGTTG CTCGCCCGGG CCGAGGGCTT CCCGAGCCGG GTTGCCGTCG GATACCTGCT CGACAGCCGT TCCTCGTCTG CGCCCGGCAG GTTCACGGTG ACGTCGAAGC AGGCGTTCGC CTGGCCGGAG GTGGCCCTCG ACGGTATCGG CTGGGTCGCC TTCGACCCGA CCGATATCAG CAAGCTCGGC GCCACGCCGC CGGCACCCAG CGACGACCAG ACGCCCGGTG GCGAGGGAGC TGCCCCGCAG GCGCAGACAG TCCCTCCCAT CGTCAAACCG GAACTGGACC GGGCGGCCCA GACCGGCGGC GGTGGCGCTG GCGGCGCCCG GAACACCCTG CTGCTCGCGC TGCTGGCGGT CGTTGCCGCG GCGGCCGCCG TCCCGGTCGG GATCGTCGGC GAGAAGGCAC GCCGCCGCCA GCGCCGCCGC GCCGGTACGG CGGCGGCGCG GATCGGCGGC GCCTGGCGGG AGGTCCGGGA CCGGCTGGCC GAACGGGGGG TGGACCGTTC GCGCGCCCTC ACCGCGGACG AGGTCGTCGC ACGCACCCGG GCACTGCGCG GCGATGCTGC CGGTGAGCGG GTGGGCAGTC TCGCGCCGGT GGTGAGCAGC GCGCTGTTCG CCGCGGCGGA GCCGGGCGAA GCCGAGGCAC GGCACGCCTG GGAGCTGGCA GCGGCCGTCA GCCAGGAACT CCACCGGTCC GACAGCCTGT GGCGGCGGGT CGTCGCCGCG GTCGACCCGC GTCCATTGCT GCCGGGGAGA TGA
|
Protein sequence | MPPPPGAGPP PGSAPAPPAP DGRAGGRPAR RRIRAAPAPT PVRTPGPRSF RPGQVALAAV LVALVAVAFT TFFTGGPTGG AGVVLLPAVV LASALGCLAG ARLGAGWLVG LVGLIGALLF GVLALFAARF GDGLSALSSE FGSAARDGWA RMLTVGLPAH PGADLLFIPV LVLWLAAFAA AVLTVRTDSV LAPVLPAIVG YVVALLLVAA RGRSLLVLTG LIALLALVLA VVRASRLAAE GQLSAVAVRP EAAPAAQPDA GRADADGTSG ARGGGAGGPA RPRVGAGRLA LGLPVAAVTA LLGTIGAAFL PIADGTDRFD PRDHRHPPVE ISTSLNPLVQ VKAAHKATAA RNLFTVQLSA VGGKVATDRL RTVTLADFDG ASWREDGTFV RSGSTLPDGD GLAPTVGNET RMEVTVDTAS GPFLPSLGRP VRISGASLEY AFQPDAGVLA VAAPARTGDH YVLTARVPGP TDQQVRGAVP ASGPAAARYL ELPPGMPAEL QDLASRVMSG KSSPYEKLTA LEDFLRDQAN YPVDLNARPG HSYGALKRFL TGSKADNRGY VEQFATAFAL LARAEGFPSR VAVGYLLDSR SSSAPGRFTV TSKQAFAWPE VALDGIGWVA FDPTDISKLG ATPPAPSDDQ TPGGEGAAPQ AQTVPPIVKP ELDRAAQTGG GGAGGARNTL LLALLAVVAA AAAVPVGIVG EKARRRQRRR AGTAAARIGG AWREVRDRLA ERGVDRSRAL TADEVVARTR ALRGDAAGER VGSLAPVVSS ALFAAAEPGE AEARHAWELA AAVSQELHRS DSLWRRVVAA VDPRPLLPGR
|
| |