Gene Information |
Locus tag | Francci3_1403 |
Symbol | |
ID | 3903384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1688201 |
End bp | 1690753 |
Gene Length | 2553 bp |
Protein Length | 850 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637878740 |
Product | transglutaminase-like |
Protein accession | YP_480509 |
Protein GI | 86740109 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| |
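The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1688201
end_bp = 1690753

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein is one residue shorter than the codon count.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values match the reported 2553 bp and 850 aa.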
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0925352 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00325186 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCCACAG TGATCGCCAC CGTCGCCTGT CTGCTGTCCA CCAGCGCGCT GACCCGTCTG TTCGAGGGTG TCCTCTGGTG GTTCGGGCCG GTGCTCGTTG CCACGACCGT GGCCGTCGGG ACGGGGGTGG CGAGCCGCCT GCTCCGGCTG CCGGCGGCGG CCGAGATCCT GCTCAGTCTG GTCGGCCTCA TCGGCACCGT CACCGTGCTG TCCGCGCGGT CGACGGCGTT GCTGGGCTTC CTGCCGACCG CCACCACGGT CGAGACGCTG CACACGCTCA TCTCCGCCGG CGGCTCCGAC ATCTCACGGC TGGCCGCGCC GGTGCCCGCC CGGCCGGGCC TGGTGGTCCT CACGGTGATC GGCGTCTACC TCCTGGTGAT GATCGTCGAT CTGATCGTGG TGAAGTTCGA CCACCCGACG CTCGGCGGCC TTCCGCTGCT GGCGCTGTAT GCGGTGTCGG CGGCGATTCT GCCCGGGGGG GTCGGCGTGG TGCCCTTCCT GCTCGGCGCG GTGAGCTTCA TCGCCCTGCT CCTGCTCGAC GGGCGGTTGG CGTACGAGCG ATGGGGCCGG ACCGTCTCCG ACCAGCGGAG GGCTGGCGTC GAGATGTTCG GCGGTCGCAT CGCCGTGGGC GCGCTCGGGG TCGCCGTCAC CGCGCTGGTC GCCGCGATCG TGGTGCCGCT TGGCCTGCCG TCCCTTGACG GCGAGGGACT GGTGTCGCGC AAGGGTGGAT CCGGTGCCGG GGACGGTCCG AGTTCGGCCA GCGTCGTGCA GCCGATCGTC TCGGTGAGCC AGCAGCTGCA CGCCAGCACG GAGGTGCCCC TGCTTCGGCT GCGCTCCGAC GAGCCGAAGT ATCTGCGACT GACGGCGCTG GAGAACTTCG ACGGCCAACT GTTCACCCTG CGGGCGTTGA ATGCCACCCG GGAAGACCGG GTCAGCGAGG GATTACCGAA ACCACGGACG GGCGGCGCTA CCCAGTCGGT GCGTGCGGAG GTGGCCGTCA CCGGGCGGTT CAACGAGCTG TATCTGCCGG TTCCGGGCAT CCCAATCCGG ATCGAGGGGC TGACCGGCGA CTGGCGGCTG GCGAGCCCGA CAGGCACGAT CTTCTCGACC CGGACCTCGA CGGCGAACGC GCACTACCAG GTGGACGCGG TGGTGCCGAA TCCCACAAGC GCACAGCTTC AGGCCGCGAC CGGGCCGATC CCGGACTCGC TCGGCGTCGC GACCGCGCTC CCTGACAATC TCGATCCCCA GCTGCGTCAG CTCACCAAGG CCGTGACCGC GGGTGCGCGC ACCCCCTATG AAAAGGTGTA CGCGATCCAG GAATACCTGC GGGGGCCGCA GTTCACCTAC GACCTCCGGG GCGCCCCCAC CACGCAGGAG GGCGCGCTCA GCCAGTTCCT GTTCGACACC CACCGCGGCT ACTGCGAACA GTTCGCCTCG GCCATGACGG TCATGGTGCG GATACTGGGC CTACCCGCCC GGGTCGCGAT CGGCTTCGTC CCCGGGGAGC GGCAGAGCGA CGGCAGCTAC GTCATCACCA ACCGGCAGGC GCACGCCTGG CCCGAGGTCT GGTTCCCGTC GATCGGCTGG ATCAGCTTCG AGCCGACCCG CCGCAGCGAC GGAACCACCT CGGCGCCGTC CTACGCCCCC GCCGGGCCCG AGGACCCCGA CAACGACGCC GCCCTGCCGG AAGAGCAGGC TCAGGCCGAA CCGCAGGCAG TGCCCGAGCC GGTGCCCGTG CCGGATACCT CCACGCCGGC TCCCGGCGAC GATCCGACCG CGATGGGCGC TGCGCGGTCA 
GGTGACGACG CATCGACCTC CACCGAAATT CCCTCCTGGG TGGTCTGGCT GTGCGCTGCC CTGCCCGTCC TGGGCCTGCT AGCCGTTCCG GCGATCATCC GGATCCGGCG GAGGCGGGTC CGGCTGCGCC CGGCCGAAGA CGATCCCCCA GATCCGTCTG GGGCGACGGT GGAACGGGTC CACGACGCCT GGGCGGAACT GCTGGACGTC GCCGCGGACC TGGGAATCAT GATCCGGCCG AACGACTCGC CGCGGGCCGG AGTGGCCCGG CTGACCGCCT ACCTGGACGC CGAGCCGACC ACCGCACAGG CTCCGGGATC CGGCCCGGAA CCGTCTGCGT GGCAGTCCGG GGGCAGCGAG CCCCCCACGA CCTACCCGCA GGCCCGCGCG GCACTCGCCC GACTGGCCTC CGCGGAGGAG CGGGCCCGCT ACGCGCCGCC GGAGATGGCC GCGCCGCCGT CCGTCGCAGA CCTCAGCGAA GATGTCGTCC TGGCCACCAC CACGCTGTTG GCGCTGGCAC CGCGGGCCAG GCGGATGTTG GCGCGGATCG CCCCGGCATC GGTTCTCCGG CGCGCCGCGC CGGCCTCGGC GCGCGGACGG TCGAGGATGG GACGGGGAGG CGGTAAGGAC GACGGACCCG CCACGCCCAC CACGGTTCAG AACGAAGACA ATCAGGACAA TCAGGACTGG CTGGACGCGG TCAGGGACGG TCCTCCCAGC GCCGCCGCCA CCGCTCCTCG ACCCGTTCCA TGA
|
Protein sequence | MPTVIATVAC LLSTSALTRL FEGVLWWFGP VLVATTVAVG TGVASRLLRL PAAAEILLSL VGLIGTVTVL SARSTALLGF LPTATTVETL HTLISAGGSD ISRLAAPVPA RPGLVVLTVI GVYLLVMIVD LIVVKFDHPT LGGLPLLALY AVSAAILPGG VGVVPFLLGA VSFIALLLLD GRLAYERWGR TVSDQRRAGV EMFGGRIAVG ALGVAVTALV AAIVVPLGLP SLDGEGLVSR KGGSGAGDGP SSASVVQPIV SVSQQLHAST EVPLLRLRSD EPKYLRLTAL ENFDGQLFTL RALNATREDR VSEGLPKPRT GGATQSVRAE VAVTGRFNEL YLPVPGIPIR IEGLTGDWRL ASPTGTIFST RTSTANAHYQ VDAVVPNPTS AQLQAATGPI PDSLGVATAL PDNLDPQLRQ LTKAVTAGAR TPYEKVYAIQ EYLRGPQFTY DLRGAPTTQE GALSQFLFDT HRGYCEQFAS AMTVMVRILG LPARVAIGFV PGERQSDGSY VITNRQAHAW PEVWFPSIGW ISFEPTRRSD GTTSAPSYAP AGPEDPDNDA ALPEEQAQAE PQAVPEPVPV PDTSTPAPGD DPTAMGAARS GDDASTSTEI PSWVVWLCAA LPVLGLLAVP AIIRIRRRRV RLRPAEDDPP DPSGATVERV HDAWAELLDV AADLGIMIRP NDSPRAGVAR LTAYLDAEPT TAQAPGSGPE PSAWQSGGSE PPTTYPQARA ALARLASAEE RARYAPPEMA APPSVADLSE DVVLATTTLL ALAPRARRML ARIAPASVLR RAAPASARGR SRMGRGGGKD DGPATPTTVQ NEDNQDNQDW LDAVRDGPPS AAATAPRPVP
|
| |
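The gene sequence begins with GTG while the protein sequence begins with M, which is what translation table 11 predicts when GTG is used as the initiation codon. The sketch below illustrates this on the first 30 nucleotides of the gene sequence. It assumes the sequence is given 5' to 3' on the coding strand (as displayed in the record) and that the first codon is always read as methionine; the function name `translate_cds` is hypothetical, and the codon table is the standard elongation table, which table 11 shares.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments for elongation and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a coding sequence (hypothetical helper, not from the record).

    Spaces are ignored so the 10-base blocks of the record can be pasted
    directly. Translation stops at the first in-frame stop codon.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[dna[index:index + 3]]
        if index == 0:
            aa = "M"  # assumption: the initiation codon is read as Met
        elif aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the "Gene sequence" field.
head = "GTGCCCACAG TGATCGCCAC CGTCGCCTGT"
print(translate_cds(head))
```

The result can be compared with the first ten residues of the protein sequence field, MPTVIATVAC.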