Gene Information |
Locus tag | Franean1_0636 |
Symbol | |
ID | 5669053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 738776 |
End bp | 741268 |
Gene Length | 2493 bp |
Protein Length | 830 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641239563 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001505001 |
Protein GI | 158312493 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
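The coordinate and length fields in this record agree with one another, which can be checked with a short sketch. The function names here are illustrative and not part of the source database. The sketch assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 738776, End bp 741268.
gene_len = gene_length_bp(738776, 741268)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)
```

Under these assumptions the computed values match the Gene Length (2493 bp) and Protein Length (830 aa) fields.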
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00254265 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.770826 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGGCG CCGCGGGCCC GGCCCGCCGA GCCACCTTCG CCCCGAGCGA GGCCGGTCTA CGCCGCATCG CCGCGCTGGC CGTCGTCGCC GCCCTCGCCC TGGTGGCCGG GACGGGATTC ACCCGCCTCT TCCCGACCCG GGACCTGTGG GTGCTGCTGC CGGTGGCGGC GGTCCTGCCC GTCGTCCTGG TCGGCGCGCT CTCCCGGCAC GGCCGCCCGG TCTCGCCGGC GCTGACCGTG CCCGTATGGC TGGCCGGCTT CGTAGCCTGG ACGGCTTACA CCGTCGCCGC CGGATCGGGC GATCTCATGG CCCGCCTCGA TGTGGTCCGC ACGGGCGTCG TCGACGGGTG GGCCCGGGTC CTCGACATGG GCGTCCCCGC CCCGGCCGAC CCCGACCTGC TGATCGTCGC GCTCGCCCCC ACCTGGCTCG CCGCGGCGCT CGGCGCCGAG CTCGTTGTGC GGACCCGCGC GGCGCTCGCC CCGGCACTGC CCGCCGCCGT AGCCCTTCTC GCCGCGTCCG CGCTGGCCGT CCCCGCCCCC GGGGACAACC TGGCCCGGGC CGGTGCCCTC GCCGCCCTGA CCGCGCTGTT TCTCATGATC AGAGCACCCC GGGCCGCGGG CCGCGGCCCG CGCCGTGAGC TCGCGCGCCG GGGCGGCGCG ACACTGCTCG TGGTCGTGGT CGGCGTGCTC GCCGGCACGG CCGCGACGAG GGCCGGCGGC GGGGACCCCG TCGACCCCCG CACCCACCGC TCGACACCAC CGGTGAGCCA GACCGAGCCG AGCCCGCTCA GCGCACTCGG CGGCTGGACC GCCCATCCGG ACGAGGTGCT CTTCCACACC GACGTCAGCG GGCCGGCCCC GGCCGAGCCG GTGACCCTCC GGCTCGCCGT CCTCGACTCC TACGACGGCG CCCAGTGGCG GTCGACCGCC CGTTTCGTTC GGGCCGGCTC GGGGCCACCC TCATCGCAAG CCGGGATCGC CGACGACCCG GCCGGCGATC CCGCTGCCGG CGACCGGGCC GCTGATCCAG CAGGTGACGG GCCCGGCCCG GCACCGGTGG GCGAGATGCG GCAGGTCATC GAGATCGCCG GACTGGGCGG GCGGATCCTC CCGTCGGACG GGCGGCTGGT GGGCGCCCCC ACCGGCGTCC GGGTCGATCC CGGCACCGGG ACCCTGCTGA ACGACCGTCC GCTGCTTCCC GGCGACCGTT ACGAGATCAC TTCAGCGCCG GACCCCCGGC CCGCGCCGGC GGATCTCCCC AGGTTCGACG CCGGCACCGC CCGAACGGGC GGGCCCGATC CCGACCTCGA GGTCCCCGCC GACCCGCCGG CCATCCTCGG CCGTCTGGCG GATATCGCGA CGGCCCGGGG GAGCACCCCG TTCCAACGGG CCGCGCTGCT GCGGCAGTAC CTGAGCGCGA CCTTCACCTT CGACCCGGCC GTCCCACCCG GGCACTCGTA CGGGCACATC GACCACTTCC TCGCCCACAC CCACCGCGGG ACGTCCGAGC AGTTCGCGAC CGCCTTCGTC CTCGCGGCCC GCATCCTTGG GCTGCCCGCC CGGCTTGCCG TGGGATTCAC CGCGCTCCCC GCGGCGGACG GCCAGCCCCG CACCGTGCAC GGCGCCGACG CGCTCGCCTG GGCCGAGGTG CGCTTCGACG GCGCCGGCTG GCTGCCGTTC TTCCCCACCC CGCCGGCCGC GGACGCCCGC GGCGCGAGCG TGGCGGGATC GAATCCGGGC GAGACCCCCG AGCAGGCGGA ACTGATCGAC GTCGCCCTGC GCTCCGCGGT CGAGACCACG 
GCACCGGCCA CCCGGGCCAC GGAGGACTCC ACCCCAGCGG CCACGCCGCC CGGCCCGGGT GCCTGGTCAC GTTGGGCCAT CCGGAGCGGC CTGGCGATCG CCGGCGCCGC GGCGCTGTAT CTCGCGCTCG CCGTCGCGCT GCCCTCGCTC CGGCGGGGCC GCCTGCGCCG CGCGGGCCAC CCACGACGCA GGGTGGTCGA CGCCTGGCGG CAGGCCGTCG ACGCGCTCGC CGACGCGGGC CTGCCCGTCC CGGCCGCCGC GAGCCCGGGA GAAGTCGCCC GGCTCGCCGC GGCCGAGGTG GGCCCGTCCG GCGAGACGGC GATCCGCGAG CTGGCCGACA TCGTGACACT CGCCCTGTTC GCCCCCGTGA CCGCGGTGGA ATGGGAGGGC ACCCCCGGCC GCCGCGCCGC GGACGAGGCC TGCCGCCTCC TCGACCGGTT CGAGCGGGCC CTGCGCGAGC GCACGACACG CCGAGCGCGG ATCCGCCGGA CGCTGGCCCC GAGCACCGTG GCCACCGAGC TCCGCCGGCT GCGGGATGCA CGGGACCCGC GGGATCCGGG AGATCCGCCG ACCGGGGCCG CGGGGGACCC GGACGGCCCG TGCGGCCCCG GCACCGACCA CGGGCGGCCC GATCCGGGCA CCCCGCCGGC CGGGTATGTC TCGGACGGGA AGCGCTTTGC CCCGGCGGCG TGA
|
Protein sequence | MTGAAGPARR ATFAPSEAGL RRIAALAVVA ALALVAGTGF TRLFPTRDLW VLLPVAAVLP VVLVGALSRH GRPVSPALTV PVWLAGFVAW TAYTVAAGSG DLMARLDVVR TGVVDGWARV LDMGVPAPAD PDLLIVALAP TWLAAALGAE LVVRTRAALA PALPAAVALL AASALAVPAP GDNLARAGAL AALTALFLMI RAPRAAGRGP RRELARRGGA TLLVVVVGVL AGTAATRAGG GDPVDPRTHR STPPVSQTEP SPLSALGGWT AHPDEVLFHT DVSGPAPAEP VTLRLAVLDS YDGAQWRSTA RFVRAGSGPP SSQAGIADDP AGDPAAGDRA ADPAGDGPGP APVGEMRQVI EIAGLGGRIL PSDGRLVGAP TGVRVDPGTG TLLNDRPLLP GDRYEITSAP DPRPAPADLP RFDAGTARTG GPDPDLEVPA DPPAILGRLA DIATARGSTP FQRAALLRQY LSATFTFDPA VPPGHSYGHI DHFLAHTHRG TSEQFATAFV LAARILGLPA RLAVGFTALP AADGQPRTVH GADALAWAEV RFDGAGWLPF FPTPPAADAR GASVAGSNPG ETPEQAELID VALRSAVETT APATRATEDS TPAATPPGPG AWSRWAIRSG LAIAGAAALY LALAVALPSL RRGRLRRAGH PRRRVVDAWR QAVDALADAG LPVPAAASPG EVARLAAAEV GPSGETAIRE LADIVTLALF APVTAVEWEG TPGRRAADEA CRLLDRFERA LRERTTRRAR IRRTLAPSTV ATELRRLRDA RDPRDPGDPP TGAAGDPDGP CGPGTDHGRP DPGTPPAGYV SDGKRFAPAA
|
| |