Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5110 |
Symbol | |
ID | 5673445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6119290 |
End bp | 6121767 |
Gene Length | 2478 bp |
Protein Length | 825 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243961 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001509375 |
Protein GI | 158316867 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.167034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.118743 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGACCC CACGGCCGTT CTCCGCCGCG CTGGGCGGGA TGGCCTGCCT ACTCGCCAGC GCCGCGCTGG CCCCGCTGTT CGACGGGTTC GGCTGGTGGT TCGGCCCCGT CCTGATCGCG ACGGCGGTCG CCGTGAGCAC CGCGGTGCTG GGCCGGCTGC TCGGCGCCCT GTTCCGGCTG CCGGCGTCCA CCGGCATCTG CCTCAGCCTG ATCGGCCTGC TCACCACCCT CACCAGGGTG AGCGCACGGG ACACCGCCCT GCTCGGCGTG TTCCCGACGC CGTCCACCGT GACGGCGCTG CGCGAGCTGG CGCTGGCCGG CAAGCACGAC ATCGGCGAGC TCGCGGTACC GGTGCCGGAA CGGCCCGGCC TGGTGGTCCT GGTCTTCGTC GCCGTGTACC TGGTCGTCAT GGCGGTCGAC CTGATCGTGG TGGTCGTGGA CCGGCCCCCG CTGGCCGGCC TGCCGCTGCT CGGCCTGTTC GTCGTGCCGG CCGCCGTCCT GCCGGCCGGG GTCGGCACGC TTCCCTTCGT CCTCGCCTCG GTCGGCTTCG TCGCGCTGAT GCTGCTGGAC GGCAACCGGA TGGTGACCCG GTGGGGCCGC CCGGTCGGTG ACCGGCCCCC CCGGGTCATC CGCAACGGCC TGGGATCACT CGGTGCCCGG GTGGCCGTCG GGTCGCTGGT GATCGCGGCG GCGGTGCCAC TGCTGGTGCC CTCCCTGGAC GGGCACGGAG TGATTGACAA CGGCGGCGGC GGGCGCTCGG GCGACGGCCC GAGCTCGGCG AGCGTCGTGC AGCCGATCGT CTCGCTCTCC CAGCAGCTCC ATGACGACCG CGAGATCCCG CTCCTGCGCG TCACCACGGA CAATCCGCAG TATCTGCGGC TCACCGCCCT GGAGAACTTC GACGGCCAGC GCTTCACCCT GCGGGCCCTG AACGCGACGA AGGAAGCCCG GGTGAGCGAG GGCCTGCCCG GACCGGAGCG GGGCGTCCGC ACGATCTCGA CCACCGCATC GGTGGCGGTC TCCGGCGAGA TGGCCGAACG TTACCTGCCC GTGCCGGGCA TTCCCACCGA CTTCGACGGA CTCGCCGGCG ACTGGCGCCT CGCCGAACCG ACGGGCACCG TCTTCTCCAC CCGCACCTCC ACCGCCGGGC TGCGCTACAC CGTCAGCGCG GCGGTCCCCG ACCCCACCGC GCAGCAGATC GCCGCCGCCA CCGGCCCGGT GCCCGAGTCG ATGAACGTCG TCACTCAGCT GCCGCAGGAC GCCGACCCAC GGCTGCGGAC ACTGCTCGCC CAGATCACCA CCGGCGCGAG CACCGGCTAC GCCCGGGTCC TCGCCATCCA GAACTTCCTG CGCGGCTCGG AGTTCACCTA CGACCTCAAC GGCGCACCCA CCGTCCAGGA CGGCGCGCTC AGCGAGTTCC TCTTCGAGAG CCGGCGCGGG TACTGCGAAC AGTTCGCGTC CGCAATGACC GTCCTGGTGC GGATGCTGGG GCTGCCCGCC CGGGTGGCCA TCGGTTTCAC GCACGGCACC CGCACCGCCG ACGGCACCTG GGTGATCACC AACAAGCAGG CACACGCCTG GCCCGAGGTC TGGTTCCCGA CCCTGGGCTG GCTCCCGTTC GAGCCGACCC GCCGCTCGGA CGGCGCGACC CCGGCCCCGG ACTACGCCCC GTCGACGACC GAGCCGACCA CCGGCCCGGA CCCGGCCGAA GTCCCCCAGG GGAACGGGGA TGTCGCCGTC GAGCCGACGC CGAGCGCGGT GCCCGTCCCC GACGACCAGG GCGGGGCGGC CGAGGAGCTG ACCGCCGAGG CCGACGACAA GGCCGCGGGC ACCCACAAGG GCACCTTCCC CCCCTCCTGG CTGCCCTGGG TCGGGCTGAG TCTCGGGATC CTGGTCCTGC TCAGCATCCC GGCCCTGTCC AGGGTGGCGC TGCGGCGCCG CCGGATGGGC TCGGGCGGCC CCGACCGCCA GGACGCCGAG GCAGTCGCGC GCGTGCACGC GGCCTGGGCC GAGCTGGTCG ACGTGGCCGC CGACCTCGGC ATCCACCTGC GGACGAGCGA CTCACCGCGC TCGGGCGCGC AGCGGCTTAT CGCCTACCTC GAGGCCGGCC CCGAAGCCGG GTCGGCGGAG GTCGACGCCG CCCGGCAGGC CCTGATCCGG ATGGCAATGG CCCAGGAGCG GGCCCGTTAC GCCCCCGCCG GGATGGCCGC GCCCGATCCG GGCGTGGACG TCCTGGCCGA CCTGGCTCTG GCCCGCCGGG TGCTGTGGTC GGTCGCGCCC CGGGGCCGCC GCGCGATGGC GACGGTGGCC CCGCCGTCGA TGATGCAGCG GGCGCGGGAA ATCCCGGTGC GCGATGTTCT CGGGCGCATC CGGCACCGGG CCGACGACCC GCCGGACAAC GGCGCCGACG ACGACCAGGA GGCCGGGGTC GGTGCCTCCG CCGGCGGGCG GACACACCCG CCGCAGCCGC CCGCCTGA
|
Protein sequence | MVTPRPFSAA LGGMACLLAS AALAPLFDGF GWWFGPVLIA TAVAVSTAVL GRLLGALFRL PASTGICLSL IGLLTTLTRV SARDTALLGV FPTPSTVTAL RELALAGKHD IGELAVPVPE RPGLVVLVFV AVYLVVMAVD LIVVVVDRPP LAGLPLLGLF VVPAAVLPAG VGTLPFVLAS VGFVALMLLD GNRMVTRWGR PVGDRPPRVI RNGLGSLGAR VAVGSLVIAA AVPLLVPSLD GHGVIDNGGG GRSGDGPSSA SVVQPIVSLS QQLHDDREIP LLRVTTDNPQ YLRLTALENF DGQRFTLRAL NATKEARVSE GLPGPERGVR TISTTASVAV SGEMAERYLP VPGIPTDFDG LAGDWRLAEP TGTVFSTRTS TAGLRYTVSA AVPDPTAQQI AAATGPVPES MNVVTQLPQD ADPRLRTLLA QITTGASTGY ARVLAIQNFL RGSEFTYDLN GAPTVQDGAL SEFLFESRRG YCEQFASAMT VLVRMLGLPA RVAIGFTHGT RTADGTWVIT NKQAHAWPEV WFPTLGWLPF EPTRRSDGAT PAPDYAPSTT EPTTGPDPAE VPQGNGDVAV EPTPSAVPVP DDQGGAAEEL TAEADDKAAG THKGTFPPSW LPWVGLSLGI LVLLSIPALS RVALRRRRMG SGGPDRQDAE AVARVHAAWA ELVDVAADLG IHLRTSDSPR SGAQRLIAYL EAGPEAGSAE VDAARQALIR MAMAQERARY APAGMAAPDP GVDVLADLAL ARRVLWSVAP RGRRAMATVA PPSMMQRARE IPVRDVLGRI RHRADDPPDN GADDDQEAGV GASAGGRTHP PQPPA
|
| |