Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1117 |
Symbol | |
ID | 5669530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1336144 |
End bp | 1337130 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240049 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001505477 |
Protein GI | 158312969 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.113705 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGACTC CTGGGGCACC CGGTGGCCGG CTCACGCCCG CCGGGCCGGC GTCTGCCGCG CCCGACATCG CGGGCGCGCA GCGGGTCACC TACCGGGTGA GTCAGCGGTT CCGGTACACC TACGACGGCA GCGCGACGAA CCTCGACCAC CGGCTTGTCG CGGTGCCCCC GCCGCGGCAC GGCGGCCAGT TCCGCCGGTC CGTCGACCTG CGGGTCTCCG CCCCGGAGGC GCGCACGACC TGGCAGCGCG GGCCGGACGG CCTTCAGATC GCCAACATCC GGATCGACGT CGTGCCGCCG ACGCTGGACT TCGATGTCAC CGTCGTCGTC GAGCGGATCG CCCGGGCCGG GTGGCCGACG CTGCCGGCGT CCGCGCTGAG CAGCCGGCGC CTGCTGACCG CCACCGCTCT CACCTCCCCC ACCCCGGCGA TGATCGACGC GGCGCGCTCG ATGGCCGGGC CCGACCCGGT CGCCACGGCC CGCCGGGTGT GCGGATGGGT CCACGAGCGC ATCGCCTACG TCTCGGGCAG CACCGACGTC GGGACGACCG CCGCCCAGGC GCTCGGCGGC GGGCGCGGCG TCTGCCAGGA CCAGGCCCAC GTGATGATCG CGATGTGCCG CGCGGCCGGC ATCCCCGCCC GGTACGCGCA GGGGCACATG CTGGGTGAGG GCGCCTCGCA TGCCTGGGTG GAGGTGCTGG TGCCCGCGGC CCTCGCTCCG CCCGTCGACG GGGCCTCGCC CGTCGGCGGG GCTGGCGCGC CCGCCGGGGC GGGGGCGGTC GATCCGGGGC CGGTCGAGGC GTTCGCCCCC GGCGGTGTCG GCGCGGCCGC GGTGGCCTTC GACCCCTGCC ACGACCGGCT CGCCGACCTG CGGTACGTCA CGGTGGCGGT CGGTCGCGAC TACCAGGACG TCGCACCGAC CTCGGGTCGC TACGTCGGCG CGGGCCGCGG CGTCCTGGTC GCCACCGCGC GGGTCGACGT CGTCTAA
|
Protein sequence | METPGAPGGR LTPAGPASAA PDIAGAQRVT YRVSQRFRYT YDGSATNLDH RLVAVPPPRH GGQFRRSVDL RVSAPEARTT WQRGPDGLQI ANIRIDVVPP TLDFDVTVVV ERIARAGWPT LPASALSSRR LLTATALTSP TPAMIDAARS MAGPDPVATA RRVCGWVHER IAYVSGSTDV GTTAAQALGG GRGVCQDQAH VMIAMCRAAG IPARYAQGHM LGEGASHAWV EVLVPAALAP PVDGASPVGG AGAPAGAGAV DPGPVEAFAP GGVGAAAVAF DPCHDRLADL RYVTVAVGRD YQDVAPTSGR YVGAGRGVLV ATARVDVV
|
| |