Gene Information |
Locus tag | Franean1_0331 |
Symbol | |
ID | 5668755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 397246 |
End bp | 398352 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239262 |
Product | NUDIX hydrolase |
Protein accession | YP_001504703 |
Protein GI | 158312195 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.100097 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGATC ATCAGGCGGC AATCGTCGCC CGGTTCGGCA CGTCCCGCCG GATGGAACCG GAGGAGGAAC TCCGGCTGGC CCGGCTGGCC GCCAACCCGG CACCCGCCCG GGACGCGGCG ACCGTTGTGC TCCTGCGCGA CGCGCCGGCC GGTTCCGGCA TCGAGGCCTA CCTGCTCAGG CGCACCCGGG CGATGTCGTT CGCGGGCGGC ATGCACGTCT TCCCGGGCGG GCGGGTCGAC CCGTCGGACG CCGCAGAGGA TCTCCCGTGG GTCGGGCCCT CCGTCGAGGA GGCGATGCCG GGGCTGGACG ACGATCCCGC CCGCGCGCGG GCCTTGGTCT GCGCGGCCGT CCGCGAGACC TTCGAGGAGT GCGGCGTCCT GCTCGCCGTA CCGACGGGCA CATCCGCAGC CAGCAGAGTC GCGCCGGCCG GCGGAGCCAC GCCGACGGGC AGGAACGTGC CGGCCGGACG TGTCGACGCG GCGGGCGGCG CTGCCGGTAC CGGCGATCCG GGTACCGGCG ATCCGGGCTG GGCCGCCGAG CGGCGGGCGG TGGAGAGCCA CCGCAGTGGC CTCGCCGAGC TGCTGACCCG CCGCGGGCTG GCCCTGCGGG CCGACCTGCT GGCCCCGTGG ACCCGTTGGA TAGCCCCCGA GCTGGAGCCA CGGCGGTACG ACACCAGGTT CTTCGTCGCC GCGCTGCCGG CCGGGCAGCT GCCGGGCGAG CTCGCGACGG AACTCTCGAC CGAGGCTGAC GGGATGCTGT GGATCCGTCC GGCGGAGGCG ATGGAGCGGT TTGTCGCCGG CGAGATCGGC ATGCTCCCGC CCACCGCCTT CACCCTCGCG GAGCTGTCGG CCTACGACGA CGTCGCCGGT GCGCTCGCGG CCGCGCGCAC CCGCGACCTG AAGCCGATCA TGGCAAGGAT CATCGCCGGC GACGGCACCT GGCAGCTGTC GTTCCCACAC CTGTTACCGC TGGACGGTGC CCCCGGCACA CCGCTGGACG GTGCCCCCGG CACACCGCTG GACGGTGCCC TCGGCACCGA GCCGGGCACG CCTTCGAAAA CCGCCCCGGC ACCGGCCGCG GGCACCCGCG GTGGTGTCCC GCGGTGA
|
Protein sequence | MADHQAAIVA RFGTSRRMEP EEELRLARLA ANPAPARDAA TVVLLRDAPA GSGIEAYLLR RTRAMSFAGG MHVFPGGRVD PSDAAEDLPW VGPSVEEAMP GLDDDPARAR ALVCAAVRET FEECGVLLAV PTGTSAASRV APAGGATPTG RNVPAGRVDA AGGAAGTGDP GTGDPGWAAE RRAVESHRSG LAELLTRRGL ALRADLLAPW TRWIAPELEP RRYDTRFFVA ALPAGQLPGE LATELSTEAD GMLWIRPAEA MERFVAGEIG MLPPTAFTLA ELSAYDDVAG ALAAARTRDL KPIMARIIAG DGTWQLSFPH LLPLDGAPGT PLDGAPGTPL DGALGTEPGT PSKTAPAPAA GTRGGVPR
|
| |
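The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only illustrative. It assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon (the gene sequence above ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final (stop) codon encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the record for Franean1_0331.
start_bp, end_bp = 397246, 398352

length = gene_length_bp(start_bp, end_bp)
print(length)                     # compare with "Gene Length"
print(protein_length_aa(length))  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the record's Gene Length (1107 bp) and Protein Length (368 aa) fields.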