Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4916 |
Symbol | |
ID | 5673256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5903755 |
End bp | 5904606 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641243771 |
Product | NUDIX hydrolase |
Protein accession | YP_001509187 |
Protein GI | 158316679 |
COG category | [L] Replication, recombination and repair [R] General function prediction only |
COG ID | [COG0494] NTP pyrophosphohydrolases including oxidative damage repair enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.933953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00516547 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGCATCA CCAACGAGGC GGTCGGGCGG GTTCTCGCGG CGTACCTGAC CGCCCATCCC GGTGACCGCG AGCGGTTGGG ACCGCTTGTC CGGGCGTCGG CGGCCCCGGC CCGGGGGTCG GCCCGGCTGA CCTCGCGCCG GACGCTGCCC GGCCACGTGA CCTGCTCGAC GGTCGCGGTG AACGACGAAG GCCGCATTCT GCAGATCCAT CACCGGGCGT CGGGCCTCTG GCTCCAGCCG GGAGGCCACA TAGAGGACCG CGACGGCTCG CTGTTCGGTG CGGCGCTGCG GGAGCTGGCC GAGGAGACCG GGATCGGCCG CCGCCGCGTC AGCGCGATCA GCGAGCAGCC GGTGGACGTC GACATCCACT GGGTGTCCGC GAGTGCCGCC CGCGGCGAGC CGGAGCACCT GCACTACGAC TTCCGCTTCC TGGTCGAGAT CCGCGGCTCC GGGCTCGCAC CGGTCGAGAC GGGCCTCGAT CCCGATGATC CGGAGACGAC GCGGCCGCTG TCCGCCCCTC CGCTCGCCGT GCCCGTGGCC ACAGGCGGCC CCGCCGGGCG GACGGCCTCG TCCCCCGGGG TGGGCGCGTC CGGGCCCGTG CCGCCGGGGG TGGCGGGGAC GAAGGTGCCG GGGCCGTCGG CGGCCCCGCC GGGTGCCCAC GGAGCGGCTG GGGCAGCATG CATGCCGATG CCGGGCACGG CGCAGGGAAC GGCGGCCGGA GTGCCGGCGG GGGTGATTCT GCAGGTTGAG GAGGTTGCCG GCGCCCGCTG GGCCGAGGTG TCGGCGCTCG GCGGCCGGCT GGCGCGCCGC GTGGCGGCCG CGCTGGTCGA CCCGGGCGCC AGATGCCGAT GA
|
Protein sequence | MSITNEAVGR VLAAYLTAHP GDRERLGPLV RASAAPARGS ARLTSRRTLP GHVTCSTVAV NDEGRILQIH HRASGLWLQP GGHIEDRDGS LFGAALRELA EETGIGRRRV SAISEQPVDV DIHWVSASAA RGEPEHLHYD FRFLVEIRGS GLAPVETGLD PDDPETTRPL SAPPLAVPVA TGGPAGRTAS SPGVGASGPV PPGVAGTKVP GPSAAPPGAH GAAGAACMPM PGTAQGTAAG VPAGVILQVE EVAGARWAEV SALGGRLARR VAAALVDPGA RCR
|
| |