Gene Information |
Locus tag | Franean1_0937 |
Symbol | |
ID | 5669351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1095245 |
End bp | 1097077 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641239864 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_001505299 |
Protein GI | 158312791 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
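
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for Franean1_0937.
start_bp, end_bp = 1095245, 1097077

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1833
print(protein_length(length_bp))  # 610
```

Under these assumptions the computed values agree with the reported Gene Length (1833 bp) and Protein Length (610 aa).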
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0114509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.737923 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCCCAGCA GCGAGCGCGT GTATCCGGTC CGCAGCGGTG TCGCTCCCGA CCGCGTTGCC ACCGTCGCCG AGGCGCCGGC CGGCGCTCCC GTGCAGGCCG TCCCCGCCGG CAGGCCTCCC TCGGGCACCG TGACGACCAC CGCGGCCGGG GATGCCATCG AGGCCGGGGT GGCCGGGGTG GCCGGGTCGG TCAGGGCGGT CGCGGAGCCC GGGCTGGCAC AGGCGGTGGC CGTGCTCACC CGGCCCGACC TCGCCCACGT CGTCGACCTG GTCGCGTGGG TCGAGGACGG CTGGTTGCAC GTCGCGAACG CCGACGGTGC GTCGCGCCTG CCGGTCGACG ATCCGGACGG GCCGTGGGAG ATCCTGCGTG GCCGGGACCC GGTCGCCGAC CAGGACCCCA TGCACGGTGT TCCGCTGGTG GCCGCGCTCG CCGACCCGTC CCCGCCGGCC GCGCGCAACG CCTACCCCTT CGCGGGACGG CGCCTGCTGT CCATGTTCGC CGACCCGACG CGCTCGCCGG ACATCGCCGT CGTCCACACC CCGCGGCACT ACTGGCCCGA GCGGGGCGGC CATCTGGGGG AGCACGGCTC GCTGGACGCC GGGCAGTCCC GGGCGCCGCT CGTTCTGTCC GGGGCAGGTG TGACCGCCCG TGGCCTGCTG CCGCGGGTCG CCCGGGTGAT CGACGTCGGT CCGACCCTGG CGGCGCTCGC CGGGGCGGCG ATGCCCGAGG CGGAGGGAAC CGCGCTCGAC GATCTCGCCG GGCCGGGAGC CCGGCATGTG GTCGGGCTGC TGTGGGACGG GACGAACTGC AACGACCTGC TCGACCTCGC CGCCCGGGGA GAGCTCCCGA ACGTCGCCCG CCTGCTGGCC CGCGGCGTCG CGCTGACCGG CGGCGCCCTG GCCGAGTTCC CGAGCGTGAC CCTCACCAAC CACACCTCGG CGATCACCGG CGTCGGCCCC GGCCGGCACG GCATTCTGCA CAACGTCTAC TTCGACCGCG CGACCGACCG CCAGGTGATC ACCAACGAGG CCGCGACCTG GCACGTCGCC TGCGACCAGC TCCGGGACGG CGTGTCGACG GTGTTCGAGG CGGTGGCCCG CTCCCGCCCC GGGACCGAGA CGGCCTGCGT GAACGAGCCG ATCGACCGCG GGGCGAGCTA CTCGACGTTC GGCCTGGTGC GAGCCCTCGG GGTCGGGCCG GGCGCCGGCG GTGCCGGTGC CGAGGCCGCC GGTGGCGGGA TGGAGGACTA CCTGCCGGCC GCCGAGGGCG ACCCGCACGC GAGCGCCGAA TGGGTCACCG CCGACCCGAA CTACGCCTGG TCGACAAGGG TGGACGCGCT GGGGCTGACC CAGATCACCG ACCTGTGGGC CGACGGCCGG GAACCACCTG TGCTGACCTG GTGGAACACC ACCATCACCG ACACCGGTCA TCACGGCGGC GGGCCCTACT CACCCGAGGC CCGCGCCGCG CTGGCCGACG CCGACCGCCG GCTGGGGGTG TTCCTCGACC TGGTGGAGCG CCGTGGCCTC ACCGACCAGA CGGCCATCCT GCTCACCGCC GACCACGGAT TCGAGGCGGC CGACCCCGAC TGCCGCGGTG ACTGGGACGT CGCCCTGCAC CGCGCCGGGG TGGTCTTCCG GGACGAGGGC TACGGCTTCA TCTACCTGGG CCTCGCCGGC GACGACGAAG CCCCGGCCGA CTCCGCTGAT CCGGGCAGCC CGGCCAACCT CGGAGACCGG GCCACGCCAG GTGGCCCGGC GGGCCAGGGT GGGCCGGGCG GCTCGGGGGC CCAGGCAACG 
GCCTCCCTGC CGCCGCAGCC GCCAGGCACC TGA
|
Protein sequence | MPSSERVYPV RSGVAPDRVA TVAEAPAGAP VQAVPAGRPP SGTVTTTAAG DAIEAGVAGV AGSVRAVAEP GLAQAVAVLT RPDLAHVVDL VAWVEDGWLH VANADGASRL PVDDPDGPWE ILRGRDPVAD QDPMHGVPLV AALADPSPPA ARNAYPFAGR RLLSMFADPT RSPDIAVVHT PRHYWPERGG HLGEHGSLDA GQSRAPLVLS GAGVTARGLL PRVARVIDVG PTLAALAGAA MPEAEGTALD DLAGPGARHV VGLLWDGTNC NDLLDLAARG ELPNVARLLA RGVALTGGAL AEFPSVTLTN HTSAITGVGP GRHGILHNVY FDRATDRQVI TNEAATWHVA CDQLRDGVST VFEAVARSRP GTETACVNEP IDRGASYSTF GLVRALGVGP GAGGAGAEAA GGGMEDYLPA AEGDPHASAE WVTADPNYAW STRVDALGLT QITDLWADGR EPPVLTWWNT TITDTGHHGG GPYSPEARAA LADADRRLGV FLDLVERRGL TDQTAILLTA DHGFEAADPD CRGDWDVALH RAGVVFRDEG YGFIYLGLAG DDEAPADSAD PGSPANLGDR ATPGGPAGQG GPGGSGAQAT ASLPPQPPGT
|