Gene Information |
Locus tag | Franean1_1858 |
Symbol | |
ID | 5670260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2231264 |
End bp | 2232871 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641240779 |
Product | phosphoesterase PA-phosphatase related |
Protein accession | YP_001506202 |
Protein GI | 158313694 |
COG category | [I] Lipid transport and metabolism [R] General function prediction only |
COG ID | [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase |
TIGRFAM ID | |
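
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2231264
END_BP = 2232871
GENE_LENGTH_BP = 1608
PROTEIN_LENGTH_AA = 535


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


computed_gene_len = gene_length(START_BP, END_BP)
computed_protein_len = protein_length(computed_gene_len)

# Both computed values agree with the lengths listed in the record.
print(computed_gene_len == GENE_LENGTH_BP)        # True
print(computed_protein_len == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, the 1608 bp gene corresponds to 536 codons, one of which is the stop codon, which matches the listed 535 aa.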
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.832371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00202108 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCGGCGGA TTCCGCGGCC CGTTCACCGT TCCCGCACCC GTGACCGTGA CCGTTCCCGC GACCTGGTCG GCGAGAGCGA TCTGCCCCGC GTCGGCGACC TGGCCCGCCT GCTCGTCCGG CACGACGTCG CGATCGGCCG GCGTTTCGTG CACACCTTCA TCACGGTCGA CCGCGCGATG TTCTCCGCCA TCGCCGGCGC CCGGCCGCTG GTGGACCCGC TGCTGCCCCG GCTCTCCCAC GCGGCCGACC ACGGCATGCT CTGGTGGGGG GTGGCGGGCG CGCTGGGCGC GACGAAGGGC CGCCGCCGCC CGGCGGCGAT GCGCGGCCTG CTCGCCCTGG GCGTCGCCAG CGCCGTCGCC AACGGCCCGG CCAAGCTCCT GTTCCGCCGG GGCCGGCCGC CGACACACGG CATCCCGCCG TTGCGCCGGC TGCGCCGTGA CCTGACGACC TTCTCGTTCC CGTCGGGGCA CTCGGCCTCG GCGGCCGCCT TCGCCACCGG TGTCGCGCTC GACGCGCCCG CGGCGGCCGT CCCGGTGGTC GCTCTGGCGT CCGCGGTCGC CTTCTCCCGG GTCTACGTCG GTGCCCACTA CCCGGGCGAC GTGGTGGCCG GCGCCGCGCT CGGCATCGGC GCCGGGCTGC TGACGACGAA GGTGATGCCG CGGCGCCCCT GGTCGCCGGC GCGGGCCCGG CCGGCGTCCG CCTGGGCCCC GGCCCTGCCG GACGGGACGG GCCTGGGTGT GGTCGTCAAC GCCCGGTCCG GCGCCGGTCA CCACGCCGAG CTCGCCGCCG TCCTGCGTGC CGACCTGCCC GGTGCGCAGG TCGTCGAGGT CGGCCCCGAC CAGGACGTCG CCGAGGCCCT GGACCGCGTC GCCGCGCACA GCCGGGTCCT CGGCGTCGCC GGCGGCGACG GGACGGTCAA CGCCGCGGCC GCGGCCGCTC TCGCCCGGGG CCTTCCGCTC GCGGTGTTCC CGGCCGGCAC CCTCAACCAC TTCGCCGCCG ACGTCGGCCT GAACAGCGCC GGTGACACGA TCAAGGCGGT CCGCGAAGGC ACGGCCGTCG CGGTGGACGT CGGCAGGGTC GACGGCATCG GCGCGGCCGA CTCCCGGTTC AGCCGGATCT TCGTGAACAC CGCCAGCCTC GGCGGCTACC CGGACATGGT CGGCATCCGG GAGCGTTTCG AGCGCCGGAT CGGCAAGTGG CCGGCGATGA TCATCGCGCT GAGCCGGGTG CTGTGGAGCG ACCCGCCGTT CGACGTCGAG ATCGACGGCG TCACCCGCCG GGTCTGGCTC GTCTTCGTCG GCAACGGCCG CTACCTGCCC GACGGGTTCG CCCCGACATA CCGGACCCGC CTCGACGAGA GCCTGCTCGA CCTGCGGGTC GTCGACGCGA CCGCGCCGCT GGCCCGGCTC CGCCTCGTCG GCGCGGTGCT GACCGGACGT CTCGGCCGGT CACGGGTCTA CGAGCAGCGG ACGGTGGAAC GGGTGCGGAT CTCCTCCAGT CAGCCGTCCC CGCTGCCGTT CGCCAGCGAC GGCGAGGTGA CCGAGGGGAT CCGCCGGATC GCGGTCACGA CGAGCGGCGC CCGTCTCATC GTCTACCGGC CCGAGTAG
|
Protein sequence | MRRIPRPVHR SRTRDRDRSR DLVGESDLPR VGDLARLLVR HDVAIGRRFV HTFITVDRAM FSAIAGARPL VDPLLPRLSH AADHGMLWWG VAGALGATKG RRRPAAMRGL LALGVASAVA NGPAKLLFRR GRPPTHGIPP LRRLRRDLTT FSFPSGHSAS AAAFATGVAL DAPAAAVPVV ALASAVAFSR VYVGAHYPGD VVAGAALGIG AGLLTTKVMP RRPWSPARAR PASAWAPALP DGTGLGVVVN ARSGAGHHAE LAAVLRADLP GAQVVEVGPD QDVAEALDRV AAHSRVLGVA GGDGTVNAAA AAALARGLPL AVFPAGTLNH FAADVGLNSA GDTIKAVREG TAVAVDVGRV DGIGAADSRF SRIFVNTASL GGYPDMVGIR ERFERRIGKW PAMIIALSRV LWSDPPFDVE IDGVTRRVWL VFVGNGRYLP DGFAPTYRTR LDESLLDLRV VDATAPLARL RLVGAVLTGR LGRSRVYEQR TVERVRISSS QPSPLPFASD GEVTEGIRRI AVTTSGARLI VYRPE
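
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-content calculation of the kind summarized in the "GC content" field. The helper names are hypothetical, and the demonstration uses only the first two groups of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated sequence groups into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base groups of the gene sequence shown in the record.
fragment = clean_sequence("ATGCGGCGGA TTCCGCGGCC")

print(len(fragment))         # 20
print(gc_percent(fragment))  # 75.0
```

Pasting the complete "Gene sequence" value into `clean_sequence` and passing the result to `gc_percent` gives the figure for the whole gene, which can then be compared with the rounded percentage listed in the record.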