Gene Information |
Locus tag | Franean1_0805 |
Symbol | |
ID | 5669221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 942152 |
End bp | 943462 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641239733 |
Product | nucleoside triphosphate pyrophosphohydrolase |
Protein accession | YP_001505169 |
Protein GI | 158312661 |
COG category | [R] General function prediction only |
COG ID | [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain |
TIGRFAM ID | [TIGR00444] MazG family protein |
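The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 942152
END_BP = 943462


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1311 436
```

Under those assumptions the result agrees with the listed Gene Length (1311 bp) and Protein Length (436 aa).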
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0143961 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACCC GCATCACCCT GGTCTCGACC AGCGCCCGCG TGGCCCCCGG CCTGCTCACC GCCGCGGCCT GGGACGTGCT CCGCTCGGCC CGGGTCTGGA CGGCGAGCCC GGAGCATCCC CAGGCCGCGG CGCTGCGTGA GGCCGGCGTC GGCGTCTCGG TGCTGCGCCC CGCTCCGCCG TCCGAACCCG GCGTGGGGCT GTCGGCCGAG GCCGGTGTGG TGCCGTCGGG CCGGGCGGAG GCTGATCCGG TCGCCGAGCT GCGGGCGGTG GCCGGGCCGG GCACCCATGT CGCGTGGCTG CTCGACCCGG TCTCGCCGAC AGCCGCCGAC CGCGCGCTGC GCGCCGCCCT CACCGACCAG GACCACCCGG CGGAGCCGGG GGCCGTCGTC GAACTGCTCG TCGCCACCCG CGAGCTGCCC GGGTCGGCGC TGCTCGACGC GGTCGCGGTC ATGGACCGGC TGCGCTCGCC CGGCGGCTGC CCCTGGGACG CCGAGCAGAA CCACGTCTCG CTGGCCCCCT ACCTGCTCGA GGAGGCCTAC GAGGCCTACC AGGCCATCGA GGACGGGGAT CTCGCGGAGC TGCGCGAGGA GCTGGGCGAC GTCCTGATGC AGGTGCTCTT CCACGCGCGG ATCGCCGCCG AGTCCGGCGG GGCGGGCTGG GACGTCGACG ACGTCGCGGC CGGGCTGACC GCCAAGCTGA TCCGCCGCCA CCCGCACGTG TTCGGTGACG TCGCCGTCTC CGGCGCGGAC GACGTCGTCA CCAACTGGGA TGCGATCAAG GCCCAGGAGA AGGGCCGGAA GTCGGTGACG GAGGGTGTGC CGCTCTCCGC GCCGGCGCTC TTCCTGGCCG CCAAGCTGCT ACGGCGGGCC GCGAAGCTCG GGCTCCCGCC GGAGCTGGCC CTTCCCCGCC CCTCGGCCGA CAGCGGCGTC GGGGATGCCG GGGCCGGTCT ACCCGGCCTC GTCGCTGCGC TGGCCCGGGA GGTCGGGACC GCTCGCCCGG GAGATCGGGC GTCCGCTGAT CAGAGTGACG GTCCTGGTAC CGAGGCGGGC ACCACCGCCG AGGAGCGGAT CGGGGACCTG CTGTTCGCCG CGGTCGTGCT GGCCGGGGAG GAGGGGGTCG ACCCGGAGAC CGCGCTGCGC GCACGGGCCC GGCTGTTCCG GGACACGCTG GCCCGGGCCG AGCACGCCGC CCTCGCCCGC GGCGAGGAGC CCCGCGGGCT GGCTGCCGAC ATCTGGCGAT CACTGTGGGT GTCCGCGAGC GTTCCCCCGG GTGGGCCGGC AGCCACGGAC GGGCCTGTGC ACGGGGCCTG A
|
Protein sequence | MTTRITLVST SARVAPGLLT AAAWDVLRSA RVWTASPEHP QAAALREAGV GVSVLRPAPP SEPGVGLSAE AGVVPSGRAE ADPVAELRAV AGPGTHVAWL LDPVSPTAAD RALRAALTDQ DHPAEPGAVV ELLVATRELP GSALLDAVAV MDRLRSPGGC PWDAEQNHVS LAPYLLEEAY EAYQAIEDGD LAELREELGD VLMQVLFHAR IAAESGGAGW DVDDVAAGLT AKLIRRHPHV FGDVAVSGAD DVVTNWDAIK AQEKGRKSVT EGVPLSAPAL FLAAKLLRRA AKLGLPPELA LPRPSADSGV GDAGAGLPGL VAALAREVGT ARPGDRASAD QSDGPGTEAG TTAEERIGDL LFAAVVLAGE EGVDPETALR ARARLFRDTL ARAEHAALAR GEEPRGLAAD IWRSLWVSAS VPPGGPAATD GPVHGA
|
| |
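The relationship between the gene sequence and the protein sequence can be sketched in plain Python. This is a minimal illustration, not the pipeline that produced the record. It assumes that translation table 11 shares its amino-acid assignments with the standard code and differs only in the accepted start codons; the `START_CODONS` set below is written from memory of the NCBI table and should be treated as an assumption. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names. The example input is just the first 30 bases of the gene sequence.

```python
from itertools import product

# Standard amino-acid assignments in TCAG codon order (assumed to hold for table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed set of table 11 initiation codons; all are read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is translated as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
fragment = "GTGACCACCC GCATCACCCT GGTCTCGACC"
print(translate(fragment))        # MTTRITLVST
print(gc_fraction("GTGACCACCC"))  # 0.7
```

The translated fragment matches the first block of the protein sequence, including the GTG start codon being read as M. Running `gc_fraction` on the full gene sequence is how one would compare against the listed GC content.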