Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5002 |
Symbol | |
ID | 5673341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5997720 |
End bp | 6000251 |
Gene Length | 2532 bp |
Protein Length | 843 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243856 |
Product | nucleotidyl transferase |
Protein accession | YP_001509272 |
Protein GI | 158316764 |
COG category | [G] Carbohydrate transport and metabolism [J] Translation, ribosomal structure and biogenesis [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1109] Phosphomannomutase [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGAGCCG TCGTGATGGC GGGCGGCGAG GGCACCCGGC TGCGGCCGCT GACCGCGAAC CTGCCGAAGC CCCTGCTGCC GGTCGTCAAC CGCCCGATCA TGGAGCACGT CCTGCGGCTG CTCAAGCGGC ACGGTTTCGA CGAGACCGTG GTGACGGTCC AGTTCCTCGC AGCGATGATC CGCAACTACT TCGGCTCCGG TGACGAGCTG GGCATGCACC TGTCCTACGC GACGGAGACC ACGCCGCTGG GCACCGCGGG CAGCGTGAAG AACGCCGAGG ACGCGCTGCG CCACGAGGAG TTCCTGGTCA TCAGCGGTGA CGCCCTGACC GACATCGACC TGACCGACCT CGTCGCCTAC CACCGGGCGC AGGGCGCGCT GGTCACCGTC GCGCTCAAGT CGGTGCCCGA CCCGCTCGAG TTCGGCATCG TGATCACCGG CGAGGACGGG CGGATCTCCC GGTTCCTGGA GAAGCCGACC TGGGGCCAGG TGTTCTCCGA CACCGTCAAC ACCGGCATCT ACGTGATGGA GCCGGAGGTC CTCGACCACG TCCCGGCCGG CGAGGCGGTC GACTGGTCCG GGGATGTCTT CCCCCGGCTG GTCGCCGCCG GGGCGCCGGT GTTCGGCTAC GTCGCCGGCG GCTACTGGGA GGACGTCGGC ACCATCGCCA GCTTCCAGCG CGCGCAGGCG GACGTGCTGA ACCGGCAGGT GGACGTCTCG ATCGGCGGGT TCGAGGTCTC CCCGGGGGTC TGGATCGGCG AGGACGCCGA CGTCCACCCC GACGCGATCC TCAAGGGCCC GCTGGTCGTC GGCGACTACA GCAAGGTCGA GGCCGGCGCC GAGCTGCGCG AGTTCACCGT GCTCGGCAGC AACGTGGTGG TGAAGCGGGG GGCCTTCCTG CACCGGGTGG TCGTCCAGGA CAACGCGCTC ATCGGCCCGC GGACGAACCT GCGCGGCTGC GTGATCGGCA AGAGCACGGA CGTGCTGCGG GCCGCGCGCA TAGAGGAGGG CGCGGTCATC GGCGACGAGT GCGTCATCCA GGAGGAGGCG TTCGTCTCCC ATGACGTCAA GGTCTACCCG TTCAAGACCA TCGAGGCCGG GGCGGTCGTC AACACCAGCG TCATCTGGGA GTCGCGCGGG CAGCGGTCGC TGTTCGGCCC ACGCGGGGTC TCCGGACTGG TCAACGCGGA GATCACCCCG GAGCTGGTGG TCCGCCTGGC CAGCGCGTAC GCGACGACGC TGAAGAAGGG CTCCACCGTC ACCACCGCGC GGGACGGCTC GCGCGCGGCC CGGGCCCTCA AACGGGCCGT GATCAGCGCC CTCACCGCCG GCGCGATCAA TGTCCGTGAC CTGGAGGTGG CCCCGCTGCC GGTCGCGCGC TTCGACGTCC GCACCTCGGA CGCGGCCGGC GGGATCATGC TGCGCTCCAA GCCGGGCGAC GCCGAGCGCA TCGACATCGT CTTCCTCGAC GCCGACGGCG ACGATCTCTC CCCGGCCGCG CAGCGCAAGC TCGACCGCGT GTTCACCCGG CAGGAGTTCC GCCGCGCGTT CCCCGGCGAG ATCGGCGACC TGCGCTTCCC GGCGCGCACG GCCGACGTCT ACACCCAGGA CCTGCTCGAC CGGGTGGACA CCAGCGGGCT CGCCGAGGCG GATCTCAAGG TGGTCGTCGA CCCGTCCGGG GGAGCGGCGT CGCTGCTGCT GCCCACCCTG CTCGGCCGGC TCGGCGTCGA CGTGCTGACC GTCAACGGCC GGCTGGACGA GACCTACCCC GAGCCCGGCG CCGAGCAGGA GCGCAGGGCT CTGGACCGGC TGGGCGCGCT CGTGGCGAGC TCACGCGCCG CGTTCGGGGT CCGCTTCGAC CACATCGGCG AGCGGATCAC CATCGTCGAC GAGCGCGGCG ACCTCATCAG CGACGAGCGG GCCCTGCTGG TCATGCTCGA CCTCGTCGCG GCGGAGAACC GCGGCGCCCA GGCGGCGGTG CCCGTCACCA CCACCCGGGT CGCCGACCAG GTGGGGCGCT TCCACGGCCT GACCGTGCGG CGGATGTCGA TGTCCGGCTC CGAGCTCTCC CGGGTCGTGC AGGCCGAGCC GATCGTCTTC GCCGCCGACG GGCGGGGCGG ATTCGTCGTG CCGGAGTTCG CTCCCGTGAT CGACGGCCTG GCGGCGTTCG TCCGGCTGGT CGCGCTGGTC GCCCGGACCA GGCTGACGCT CAGCGCGATA GACGCGCGGA TCCCTCCGGT CGCGATGGTG CGGGCGTCGG TGCCCACACC GTGGGCGGAG AAGGGCACTG TCATGCGCCG CGTCGTCGAG TCGGTGGACG TCGACTCGGG CGACCAGGTC GACACCACCG ACGGGGTGCG CGTCGTCGGC CCGGACGGAT CCTGGGTGCT GGTGCTGCCC GACCCGTCCG AGGCGGTGAC CCATCTGTGG GCCGAGGCGG CGGATCTGGG CGGCGCGCAG AAGCTGGTGC GCCGGTGGAG CGCCGTCGTG GAGACGGTGC CGCCGGAGCA GGCTCCCGTC ACCCGTTCAT GA
|
Protein sequence | MRAVVMAGGE GTRLRPLTAN LPKPLLPVVN RPIMEHVLRL LKRHGFDETV VTVQFLAAMI RNYFGSGDEL GMHLSYATET TPLGTAGSVK NAEDALRHEE FLVISGDALT DIDLTDLVAY HRAQGALVTV ALKSVPDPLE FGIVITGEDG RISRFLEKPT WGQVFSDTVN TGIYVMEPEV LDHVPAGEAV DWSGDVFPRL VAAGAPVFGY VAGGYWEDVG TIASFQRAQA DVLNRQVDVS IGGFEVSPGV WIGEDADVHP DAILKGPLVV GDYSKVEAGA ELREFTVLGS NVVVKRGAFL HRVVVQDNAL IGPRTNLRGC VIGKSTDVLR AARIEEGAVI GDECVIQEEA FVSHDVKVYP FKTIEAGAVV NTSVIWESRG QRSLFGPRGV SGLVNAEITP ELVVRLASAY ATTLKKGSTV TTARDGSRAA RALKRAVISA LTAGAINVRD LEVAPLPVAR FDVRTSDAAG GIMLRSKPGD AERIDIVFLD ADGDDLSPAA QRKLDRVFTR QEFRRAFPGE IGDLRFPART ADVYTQDLLD RVDTSGLAEA DLKVVVDPSG GAASLLLPTL LGRLGVDVLT VNGRLDETYP EPGAEQERRA LDRLGALVAS SRAAFGVRFD HIGERITIVD ERGDLISDER ALLVMLDLVA AENRGAQAAV PVTTTRVADQ VGRFHGLTVR RMSMSGSELS RVVQAEPIVF AADGRGGFVV PEFAPVIDGL AAFVRLVALV ARTRLTLSAI DARIPPVAMV RASVPTPWAE KGTVMRRVVE SVDVDSGDQV DTTDGVRVVG PDGSWVLVLP DPSEAVTHLW AEAADLGGAQ KLVRRWSAVV ETVPPEQAPV TRS
|
| |