Gene Information |
Locus tag | Franean1_6129 |
Symbol | |
ID | 5674450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7457150 |
End bp | 7458964 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244981 |
Product | uroporphyrinogen III synthase HEM4 |
Protein accession | YP_001510379 |
Protein GI | 158317871 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0007] Uroporphyrinogen-III methylase; [COG1587] Uroporphyrinogen-III synthase |
TIGRFAM ID | |
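The coordinates and lengths in this record agree with one another, and the check is simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (which the reported 1815 bp length implies) and a CDS that ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(7457150, 7458964)
print(length, protein_length(length))
```

With the values above, the computed lengths match the Gene Length (1815 bp) and Protein Length (604 aa) rows.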
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.143048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.442264 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCACCC GACGTACCAA GAAGCCCGTG TCCCCGGTCG CGCTTGTGGG CGCCGGTCCC CGTGATCCGG GGCTGCTCAC GGTCCTCGCC GTGGAGACTC TCACCGCTGC CGACGTGGTG GTCGCCGACC CGGACGTACC GTCCGAGGTG GTCGAGTCGC TGTCGGCCGA GGTGCTGCGG ATCGGTGACC TCGACGCCCC GAAGCCGGTG CGGGACGCGG AGGCCGCGAC CGCCGCGGTC GTCAGCCGGG CACGGGCCGG GGACAAGGTC GTCCGGCTCT ACGCCTCGGA CCCGTGGCTG ACCCGGATCG GCGCGGCGGA CGCGCAGTCG CTGGCCAAGG CCAAGATCCC CTACCGGGTG GTTCCCGGTA TCTCGACCTC CGCCGCGGTC GCGACCTACG CCGGTGTCGC GCCGGGCAGC CCGGTCACCT TCGCCAGCAC GTCGGGTGTC TTCTCGTCCT CGGGCTCGGT CGCCTCGGCC TCACCGTTCG GCGTCGGTGA CCCGGTCGGC CCGCTCACCC CGCCCCCCTT CGGGACGTCC CGCCTCGGCG GCCGGTCGAT GCCGAGCCTC GGCGGCGGCC CGCTGGGCGT GGGCGGCGGC TTCGGCGCGC CCCTCGGCCT GGGCACCCCT ACCGGCTTCG GTGGTCTCGG CGTGCCCACC ACGCCGGGCG ACGTCGACTG GGGCGCGCTC GCGCAGGCCC CCGGCACGCT GATCGTCACC GCCGGCCCCA CCGAGATCGG CAAGGTGGCG ACCGCGCTGG TCGAGCACGG CCGCGCCGGT GACACCCCCG TCGCGGTCAC CGTCGACGGC ACGACCACCG ACCAGCGCAC CGTCACCTCG ACCCTCGACC GGATCGAGGC CGACGTCGCG CCGATGCTCA ACGCCACCGC GAACCCGCCG AACGAGGTCA TCATCTCCGT CGGCCCGGTC GTCGCGACCC GGGCGAAGCT GTCCTGGTGG GAGACCCGCG CCCTGTTCGG CTGGACGGTG CTGGTGCCCC GGACGAAGGA ACAGGCGGCG ATCCTCTCCG ACTCGCTGCG CGCCCACGGG GCGAGTCCGC TGGAGGTGCC GACGATCGCC GTCGAGCCGC CGCGGACGGC CGCGCCGATG GAGCGCGCCA TCACCGGGCT GGTCTCCGGC CGCTACCAGT GGGTCGCCTT CACCTCGGTG AACGCCGTCA AGGCGGTGCA GGAGAAGGTC GAGGAGCGCA GCCTGGACGC CCGCGCCTTC GCCGGTGTCA AGGTCGCCGC GATCGGCGAG GCCACCGCGG ACGCGCTGCG CGCCTTCGGT ATCCGCCCCG ACCTGGTGCC CGCCGGCCAG CAGTCCAGCG AGGGCCTGCT CGAGGACTGG CCCGAGTTCG ACGAGTCGCT TGACCTGCTC GACCGGGTTC TCCTGCCGCG CGCCGACATC GCCACCGACA CTCTCGTCGC CGGCGTCAAG GACCGCGGCT GGCAGGTGGA CGACGTCACC GCCTACCGGA CGGTGCGCGC CGCGCCGCCG CCCGCGCCGA TCCGCGAGGC CCTCAAGGGC GGCCGGGTCG ACGCGGTGGT CTTCACCTCC TCCTCCACGG TGCGCAACCT GGTCGGAATC GCCGGCAAGC CGCACGAGAC CACCGTCATC GCGGTGATCG GCCCGGCGAC GGCCGCGACC GCCCAGGAGC TCGGCCTGCG GGTGGACGTC CAGGCGACCG AGGCGTCGAT CCCGTCGCTC GTCGCGTCGC TGGCGGAGTT CGCCGCCGAG CACCGCGAGG AGCTCGGCAA GGTCGGCCCG CTCGCCGCCA GGCTGCCCAA GCCGCGCCGG 
GGTTCCCGGC GATGA
Protein sequence | MATRRTKKPV SPVALVGAGP RDPGLLTVLA VETLTAADVV VADPDVPSEV VESLSAEVLR IGDLDAPKPV RDAEAATAAV VSRARAGDKV VRLYASDPWL TRIGAADAQS LAKAKIPYRV VPGISTSAAV ATYAGVAPGS PVTFASTSGV FSSSGSVASA SPFGVGDPVG PLTPPPFGTS RLGGRSMPSL GGGPLGVGGG FGAPLGLGTP TGFGGLGVPT TPGDVDWGAL AQAPGTLIVT AGPTEIGKVA TALVEHGRAG DTPVAVTVDG TTTDQRTVTS TLDRIEADVA PMLNATANPP NEVIISVGPV VATRAKLSWW ETRALFGWTV LVPRTKEQAA ILSDSLRAHG ASPLEVPTIA VEPPRTAAPM ERAITGLVSG RYQWVAFTSV NAVKAVQEKV EERSLDARAF AGVKVAAIGE ATADALRAFG IRPDLVPAGQ QSSEGLLEDW PEFDESLDLL DRVLLPRADI ATDTLVAGVK DRGWQVDDVT AYRTVRAAPP PAPIREALKG GRVDAVVFTS SSTVRNLVGI AGKPHETTVI AVIGPATAAT AQELGLRVDV QATEASIPSL VASLAEFAAE HREELGKVGP LAARLPKPRR GSRR
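The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction like the one in the GC content row. The helper names are hypothetical, and the example input is only the first three blocks of the gene sequence, so its result will not equal the 75% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGCCACCC GACGTACCAA GAAGCCCGTG"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))
```

Passing the complete gene sequence through `clean_sequence` first gives a string whose length can be compared against the Gene Length row.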