Gene Franean1_6129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6129 
Symbol 
ID5674450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7457150 
End bp7458964 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content75% 
IMG OID641244981 
Producturoporphyrinogen III synthase HEM4 
Protein accessionYP_001510379 
Protein GI158317871 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1587] Uroporphyrinogen-III synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.143048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.442264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCC GACGTACCAA GAAGCCCGTG TCCCCGGTCG CGCTTGTGGG CGCCGGTCCC 
CGTGATCCGG GGCTGCTCAC GGTCCTCGCC GTGGAGACTC TCACCGCTGC CGACGTGGTG
GTCGCCGACC CGGACGTACC GTCCGAGGTG GTCGAGTCGC TGTCGGCCGA GGTGCTGCGG
ATCGGTGACC TCGACGCCCC GAAGCCGGTG CGGGACGCGG AGGCCGCGAC CGCCGCGGTC
GTCAGCCGGG CACGGGCCGG GGACAAGGTC GTCCGGCTCT ACGCCTCGGA CCCGTGGCTG
ACCCGGATCG GCGCGGCGGA CGCGCAGTCG CTGGCCAAGG CCAAGATCCC CTACCGGGTG
GTTCCCGGTA TCTCGACCTC CGCCGCGGTC GCGACCTACG CCGGTGTCGC GCCGGGCAGC
CCGGTCACCT TCGCCAGCAC GTCGGGTGTC TTCTCGTCCT CGGGCTCGGT CGCCTCGGCC
TCACCGTTCG GCGTCGGTGA CCCGGTCGGC CCGCTCACCC CGCCCCCCTT CGGGACGTCC
CGCCTCGGCG GCCGGTCGAT GCCGAGCCTC GGCGGCGGCC CGCTGGGCGT GGGCGGCGGC
TTCGGCGCGC CCCTCGGCCT GGGCACCCCT ACCGGCTTCG GTGGTCTCGG CGTGCCCACC
ACGCCGGGCG ACGTCGACTG GGGCGCGCTC GCGCAGGCCC CCGGCACGCT GATCGTCACC
GCCGGCCCCA CCGAGATCGG CAAGGTGGCG ACCGCGCTGG TCGAGCACGG CCGCGCCGGT
GACACCCCCG TCGCGGTCAC CGTCGACGGC ACGACCACCG ACCAGCGCAC CGTCACCTCG
ACCCTCGACC GGATCGAGGC CGACGTCGCG CCGATGCTCA ACGCCACCGC GAACCCGCCG
AACGAGGTCA TCATCTCCGT CGGCCCGGTC GTCGCGACCC GGGCGAAGCT GTCCTGGTGG
GAGACCCGCG CCCTGTTCGG CTGGACGGTG CTGGTGCCCC GGACGAAGGA ACAGGCGGCG
ATCCTCTCCG ACTCGCTGCG CGCCCACGGG GCGAGTCCGC TGGAGGTGCC GACGATCGCC
GTCGAGCCGC CGCGGACGGC CGCGCCGATG GAGCGCGCCA TCACCGGGCT GGTCTCCGGC
CGCTACCAGT GGGTCGCCTT CACCTCGGTG AACGCCGTCA AGGCGGTGCA GGAGAAGGTC
GAGGAGCGCA GCCTGGACGC CCGCGCCTTC GCCGGTGTCA AGGTCGCCGC GATCGGCGAG
GCCACCGCGG ACGCGCTGCG CGCCTTCGGT ATCCGCCCCG ACCTGGTGCC CGCCGGCCAG
CAGTCCAGCG AGGGCCTGCT CGAGGACTGG CCCGAGTTCG ACGAGTCGCT TGACCTGCTC
GACCGGGTTC TCCTGCCGCG CGCCGACATC GCCACCGACA CTCTCGTCGC CGGCGTCAAG
GACCGCGGCT GGCAGGTGGA CGACGTCACC GCCTACCGGA CGGTGCGCGC CGCGCCGCCG
CCCGCGCCGA TCCGCGAGGC CCTCAAGGGC GGCCGGGTCG ACGCGGTGGT CTTCACCTCC
TCCTCCACGG TGCGCAACCT GGTCGGAATC GCCGGCAAGC CGCACGAGAC CACCGTCATC
GCGGTGATCG GCCCGGCGAC GGCCGCGACC GCCCAGGAGC TCGGCCTGCG GGTGGACGTC
CAGGCGACCG AGGCGTCGAT CCCGTCGCTC GTCGCGTCGC TGGCGGAGTT CGCCGCCGAG
CACCGCGAGG AGCTCGGCAA GGTCGGCCCG CTCGCCGCCA GGCTGCCCAA GCCGCGCCGG
GGTTCCCGGC GATGA
 
Protein sequence
MATRRTKKPV SPVALVGAGP RDPGLLTVLA VETLTAADVV VADPDVPSEV VESLSAEVLR 
IGDLDAPKPV RDAEAATAAV VSRARAGDKV VRLYASDPWL TRIGAADAQS LAKAKIPYRV
VPGISTSAAV ATYAGVAPGS PVTFASTSGV FSSSGSVASA SPFGVGDPVG PLTPPPFGTS
RLGGRSMPSL GGGPLGVGGG FGAPLGLGTP TGFGGLGVPT TPGDVDWGAL AQAPGTLIVT
AGPTEIGKVA TALVEHGRAG DTPVAVTVDG TTTDQRTVTS TLDRIEADVA PMLNATANPP
NEVIISVGPV VATRAKLSWW ETRALFGWTV LVPRTKEQAA ILSDSLRAHG ASPLEVPTIA
VEPPRTAAPM ERAITGLVSG RYQWVAFTSV NAVKAVQEKV EERSLDARAF AGVKVAAIGE
ATADALRAFG IRPDLVPAGQ QSSEGLLEDW PEFDESLDLL DRVLLPRADI ATDTLVAGVK
DRGWQVDDVT AYRTVRAAPP PAPIREALKG GRVDAVVFTS SSTVRNLVGI AGKPHETTVI
AVIGPATAAT AQELGLRVDV QATEASIPSL VASLAEFAAE HREELGKVGP LAARLPKPRR
GSRR