Gene Francci3_1334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1334 
SymbolhemE 
ID3906547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1601250 
End bp1602359 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content73% 
IMG OID637878667 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_480440 
Protein GI86740040 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0406905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTCC TCCACGTTGA CGCGCGACCC GGATCCGGGC CGGGTGGCGT CTCGCCCCCG 
CCGAGCGGTG CAGCGCTCGC CCGGCGGCCC GGCCTGGCCG ACACCGCGCC CTTCCTGCGT
GCCTGTCGAC GGGAGCACCC CGGGACCACG CCGGTGTGGT TCATGCGTCA GGCCGGGCGG
GTCCTGCCCG AGTACCGGGC CCTGCGCGCG GGAGTCGCCA TGCTCGACTC CTGCCGGGAC
GCCGAGATGA TCACGGAGAT CACGCTCCAG CCGGTACGCC GGTTCCGGCC GGACGCGGCG
ATCTTCTTCT CCGACATCGT GGTGCCCCTG GTCGCGATCG GCCTCGACAT CGACATCGTC
GCCGGGATCG GACCGGTGGT GGCCGAGCCC GTGCGGGACG CCGTCGGGCT CGCCGCGTTG
CGTGCACTGG AGCCGGACGA CGTTCCCTAC GTGGCCGACG CGGTCCGGTT CCTGCTGGCC
GAGCTGGGTT CAACCCCGCT GATCGGGTTC GCCGGGGCGC CGTTCACCCT CGCGAGCTAC
CTCATCGAGG GCGGACCGAG TCGCGACCAC GCCCGCACCA AGGCGTTGAT GTACAGCGAA
CCGAAGCTCT GGCACGCCCT GCTGGCCCGG CTCGCCGACA TCACCACCGC CTTCCTGCGC
GTCCAGGTGG ATGCCGGTGT TGACGCGCTG CAGCTGTTCG ACTCCTGGGC CGGGGCGCTG
GACGAGGCGG ACTACCGTCG CTACGTCGCG CCGCACAGCG CTCGGGTGCT GGCGGCCTTC
GCCGGTGAGG TGCCGCGCAT CCACTTCGGT GTGAACACCG GTGAGCTGCT CGCCGCGATG
GGCCAGGCGG GTGCGGACGT CGTCGGCGTC GACTGGCGGG TCCCTCTCGA CGAGGCCGCC
CGGCGGATCG GGCCCGGTCA TGCCGTGCAG GGAAACCTCG ACCCGACCGC GGTCTTCGCC
CCCGAACCGG TGCTCGCCGC CAAGGTGCGC GACGTCTGCG CCCGCGGGGC CGAGGCAGAG
GGGCACGTGT TCAACCTCGG CCACGGGGTG CTGCCGCAGA CCGATCCGGG CGTGCTCGCG
CACGTCGCCG ACCTTGTCCA CGGCGGATGA
 
Protein sequence
MPVLHVDARP GSGPGGVSPP PSGAALARRP GLADTAPFLR ACRREHPGTT PVWFMRQAGR 
VLPEYRALRA GVAMLDSCRD AEMITEITLQ PVRRFRPDAA IFFSDIVVPL VAIGLDIDIV
AGIGPVVAEP VRDAVGLAAL RALEPDDVPY VADAVRFLLA ELGSTPLIGF AGAPFTLASY
LIEGGPSRDH ARTKALMYSE PKLWHALLAR LADITTAFLR VQVDAGVDAL QLFDSWAGAL
DEADYRRYVA PHSARVLAAF AGEVPRIHFG VNTGELLAAM GQAGADVVGV DWRVPLDEAA
RRIGPGHAVQ GNLDPTAVFA PEPVLAAKVR DVCARGAEAE GHVFNLGHGV LPQTDPGVLA
HVADLVHGG