Gene Franean1_5175 details


Gene Information

Locus tag: Franean1_5175
Symbol: hemE
ID: 5673509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6210539
End bp: 6211621
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 74%
IMG OID: 641244029
Product: uroporphyrinogen decarboxylase
Protein accession: YP_001509439
Protein GI: 158316931
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01464] uroporphyrinogen decarboxylase


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0841992
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCTCG GAAGTACCGC CGTTGCGCCG CCTCGCCCCG GGGTCTCGCC CCGACGTCCA 
GGTCTCGCCG CGACTTCGCC GTTCCTGCGC GCCGCCGCCG GGGACCGCCC CGACAGCGTG
CCCGTGTGGT TCATGCGGCA GGCCGGCCGG GTGCTTCCCG AGTACCGCGC CCTGCGCGCC
ACCACCGCCA TGCTCGATTC CTGCCGCAAC GCCGACATGG TCACCGAGAT CACCCTGCAG
CCGGTGCGCC GTTTCCGGCC CGACGCCGCG ATCTTCTTCT CGGACATCGT CCTGCCGCTC
GCGGCCGTCG GGGTGGACGT CGACATCGTC GCCGGGGTCG GGCCCGTGGT GGCCCATCCC
GTCCGGGCGC CGTCCGACCT GGACGTGCTG CGCCCGCTGG AGCCCGGTGA CGTGCCCTAC
GTGAGCGAGG CGGTGGCCTC CCTGGTCCGC GAGCTCGGGC AGACGCCGCT GATCGGTTTC
GCCGGCGCGC CGTTCACCCT GGCCAGCTAT CTGATCGAAG GCGGGCCGAG CCGCAACCAC
ACCCGCACGA AGGCGCTCAT GTACGCCGAG CCGGCGCTGT GGCACGACCT GCTCGGCCGG
CTCGCCGACA TCACGGCCGC CTTCCTGCGG GTGCAGGTCG ACGCCGGCGC CGACGCCATC
CAGCTGTTCG ACTCGTGGGC GGGCGCGCTC AGCGAGGACG ACTACCTGCG CTACGTGGCT
CCGCACAGCA CCCGGGTTCT CGCGGCGTTC GCCGACGACG GCATCCCGCG CATCCACTTC
GGGGTGAACA CCGGGGAGCT GCTCGGCGCG ATGGGCGCGG CCGGCGCGGA CGTCGTCGGG
GTCGACTGGC GCGTCCCGCT GGACGAGGCC GCCCGCCGGG TCGGCCCCGG TCGTGCCGTG
CAGGGCAACC TCGACCCGGC CGCGGTCTTC GCCCCGTCCG ACGTGCTCGC GGCGAAGGTC
CGCGACGTCT GCCGCCGGGG TGCGGCCGCT CCCGGGCACA TCTTCAACTT CGGGCACGGC
GTGCTTCCGG AGAGTGATCC GGGCGTGCTG GCGCACATCG TGGACCTCGT CCACCAGTTC
TGA
 
Protein sequence
MSLGSTAVAP PRPGVSPRRP GLAATSPFLR AAAGDRPDSV PVWFMRQAGR VLPEYRALRA 
TTAMLDSCRN ADMVTEITLQ PVRRFRPDAA IFFSDIVLPL AAVGVDVDIV AGVGPVVAHP
VRAPSDLDVL RPLEPGDVPY VSEAVASLVR ELGQTPLIGF AGAPFTLASY LIEGGPSRNH
TRTKALMYAE PALWHDLLGR LADITAAFLR VQVDAGADAI QLFDSWAGAL SEDDYLRYVA
PHSTRVLAAF ADDGIPRIHF GVNTGELLGA MGAAGADVVG VDWRVPLDEA ARRVGPGRAV
QGNLDPAAVF APSDVLAAKV RDVCRRGAAA PGHIFNFGHG VLPESDPGVL AHIVDLVHQF
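
Consistency check (illustrative)

The length and GC content fields of this record can be cross-checked against the coordinates and the sequence text. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The helper names (clean, gc_fraction, span_length, expected_protein_length) are hypothetical and chosen only for this example.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace is ignored)."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count, ASSUMING the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Coordinates copied from the Gene Information section of this record.
    length = span_length(6210539, 6211621)
    print("gene length (bp):", length)
    print("expected protein length (aa):", expected_protein_length(length))
    # First two blocks of the gene sequence, used here only as a short demo input.
    print("GC fraction of demo input:", gc_fraction("ATGTCCCTCG GAAGTACCGC"))
```

To check the reported GC content, pass the full gene sequence (spaces and line breaks included) to gc_fraction and compare the result with the GC content field.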