Gene Francci3_0487 details

Gene Information

Locus tag: Francci3_0487
Symbol:
ID: 3903007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 569467
End bp: 571290
Gene Length: 1824 bp
Protein Length: 607 aa
Translation table: 11
GC content: 74%
IMG OID: 637877818
Product: uroporphyrinogen-III synthase / uroporphyrinogen-III C-methyltransferase
Protein accession: YP_479602
Protein GI: 86739202
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0007] Uroporphyrinogen-III methylase; [COG1587] Uroporphyrinogen-III synthase
TIGRFAM ID:
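
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the one stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(569467, 571290)
print(length)                     # 1824
print(protein_length_aa(length))  # 607
```

Under these assumptions, both results agree with the Gene Length (1824 bp) and Protein Length (607 aa) fields of this record.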


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.188044
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACCC GACGAACGAA GAAGCCCGTC TCCCCGGTCG CCCTGGTGGG TGCCGGACCG 
CGTGACCCCG GACTGCTCAC CGTCCTCGCG GTCGAGACCC TGGCCGCCGC CGACCTGGTG
GTTGCCGATC CCGAGGTGGC GCCGGCGGTG GTGGAGAACC TCACCGCCGA AGTGCTGCGC
ATCGGCGACC TCGAGGCGCC GAAGCCGGTC CGTGACGCGG AGGCGGCGAC CGCCGCGGTG
GTGAGCAGGG CCCGTGCCGG TGACAAGGTG GTCCGGCTCT ACGCCTCCGA TCCGTGGCTG
ACCCGCATCG GCGCCGCGGA CGCCCAGGCG CTCGCGAAGG CCAAGATCCC CTATCGGGTG
GTGCCCGGCA TCCTCACCGG CGCCGCGGTC GCGACCTACG CCGGGGTCGC CCCGGGCGCG
CCGGTCACCT TCGCCAGTAC CTCCGGGGTC TTCTCGTCGT CGGGCACCGT GCCGGCGTCC
TCGCCGTTCG GCGGGAGCGA GCCGGTGGGT CCGCTCACCG GTCCTCCGTT CGGGGCCTCG
CGGCTGGGCG GACGGCCGAT CGCCGGTCTC GGGGGCGGGC TCGGCGGTGG CCTGGGAACC
GGCGGGTTCG GCTCGCCGCT GGGCCTCGGT GCCGCGGGCG GCTTCGGCGG GTTCGGTGCG
CCGATCTCCT CCCCCGGTGA CCTGGACTGG GGTGCCCTCG CCCAGACCCC GGGCACCCTC
GTGGTCACGG CGGCACCGAC GGAGGTCGGC AAGGTCGCGG CGGCGCTCGT CGAGCACGGG
CGGCCCGGTG ACACCCCGAT CGCGGTGACC GTGGACGGCA CCACCACCGA CCAGCGCACC
GTCACCTCGA CGCTCGACCG GGTCGAGGCG GACACGGCAC CGCTGTTGAC CGCGGCCCCC
ACCCCGATCA CCCAGGTCGT GCTGTCCATC GGCCCGGTGG TCGCGACCCG CGGCAAGCTC
TCCTGGTGGG AGACCCGGGC GCTGTTCGGT TGGACGGTGC TGGTCCCGCG GACCAAGGAA
CAGGCGGCGA TCCTGTCCGA CCTGCTGCGT TCGCACGGCG CAAGCCCGCT GGAGGTGCCG
ACGATCGCCG TCGAGCCCCC GCGGACCGCC GCCCCGATGG AGCGCGCGAT CACCGGTCTG
GTCTCTGGGC GCTACCAGTG GGTGGCCTTC ACCTCGGTCA ACGCGGTCAA GGCGGTGCAG
GAGAAGGTCG AGGAACGCAG CCTCGACGCC CGCGCGTTCG CGGGGGTCAA GGTTGCGGCG
ATCGGCGAGG CGACGGCCGA CGCGCTGCGT ACCTTCGGTA TCCGGCCCGA TCTCGTCCCG
TCCGGGCAGC AGTCCAGCGA GGGGCTGCTG GCGGACTGGC CGGACTTCGA CGACACCCTC
GACCTGCTCG ACCGGGTGCT GCTGCCGCGT GCCGACATCG CCACCGAGAC CCTGGTCGCC
GGCCTGAAGG AACGCGGCTG GCAGGTCGAT GACGTGACGG CCTACCGCAC CGTGCGTGCC
GCTCCGCCGC CCGCATCCAT CCGGGAGGCG CTGAAGGGCG GCCGGGTCGA CGCGGTCGTC
TTCACCTCAT CCTCGACCGT GCGCAACCTG GTCGGCATCG CCGGCAAGCC CCACGAGACG
ACGGTGATCG CCGTGATCGG TCCGGCGACG GCCGCCACTG CCCAGGAGCT CGGGCTGCGC
GTCGACGTGC AGGCTCCGGA GGCGTCCATC CCCGCCCTTG TCGGGGCGCT GGCGGAGTTC
GCTGCCGAAC ACCGGGAGGA GCTCGGGAAG ATCGGGCCGC TCGCTGCGCG GTTGCCGAAG
CCACGGCGAG GCTCGCGGCG GTGA
 
Protein sequence
MATRRTKKPV SPVALVGAGP RDPGLLTVLA VETLAAADLV VADPEVAPAV VENLTAEVLR 
IGDLEAPKPV RDAEAATAAV VSRARAGDKV VRLYASDPWL TRIGAADAQA LAKAKIPYRV
VPGILTGAAV ATYAGVAPGA PVTFASTSGV FSSSGTVPAS SPFGGSEPVG PLTGPPFGAS
RLGGRPIAGL GGGLGGGLGT GGFGSPLGLG AAGGFGGFGA PISSPGDLDW GALAQTPGTL
VVTAAPTEVG KVAAALVEHG RPGDTPIAVT VDGTTTDQRT VTSTLDRVEA DTAPLLTAAP
TPITQVVLSI GPVVATRGKL SWWETRALFG WTVLVPRTKE QAAILSDLLR SHGASPLEVP
TIAVEPPRTA APMERAITGL VSGRYQWVAF TSVNAVKAVQ EKVEERSLDA RAFAGVKVAA
IGEATADALR TFGIRPDLVP SGQQSSEGLL ADWPDFDDTL DLLDRVLLPR ADIATETLVA
GLKERGWQVD DVTAYRTVRA APPPASIREA LKGGRVDAVV FTSSSTVRNL VGIAGKPHET
TVIAVIGPAT AATAQELGLR VDVQAPEASI PALVGALAEF AAEHREELGK IGPLAARLPK
PRRGSRR