Gene Francci3_0745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0745 
SymbolfbiC 
ID3905813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp861113 
End bp863812 
Gene Length2700 bp 
Protein Length899 aa 
Translation table11 
GC content72% 
IMG OID637878078 
ProductFO synthase 
Protein accessionYP_479858 
Protein GI86739458 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily
[TIGR03550] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofG subunit
[TIGR03551] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGTA CGGCGGCGAC ATCCAGCGGT GCCGAATCCA GCGGTGCCGA ATCCAGCGGT 
GCCGAATCCA GCGGTGCCGA ATCCAGTTCC CCACCGTCCC CGTCGGCGCT ACGCCGGGCG
CTGCGCCGGG CCCGGGACGG CGCCACGCTC GACGTGGTAG AGGCCACCAT TCTGCTGGCC
GCGCGCGGCG ATGACCTCGC GGACCTGTGC CGCTCGGCGG CCGGGGTCCG CGACGCGGGT
CTGGCGTCGA TCGGGCGGCC GGGTGTCGTC ACCTACTCGC CGAAGGTGTT CATCCCGTTG
ACCCGGTTGT GCCGGGACCG CTGCCACTAC TGCACCTTCG CCACCGTGCC GCACCGGTTG
CCCGCCCCCT ACCTCACGGT GGACGAGGTG CTCGGCATCG CCCGGGCGGG CGCCGCGGCG
GGCTGCGCCG AGGCGCTGTT CACGCTTGGT GATCGTCCCG AGGATCGGTG GCCGGCGGCC
CGGGAATGGC TCGACGCCCA CGGCTACGAC TCGACGCTGG GTTACCTGCG TGCCGTCGCG
ATCGCCGTGC TGGAGGAGAC CGGTCTGCTT CCGCACCTCA ACCCGGGTGT ACTGAGCTGG
GTGGAACTCG CCCGGCTCAA GCCGGTGGCC CCGTCGATGG GGATGATGCT GGAGACGACC
GCGACCCGCC TGTGGTCGGA GCCGGGCGGG GCGCACTACG GATCACCCGA CAAGGACCCG
GCCGTCCGCC TGCGGGTCCT CACCGACGCC GGGCGGTTGT CGATCCCGTT CACGACCGGC
CTGCTGCTCG GCATCGGGGA GACCCGGACG GAGCGGGCCG AGACGATCTT CGAGCTTCGA
GGTCTGGCCC GCCGCTATGG GTCGATCCAG GAGGTGATCG TCCAGAATTT CCGCGCCAAG
GACGACACCG CGATGCGCTC CGCGCCCGAC GTCGGTGCCG AGGAGTTGGC CGCGGCCGTC
GCGGTCACCC GGCTGGTGCT GGGCCCCACG ATGCGGGTGC AGGCTCCGCC GAACCTCGTT
GACACCGTGG AGTGCGCCCT GCTGCTTTCG GCCGGCATCG ACGACTGGGG CGGCGTCTCC
CCGGTCACCC CCGACCACGT CAACCCCGAG CGGCCCTGGC CGGACCTGGA CGTCCTGGCG
GCCGTGACGG CGCAGGCCGG GTTCACGCTG CGGCCCCGGC TCACCGTCTA CCCCTCCCAT
CTCGGTGAAC CGTGGATCGA CCCGCGCCTG GCCGGACACG TCGGGGCGTT GGCCGGCGTG
GACGGCCTGC TCGCCGACGG CGTGCTCCCG GTCGGCCGCC CCTGGCAGGA GCCCGACGGC
GGACTCGCTG ACACCGGCCG CACCGATCTG CACGCCGCGG TCGACACGGT CGGGCGCCGC
AGCGAGAAAC GGGCGGACTT CGACACGGTC TACGGTGACT GGGAGTCGCT TCGGGCACAG
GTCAGCGCGG CCGGGACGGG GGTTGACGGA GTCGTCGGCA CGGGCAGCCG GGCCATCGCG
AGCCGGGCCG GCGCGGCTGC CGGTGCCGGT GCCATGCGGC GGATCGACCC CGACATGCTC
GCCGCCCTGC GCCACGCCGG GTCCGATCCG GCCGGGATCA CCGACGCCGA GGCGCTGACC
CTGTTCGGCG CGGACGGCGC CGCCCTGGAG GAGCTCTGCG CGCTCGCGGA CGGATTGCGC
CGCGACGTCG TGGGCGAGGA CGTCACCTAC GTCGTCAACC GCAACATCAA CTTCACCAAT
GTCTGCTACA CCGGATGCCG GTTCTGCGCG TTCGCGCAAC GCCGTGGGGA CGCCGACGCC
TACACGCTGT CCTTGGACGA GGTGGGCAGC CGGGCGGCGC AGGCCTGGGC GGTCGGCGCG
ACCGAGGTCT GCGTCCAGGG CGGGATCCAT CCCGATCTAC CGGGCGCCGC GTACTTCGAT
CTTGCCCGTG AGATCAAGCG GCAGGCGCCG GGGTTGCATC TGCACGCCTA CTCGCCGATG
GAGGTTCTCA ACGGCGTCAC CCGCACCGGC CTGTCGATCG GCGACTGGCT CACCGCGGCC
CGTGAGGCCG GCGTCGACAC GATCCCCGGC ACCGCGGCGG AGATCCTCGA CGACGATGTC
CGTTGGGTGC TGACGAAGGG CAAGCTGCCC GCGGCGAGCT GGGTCGAGGT GGTGACCACC
GCGCACCGGG TCGGCCTGCG CTCCTCGGCC ACGATGATGT ACGGGCACGT GGACACCCCC
GCACACTGGG TGGCCCATCT GCGCCTGCTA CGGCGCATCC AGGACTCCAC CGGCGGCTTC
ACCGAGTTCG TGGCCCTCCC CTTCGTCCAT CACAGCTCGC CGATCTACCT GGCCGGTGTC
GCTCGTCCCG GCCCCACCCG CCAGGAGAAC CGGGCTGTGC ACGCGATGGC GCGCATTCTG
CTGCACGGCT CGATCGACAA CATCCAGTGC TCCTGGGTCA AGCTCGGGGT GGAGGGCTGT
CGGGCCGTGC TCACCGGTGG GGCGAACGAC ATCGGCGGCA CGCTGATGGA GGAGACGATC
AGTCGGATGG CCGGCTCGCA GCACGGCTCG CGGCGCTCCG TGGCCGAACT CGAGGAGGTG
GCCGCGGGGG CGGGTCGTCC CGCTCGGCAA CGCACCACCA CGTATGGCCG GATCCCTGAC
GAGCGCTTTC GCGCTGCCCG GGGCCGCACC GTCCGGGCCA TGCTGCCGGT GGTGTCCTGA
 
Protein sequence
MMSTAATSSG AESSGAESSG AESSGAESSS PPSPSALRRA LRRARDGATL DVVEATILLA 
ARGDDLADLC RSAAGVRDAG LASIGRPGVV TYSPKVFIPL TRLCRDRCHY CTFATVPHRL
PAPYLTVDEV LGIARAGAAA GCAEALFTLG DRPEDRWPAA REWLDAHGYD STLGYLRAVA
IAVLEETGLL PHLNPGVLSW VELARLKPVA PSMGMMLETT ATRLWSEPGG AHYGSPDKDP
AVRLRVLTDA GRLSIPFTTG LLLGIGETRT ERAETIFELR GLARRYGSIQ EVIVQNFRAK
DDTAMRSAPD VGAEELAAAV AVTRLVLGPT MRVQAPPNLV DTVECALLLS AGIDDWGGVS
PVTPDHVNPE RPWPDLDVLA AVTAQAGFTL RPRLTVYPSH LGEPWIDPRL AGHVGALAGV
DGLLADGVLP VGRPWQEPDG GLADTGRTDL HAAVDTVGRR SEKRADFDTV YGDWESLRAQ
VSAAGTGVDG VVGTGSRAIA SRAGAAAGAG AMRRIDPDML AALRHAGSDP AGITDAEALT
LFGADGAALE ELCALADGLR RDVVGEDVTY VVNRNINFTN VCYTGCRFCA FAQRRGDADA
YTLSLDEVGS RAAQAWAVGA TEVCVQGGIH PDLPGAAYFD LAREIKRQAP GLHLHAYSPM
EVLNGVTRTG LSIGDWLTAA REAGVDTIPG TAAEILDDDV RWVLTKGKLP AASWVEVVTT
AHRVGLRSSA TMMYGHVDTP AHWVAHLRLL RRIQDSTGGF TEFVALPFVH HSSPIYLAGV
ARPGPTRQEN RAVHAMARIL LHGSIDNIQC SWVKLGVEGC RAVLTGGAND IGGTLMEETI
SRMAGSQHGS RRSVAELEEV AAGAGRPARQ RTTTYGRIPD ERFRAARGRT VRAMLPVVS