Gene Francci3_1976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1976 
Symbol 
ID3903684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2316691 
End bp2320812 
Gene Length4122 bp 
Protein Length1373 aa 
Translation table11 
GC content66% 
IMG OID637879312 
Productamino acid adenylation 
Protein accessionYP_481079 
Protein GI86740679 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGTCGG GCGGAGACGT GGAAAGGATT CCGCTGTCGG CGGTGCAGCA CGCAATGCTG 
TTGCACAGTG GCAGCGCTCC CGGCGAGGGA TTCTACGTTC TGCAGAAGTT GTTCACGTTG
CGGGAGCAGC TGAATCTGGC GGTGTTCGAG CAGGCACTGC GGGACCTGTT CGACTGGCAT
CCGATGCTGC GGACCAGGCT TCTGCTGGAC CATCCGGACG GTCCGAGCCA GGTGGTCAGC
AGTGATGTCG ATGTGTCCTT GCCCGCGCGG GACTGGAGGC AGTTGGACAG CGCAACTCGG
CAACAGAGCC TGCGCGAATA CCTCGACGAT GACAAGCGCC GGTCCTTCGA TTTCGCGGCC
GGGCCCCCGA TGCGGTTCGC GGTCTTCCGC ATGGATAATC ATGAATACGA GTTCGTCTGG
ACGACCCATC ATGCACTGAT GGACGGGCGC TCCCTTTTCA TGGTGAGCGC GGAACTCTTC
GAACGGTACG ATCACCTGCT CGTCGGGCGC CGGGCCGAGC GTCCCCGTCC GCGCCCGTAC
CGGGACTACG TCGCCTGGCT GGCCGAGCAG GACCTTGTCG CAGCGGAGTC GTTCTGGCGT
GCCGATCTGG CTGGTGTTAC GGAGCCCACG CCATTCGTCG TCGACGTGGT CCGCGTCCCC
GACCCCGGTG AGAACGTCCT GACGGCCGTG GAGACCGAGC TTTCGGAAGA GATGACCAGG
GCACTGGAAC TCCTGGTCCG TGAGCAACAG GTAACCGCTA ATAACGTCGT GCAGGCGGCG
TGGACGGTCC TCCTGCAACA GCTCAGTGGC CGCCATGACG TAGTATTCGG TACAACCCGG
GCATGTCGCG GATTCCTGGA CGGCGCCCAG GACATGCTTG GTCTCTTTCT CAATACGCTG
CCATTCCGCG TCGTCGCCCC GAGCACGATG ACCGTGATCG AGCTGCTGAA GCGGCTGCGA
CGGCATCAGC TGGCGCTGCG AGAGTTTGAG CACACCTCGC CGACCCGTAT CCACACCTGG
AGTGATATTC CGGCCTCGCT CCCGATGTTC GAGAGTCTGC TCATCTTTGA GAACTACCTG
CCTGCTACCC GGCTGCGGTC GCTGGGGGGT TCCTGGCGGC ACCGGGAATT CCGTCTGGAG
GAGCAGACGA ATTACCCGCT CACTCTCTAT GCGAACCACG ACCACCGCCT CATCCTCAGG
ATCGGTTATG ACCGCCGCCG TTTCACCGAC AAGACGGTAG AACGGATGGG ATCCCAGGTT
CGGCGGTTGT TGGAGGCGAT GGTTGCCGCC CCGGAGGTCC CCCTCGGTGC GCTTCCGAAA
CTTGCCCCGG AGGAGCTGGC CGGCCGGCGG ACGAAGGGAC TGGCCGGGCC GGTCGGACCG
GTTGAGCCGG TTGAGCCGGG GGAGCCGGCT GGAGTTGTCG AGCACGAGAC CGACTGGGTA
CGGCGCCTGA TGTCCGTCGA GCCGCTCGAC CTCCCGCGCC GTAATCCGGT CGCGGATCCG
CCGCACCGGT ACCGGGATAT CCGATTCCCC GTCCCCGCTG ATCTCGCCGG CGTGGAGGAG
AAGATGGCCG CACGGGATGG TGACGGCCGG GGGCGCGATA GGTCCGAATG GTTGCTCACG
GTGCTGCTGG CCTTCCTGGT TCGATTCAGC GGTGCGGACG GTCCGGACGT CGACCTGTCG
TGGGAAGCCG AATCCGACGC CTCGGGCGGG GATGGCATAC CGCTTGCCCG CCATCGGCCG
TTCCGGGTGC CCTCGCTCAA CGAGAGCCGG GGTTTCCTGG AGTTCCGCCG TGAGGTGGCC
AGGCAGCTCC GCCTGTTCAG CGGGCGTAGC GGCCACCCCC GCGACCTAGG GGTTCGTCAC
GCGGAGCTAC GTGCGTGCGC GCCATGGGCC GAGCGCGGCA TGCCGGTCGC CGTCGAGCTG
GTTGACAATC TCGATGTCGC TGCCGAGCCG CGGTCCGCGA CCGTGCTGTT CGCGCAGATC
CCGCGAGACG GTGAGGGATG CCGCTGGCTC ATCGCCGAGA ACGTCCTGCA AACTGGTGCG
GCCGACACGA TGCGCTCCAT GTTCGTCGCG TTCCTCGAGG CCCTCACGGG CTCCGCGACG
GCCGATCTGA CCCAGCTCCC GCTATTGTCG CCCGAGGACA CGCAACGTGT GCTTGTCACC
TGGAACGACA CGGACGCCGA CTACCCGCGC GACCTGTGCG CGCATCAGTT GTTCATACGG
CAGGCACGTA GCCGTCCCGA GGCACCCGCC GTCGTCTGCT CTGATCGCAG TCTCTCGTAC
GGCGAACTTG ATCGACGGTC GACGCGCCTC GCCGCGTTCC TCGGTCGCCA CGGCATCGGT
CCCGGCAGCC TTGTCGGCAT CTACCTCGAG CGGTCGGAAG AAATTGTTGT CGCTGTACTC
GGCGTGATGA AATCCGGAGC GGCCTTTGTG CCGTTGGATC CGGTGTATCC ACCCGACCGC
ATCACTCAGA TGCTCACCAG TTCGGGGTCG ACACTGCTGC TCACCCGGAC GAGTTTGGAG
CCCGATGTCC GGGACTGCCC GGCGACCGTC GTCACGCTCG ATCAGTACTG GGATGTGATC
GCGACAGCCG GGGGCGGTCC GGGCGAGGAA CATGACGAGG AATCTGACGA GGAAAAATAT
GACCGAGGGT CACCGGAGGG TCGTGCTTAT GTAATCTACA CGTCGGGGTC GACCGGCCGG
CCCAAGGGGG TGGATGTCGG CCACCGGGCC CTGACGAACC TTCTGTGCTC GATGGCGCGG
ACTCCAGGAT TTACCGAGTA CGACCGCCTG CTGGCGGTGA CCACGGTCTG CTTTGACATC
GCCTACCTCG AGCTTCTCCT GCCGCTGGTG ACAGGCGGGC AGGTCGAGGT GGTACCCGCC
GATGTCGCCA GTGACGGGTT CGAACTGCGG AGACGGATCG AGCGGAGCCG CCCGACGGTC
ATGCAGGCGA CCCCGGCGAC CTGGCGGATG CTGATCGCCG CGGCCTGGGA GGGTGACCGT
GGCCTGACGG CCCTGTGCGG AGGCGAACAG CTGCCTCGCG ACCTCGCTGA CGGCCTTCTC
GCGCGGGTTG CGAAGGTGTG GAACCTCTAC GGCCCGACCG AGACGACGAT CTGGTCCTCG
GTCGACCGCG TGGAGCCGGG TCGGCAGGTT ACGATCGGGC GACCGATCGC CAATACCCGG
TTTTACGTCC TCGACAGATG GCTTCAGCCG GTGCCGCCCG GGGTACCCGG CGAGCTGTAC
ATCGGTGGGG ACGGGGTGGC CGCGGGCTAT CTCGGCGAGC CGGAGCTCAC CCGGGAAAGG
TTCGTCCGAG ACCCGTTCGG ACCAGACGAG TCCGCCGTCA TGTACCGGAC CGGGGACATC
GTGCGCCACT TGCCCGACGG CCGGATTGAC TACCTCCACA GGGTCGACAA CCAGGTCAAG
CTGCACGGAT ATCGCATCGA ACCCGGTGAG ATCGAAGAAG CTCTGCGCCG GCATGACGGC
ATCGCTGAGG CGGTGGTCTG CCCACGCGAC ATCGCCCCCG GTAACCGCCA GCTGGTGGCC
TACCTCGTCT CGGCCGAATC CGGCCTCGGC GCGCGGCCGG AGGAACTCCG GCGGTACCTA
CGTACCCGAC TACCTCCCTA CATGATCCCG GCTGCGTTCG TGACCGTGAC ACGTCTGCCG
CTGACAGCCA ACGGCAAGAT CGATCGTCGG TCCCTACCGG GGCCGCCTGA GGTTCTCCCG
CCCGCAGGCG GAGAGCGGGT CCTGCCCCGT AGCGAGCTCG AGGCGGCAGT CGCCAGCATC
TGGCGGGGTG TCCTGGGGGT GCCGGAGGTC GGAATCGACG AGAACTTCTT CGACGCCGGC
GGTAACTCGT TGCTGCTCAT GCAGGTTCTG ATCCGGCTTC AGGCGGAGCT GTCCGAGTCG
CTTACGCGGG TCGACATGTT CGCCTATCCG ACTGTCCGGG GACTGGCGGC ATACTTGTCC
ACCGCTGCGC CACGCCGGTC CGGTGACGCC GGATCCGGCG CGCCACACCC TCCCGGACGA
GCTGGGGCAC CCGAAGAATC CGGGGGGCGT TCCCGGTCAG CACTCGCGGG CCTCAGGCGC
AGACGTGGTG GTCTCAGCTC CCGCCAGCCA CCCGGCGCAT GA
 
Protein sequence
MRSGGDVERI PLSAVQHAML LHSGSAPGEG FYVLQKLFTL REQLNLAVFE QALRDLFDWH 
PMLRTRLLLD HPDGPSQVVS SDVDVSLPAR DWRQLDSATR QQSLREYLDD DKRRSFDFAA
GPPMRFAVFR MDNHEYEFVW TTHHALMDGR SLFMVSAELF ERYDHLLVGR RAERPRPRPY
RDYVAWLAEQ DLVAAESFWR ADLAGVTEPT PFVVDVVRVP DPGENVLTAV ETELSEEMTR
ALELLVREQQ VTANNVVQAA WTVLLQQLSG RHDVVFGTTR ACRGFLDGAQ DMLGLFLNTL
PFRVVAPSTM TVIELLKRLR RHQLALREFE HTSPTRIHTW SDIPASLPMF ESLLIFENYL
PATRLRSLGG SWRHREFRLE EQTNYPLTLY ANHDHRLILR IGYDRRRFTD KTVERMGSQV
RRLLEAMVAA PEVPLGALPK LAPEELAGRR TKGLAGPVGP VEPVEPGEPA GVVEHETDWV
RRLMSVEPLD LPRRNPVADP PHRYRDIRFP VPADLAGVEE KMAARDGDGR GRDRSEWLLT
VLLAFLVRFS GADGPDVDLS WEAESDASGG DGIPLARHRP FRVPSLNESR GFLEFRREVA
RQLRLFSGRS GHPRDLGVRH AELRACAPWA ERGMPVAVEL VDNLDVAAEP RSATVLFAQI
PRDGEGCRWL IAENVLQTGA ADTMRSMFVA FLEALTGSAT ADLTQLPLLS PEDTQRVLVT
WNDTDADYPR DLCAHQLFIR QARSRPEAPA VVCSDRSLSY GELDRRSTRL AAFLGRHGIG
PGSLVGIYLE RSEEIVVAVL GVMKSGAAFV PLDPVYPPDR ITQMLTSSGS TLLLTRTSLE
PDVRDCPATV VTLDQYWDVI ATAGGGPGEE HDEESDEEKY DRGSPEGRAY VIYTSGSTGR
PKGVDVGHRA LTNLLCSMAR TPGFTEYDRL LAVTTVCFDI AYLELLLPLV TGGQVEVVPA
DVASDGFELR RRIERSRPTV MQATPATWRM LIAAAWEGDR GLTALCGGEQ LPRDLADGLL
ARVAKVWNLY GPTETTIWSS VDRVEPGRQV TIGRPIANTR FYVLDRWLQP VPPGVPGELY
IGGDGVAAGY LGEPELTRER FVRDPFGPDE SAVMYRTGDI VRHLPDGRID YLHRVDNQVK
LHGYRIEPGE IEEALRRHDG IAEAVVCPRD IAPGNRQLVA YLVSAESGLG ARPEELRRYL
RTRLPPYMIP AAFVTVTRLP LTANGKIDRR SLPGPPEVLP PAGGERVLPR SELEAAVASI
WRGVLGVPEV GIDENFFDAG GNSLLLMQVL IRLQAELSES LTRVDMFAYP TVRGLAAYLS
TAAPRRSGDA GSGAPHPPGR AGAPEESGGR SRSALAGLRR RRGGLSSRQP PGA