Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1976 |
Symbol | |
ID | 3903684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2316691 |
End bp | 2320812 |
Gene Length | 4122 bp |
Protein Length | 1373 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637879312 |
Product | amino acid adenylation |
Protein accession | YP_481079 |
Protein GI | 86740679 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGTCGG GCGGAGACGT GGAAAGGATT CCGCTGTCGG CGGTGCAGCA CGCAATGCTG TTGCACAGTG GCAGCGCTCC CGGCGAGGGA TTCTACGTTC TGCAGAAGTT GTTCACGTTG CGGGAGCAGC TGAATCTGGC GGTGTTCGAG CAGGCACTGC GGGACCTGTT CGACTGGCAT CCGATGCTGC GGACCAGGCT TCTGCTGGAC CATCCGGACG GTCCGAGCCA GGTGGTCAGC AGTGATGTCG ATGTGTCCTT GCCCGCGCGG GACTGGAGGC AGTTGGACAG CGCAACTCGG CAACAGAGCC TGCGCGAATA CCTCGACGAT GACAAGCGCC GGTCCTTCGA TTTCGCGGCC GGGCCCCCGA TGCGGTTCGC GGTCTTCCGC ATGGATAATC ATGAATACGA GTTCGTCTGG ACGACCCATC ATGCACTGAT GGACGGGCGC TCCCTTTTCA TGGTGAGCGC GGAACTCTTC GAACGGTACG ATCACCTGCT CGTCGGGCGC CGGGCCGAGC GTCCCCGTCC GCGCCCGTAC CGGGACTACG TCGCCTGGCT GGCCGAGCAG GACCTTGTCG CAGCGGAGTC GTTCTGGCGT GCCGATCTGG CTGGTGTTAC GGAGCCCACG CCATTCGTCG TCGACGTGGT CCGCGTCCCC GACCCCGGTG AGAACGTCCT GACGGCCGTG GAGACCGAGC TTTCGGAAGA GATGACCAGG GCACTGGAAC TCCTGGTCCG TGAGCAACAG GTAACCGCTA ATAACGTCGT GCAGGCGGCG TGGACGGTCC TCCTGCAACA GCTCAGTGGC CGCCATGACG TAGTATTCGG TACAACCCGG GCATGTCGCG GATTCCTGGA CGGCGCCCAG GACATGCTTG GTCTCTTTCT CAATACGCTG CCATTCCGCG TCGTCGCCCC GAGCACGATG ACCGTGATCG AGCTGCTGAA GCGGCTGCGA CGGCATCAGC TGGCGCTGCG AGAGTTTGAG CACACCTCGC CGACCCGTAT CCACACCTGG AGTGATATTC CGGCCTCGCT CCCGATGTTC GAGAGTCTGC TCATCTTTGA GAACTACCTG CCTGCTACCC GGCTGCGGTC GCTGGGGGGT TCCTGGCGGC ACCGGGAATT CCGTCTGGAG GAGCAGACGA ATTACCCGCT CACTCTCTAT GCGAACCACG ACCACCGCCT CATCCTCAGG ATCGGTTATG ACCGCCGCCG TTTCACCGAC AAGACGGTAG AACGGATGGG ATCCCAGGTT CGGCGGTTGT TGGAGGCGAT GGTTGCCGCC CCGGAGGTCC CCCTCGGTGC GCTTCCGAAA CTTGCCCCGG AGGAGCTGGC CGGCCGGCGG ACGAAGGGAC TGGCCGGGCC GGTCGGACCG GTTGAGCCGG TTGAGCCGGG GGAGCCGGCT GGAGTTGTCG AGCACGAGAC CGACTGGGTA CGGCGCCTGA TGTCCGTCGA GCCGCTCGAC CTCCCGCGCC GTAATCCGGT CGCGGATCCG CCGCACCGGT ACCGGGATAT CCGATTCCCC GTCCCCGCTG ATCTCGCCGG CGTGGAGGAG AAGATGGCCG CACGGGATGG TGACGGCCGG GGGCGCGATA GGTCCGAATG GTTGCTCACG GTGCTGCTGG CCTTCCTGGT TCGATTCAGC GGTGCGGACG GTCCGGACGT CGACCTGTCG TGGGAAGCCG AATCCGACGC CTCGGGCGGG GATGGCATAC CGCTTGCCCG CCATCGGCCG TTCCGGGTGC CCTCGCTCAA CGAGAGCCGG GGTTTCCTGG AGTTCCGCCG TGAGGTGGCC AGGCAGCTCC GCCTGTTCAG CGGGCGTAGC GGCCACCCCC GCGACCTAGG GGTTCGTCAC GCGGAGCTAC GTGCGTGCGC GCCATGGGCC GAGCGCGGCA TGCCGGTCGC CGTCGAGCTG GTTGACAATC TCGATGTCGC TGCCGAGCCG CGGTCCGCGA CCGTGCTGTT CGCGCAGATC CCGCGAGACG GTGAGGGATG CCGCTGGCTC ATCGCCGAGA ACGTCCTGCA AACTGGTGCG GCCGACACGA TGCGCTCCAT GTTCGTCGCG TTCCTCGAGG CCCTCACGGG CTCCGCGACG GCCGATCTGA CCCAGCTCCC GCTATTGTCG CCCGAGGACA CGCAACGTGT GCTTGTCACC TGGAACGACA CGGACGCCGA CTACCCGCGC GACCTGTGCG CGCATCAGTT GTTCATACGG CAGGCACGTA GCCGTCCCGA GGCACCCGCC GTCGTCTGCT CTGATCGCAG TCTCTCGTAC GGCGAACTTG ATCGACGGTC GACGCGCCTC GCCGCGTTCC TCGGTCGCCA CGGCATCGGT CCCGGCAGCC TTGTCGGCAT CTACCTCGAG CGGTCGGAAG AAATTGTTGT CGCTGTACTC GGCGTGATGA AATCCGGAGC GGCCTTTGTG CCGTTGGATC CGGTGTATCC ACCCGACCGC ATCACTCAGA TGCTCACCAG TTCGGGGTCG ACACTGCTGC TCACCCGGAC GAGTTTGGAG CCCGATGTCC GGGACTGCCC GGCGACCGTC GTCACGCTCG ATCAGTACTG GGATGTGATC GCGACAGCCG GGGGCGGTCC GGGCGAGGAA CATGACGAGG AATCTGACGA GGAAAAATAT GACCGAGGGT CACCGGAGGG TCGTGCTTAT GTAATCTACA CGTCGGGGTC GACCGGCCGG CCCAAGGGGG TGGATGTCGG CCACCGGGCC CTGACGAACC TTCTGTGCTC GATGGCGCGG ACTCCAGGAT TTACCGAGTA CGACCGCCTG CTGGCGGTGA CCACGGTCTG CTTTGACATC GCCTACCTCG AGCTTCTCCT GCCGCTGGTG ACAGGCGGGC AGGTCGAGGT GGTACCCGCC GATGTCGCCA GTGACGGGTT CGAACTGCGG AGACGGATCG AGCGGAGCCG CCCGACGGTC ATGCAGGCGA CCCCGGCGAC CTGGCGGATG CTGATCGCCG CGGCCTGGGA GGGTGACCGT GGCCTGACGG CCCTGTGCGG AGGCGAACAG CTGCCTCGCG ACCTCGCTGA CGGCCTTCTC GCGCGGGTTG CGAAGGTGTG GAACCTCTAC GGCCCGACCG AGACGACGAT CTGGTCCTCG GTCGACCGCG TGGAGCCGGG TCGGCAGGTT ACGATCGGGC GACCGATCGC CAATACCCGG TTTTACGTCC TCGACAGATG GCTTCAGCCG GTGCCGCCCG GGGTACCCGG CGAGCTGTAC ATCGGTGGGG ACGGGGTGGC CGCGGGCTAT CTCGGCGAGC CGGAGCTCAC CCGGGAAAGG TTCGTCCGAG ACCCGTTCGG ACCAGACGAG TCCGCCGTCA TGTACCGGAC CGGGGACATC GTGCGCCACT TGCCCGACGG CCGGATTGAC TACCTCCACA GGGTCGACAA CCAGGTCAAG CTGCACGGAT ATCGCATCGA ACCCGGTGAG ATCGAAGAAG CTCTGCGCCG GCATGACGGC ATCGCTGAGG CGGTGGTCTG CCCACGCGAC ATCGCCCCCG GTAACCGCCA GCTGGTGGCC TACCTCGTCT CGGCCGAATC CGGCCTCGGC GCGCGGCCGG AGGAACTCCG GCGGTACCTA CGTACCCGAC TACCTCCCTA CATGATCCCG GCTGCGTTCG TGACCGTGAC ACGTCTGCCG CTGACAGCCA ACGGCAAGAT CGATCGTCGG TCCCTACCGG GGCCGCCTGA GGTTCTCCCG CCCGCAGGCG GAGAGCGGGT CCTGCCCCGT AGCGAGCTCG AGGCGGCAGT CGCCAGCATC TGGCGGGGTG TCCTGGGGGT GCCGGAGGTC GGAATCGACG AGAACTTCTT CGACGCCGGC GGTAACTCGT TGCTGCTCAT GCAGGTTCTG ATCCGGCTTC AGGCGGAGCT GTCCGAGTCG CTTACGCGGG TCGACATGTT CGCCTATCCG ACTGTCCGGG GACTGGCGGC ATACTTGTCC ACCGCTGCGC CACGCCGGTC CGGTGACGCC GGATCCGGCG CGCCACACCC TCCCGGACGA GCTGGGGCAC CCGAAGAATC CGGGGGGCGT TCCCGGTCAG CACTCGCGGG CCTCAGGCGC AGACGTGGTG GTCTCAGCTC CCGCCAGCCA CCCGGCGCAT GA
|
Protein sequence | MRSGGDVERI PLSAVQHAML LHSGSAPGEG FYVLQKLFTL REQLNLAVFE QALRDLFDWH PMLRTRLLLD HPDGPSQVVS SDVDVSLPAR DWRQLDSATR QQSLREYLDD DKRRSFDFAA GPPMRFAVFR MDNHEYEFVW TTHHALMDGR SLFMVSAELF ERYDHLLVGR RAERPRPRPY RDYVAWLAEQ DLVAAESFWR ADLAGVTEPT PFVVDVVRVP DPGENVLTAV ETELSEEMTR ALELLVREQQ VTANNVVQAA WTVLLQQLSG RHDVVFGTTR ACRGFLDGAQ DMLGLFLNTL PFRVVAPSTM TVIELLKRLR RHQLALREFE HTSPTRIHTW SDIPASLPMF ESLLIFENYL PATRLRSLGG SWRHREFRLE EQTNYPLTLY ANHDHRLILR IGYDRRRFTD KTVERMGSQV RRLLEAMVAA PEVPLGALPK LAPEELAGRR TKGLAGPVGP VEPVEPGEPA GVVEHETDWV RRLMSVEPLD LPRRNPVADP PHRYRDIRFP VPADLAGVEE KMAARDGDGR GRDRSEWLLT VLLAFLVRFS GADGPDVDLS WEAESDASGG DGIPLARHRP FRVPSLNESR GFLEFRREVA RQLRLFSGRS GHPRDLGVRH AELRACAPWA ERGMPVAVEL VDNLDVAAEP RSATVLFAQI PRDGEGCRWL IAENVLQTGA ADTMRSMFVA FLEALTGSAT ADLTQLPLLS PEDTQRVLVT WNDTDADYPR DLCAHQLFIR QARSRPEAPA VVCSDRSLSY GELDRRSTRL AAFLGRHGIG PGSLVGIYLE RSEEIVVAVL GVMKSGAAFV PLDPVYPPDR ITQMLTSSGS TLLLTRTSLE PDVRDCPATV VTLDQYWDVI ATAGGGPGEE HDEESDEEKY DRGSPEGRAY VIYTSGSTGR PKGVDVGHRA LTNLLCSMAR TPGFTEYDRL LAVTTVCFDI AYLELLLPLV TGGQVEVVPA DVASDGFELR RRIERSRPTV MQATPATWRM LIAAAWEGDR GLTALCGGEQ LPRDLADGLL ARVAKVWNLY GPTETTIWSS VDRVEPGRQV TIGRPIANTR FYVLDRWLQP VPPGVPGELY IGGDGVAAGY LGEPELTRER FVRDPFGPDE SAVMYRTGDI VRHLPDGRID YLHRVDNQVK LHGYRIEPGE IEEALRRHDG IAEAVVCPRD IAPGNRQLVA YLVSAESGLG ARPEELRRYL RTRLPPYMIP AAFVTVTRLP LTANGKIDRR SLPGPPEVLP PAGGERVLPR SELEAAVASI WRGVLGVPEV GIDENFFDAG GNSLLLMQVL IRLQAELSES LTRVDMFAYP TVRGLAAYLS TAAPRRSGDA GSGAPHPPGR AGAPEESGGR SRSALAGLRR RRGGLSSRQP PGA
|
| |