Gene Francci3_0991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0991 
Symbol 
ID3905847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1174730 
End bp1177960 
Gene Length3231 bp 
Protein Length1076 aa 
Translation table11 
GC content63% 
IMG OID637878324 
Productacyl transferase region 
Protein accessionYP_480103 
Protein GI86739703 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGAAGC AGAGGAAGAG CGACGATCGA CTGCCCCGGG TCTCCCCGGA TCTTGCGGAC 
CCGATCCGGC CCGTCCTGGT CGAACAGCTG ACCAACAGTG ACATCACGGT CACTACAACC
GACGACTCCG CCAACGACGC TCCGGTCAAG AAGGCGGATC AGACCATCCG CGCGGATCCG
AGTAAGGGTT CGGGTGAACG GACGCGGGAT GCTGGGGGAA ACACCTCCAG CGCGGTTGCG
GTGATTGGGA TCGGGTGCCG CCTTCCTGGC GGCGTTGGTT CCGCGGCGTC GCTGTGGGAT
TTCCTCCGCG ACGGTCGTGA TGCGGTCACC GATGTGCCAT CAGAGCGATG GAGTGAGGCG
TCGCGGGAAG CATCGCAGGC AGAGACCGAT CGGAACGTGC TGTGGCGTGG CGGGTTTTTG
TCGGAGGACG TCGGCGCCTT CGATCCAGAG GCCTTCGGTA TCGACCCGAC CGAGGCAGAG
TTGATCGATC CTCAACATCG GCTGTTGCTG GAGGTCGTGC AGGAGGGGTT CGAGCACGCT
GGATTGCCAA CCGATGACCT GGTAGGGAGC AGCACTGCTG TGTTCGTCGG CATGTCCAAC
TTGGACCATA TGCTCCACGC ACATCAGCTC CCGAGCGGCG GGGGCCCCTA CTTTGTGCCC
GGTAACCAGG CCGGACCGGC ATCCGGTCGG ATATCGCACG TCTTCGGTTT GCGCGGTCCC
AGCATGACCG TGGACACGTC GTGTTCGACA GGGTTGGCCA CCGTGTACCT GGCATGTAAC
AGCCTTCGGC AGGACGAGTG TGACCTCGCG ATCACTGGCG CGGTCAACCT GCTACTCAGC
CCCCGAACGT TCCTGGCCTA TAACGAGTTG GGAGTGCTAT CTCCCACCGG GCGATGCTTC
AGCTTTGACG AACGAGCGGA CGGCTACGTT CGGGCCGAGG GCTGCGTGGT CCTCGTACTT
AAACGTCTCG ACGACGCGAT GCGTGACCAG GACCGCGTGC TCGCCGTGCT ACGGGGGGTG
GCGGTCAATC ATGACGGGAA GACGTCCCCG TTCACCGTTC CTTCCGAGCA AGCTCAGGAA
GAGGTGTTCC GTACCGCCCT GAGTATTGCC GATGTCGACC CGGAAGAAAT TGGAATGATC
GAAGCGCACG GCACCGGAAC CATCGTTGGC GACCCGATCG AGTTCCGTTC GCTGGCCGCC
GTCTACGGAC GGGGACGAGG CCGGTGTGCA TTGGGTTCGG CTAAGACGAA CTTCGGCCAC
GCCGAACCCG CCGCGGGCAT GGTGGGGCTA CTCAAAGCCA TATTGGCGGT GTACCACGGT
GAGGTCCCGG CGTCCCTGCA TTTCCGACGG TGGAACCCCG CTATCAACCC GTCCGGCACC
AGGTTGTTCG TTCCGACCGT AACGACACCA TGGCCAGTCA CCGGTGGTCC GCGCCTCGCG
GCCGTGTCCT CCTACGGTGT GGGCGGGACC AACGCGCATG CCATCGTGGA GGAACCGCCG
GATTCCAGCT CCCCGGTCGC GCCGAGGTCG TCCAGCAGCA GCGACGATTC CGTTCTCACG
TTCCTGCTGT CGGAAGGATC AGAAACTGCG CTGCAGCATT CCGCGATCAG GCTCGCCGAC
TGGCTCATCC GCTCCGGCGC GACGACACCC TTGAAAGATA TCGCGCACAC GCTCGCGGTA
CGCCGCTCGC ACGGCCCCGA GAGACTGGCG GTCGTCGCTC GCTCCCGGGA CGATCTCGTC
AACCGCCTAC GCGCATATGC CGACGGGACA CACCCCCGTC CTGAAGGCAT GGTCAACGAT
TATGTCCACG CCGAGCACGC AACGGGACCG GTGTGGGTGT TCAGCGGCCA CGGCTCGCAG
TGGCCCGGTA TGGGGCGAGA CCTGTTCGCC ACCGAACCCG TGTTTGCCGA CACGATCGCG
GCGCTCGACC CCCTGATTCG CGCCGAGTCG GGTTTCTCGC CGGAGGAGGT GCTGCGGGCC
GGTGACGAGG TGACCCGAAT CGACCAGGTA CAGCCCTTGA TTTTCGTGGT ACAGGTCGCT
CTGGCGCGCA CCCTGCAATC CCACGGCATC CACCCCGCCG CCGTAGTAGG ACACTCCATG
GGTGAAATCG CCGCCGCCGT GATCGCGGAG GCGTTGACCG TGGAAGACGG CATCCGCGTC
ATCTGCCGTC GGTCTCGGCT GTGCGTTCCC ATCGCCGAAG CCCGCGTTGC GGCGATGGCG
GTCGTCGAAC TGGATGCGGC CACGGTCCAG GCAGAGATCG ACCATCTTCC CGACGTGGCT
GTCGCGTTCT TCGCCGCACC CCGGTCCACC GTGATCGGCG GTACTCGAGT CGAAGTCGAA
CGCCTGGTCG AAAGCTGGAC ATCCCGCGAC GTTCCCGCGC ATATGATCAA TGTCGATGTC
GCCTCGCACT GTCCCCTGAT CCATCCAGTC GCCGACGCGT TGACCGCTGA GCTCAGCGAT
ATACGGCCTA GGCAGCCGAC GATCCGTTTC TATACCACGG TTCTGCCTGA TCCGCGGCAG
ACACCCACAT TCGATGCTGC CTACTGGGGT GAGAACATGC GCTGCCCGGT TCGGGCGGTG
GATGCCACGA CCGCCATCGT CAACGACGGA CACCAGCTTT TCCAGGAGAT CTCTCCACAT
CCGGTGGCAA TCCACCCGCT TATTCTGACC CTCCAGGCGG CCGGGGCGCC GGAGGCCACG
GTTGTACCGA CTACCGATAA CAGGCACGAT CAGGCCACAG CCCTACGAAC GAGCATCGCA
GCGCTTCATT GCGCCGGGCT CGACATGAAT TGGCGGCGGT GGCACGGCGA TGGAGCCATC
GCGGACGTGC CGCCTACGAC GTGGGACAGA CGCACCTATC TGATCAAGCT ACTCAGGAAC
CTGGCGCCCC CGACCGACTC GGCCCGGTCA GCGCATACAG AGCCGACCGA GCAACGTCCG
GACACTGGGA CGGACGCCGA CATCAGCGCC GAGATTTCAC AGGCTACCAG AGATGAACGC
CTCAAGATCA TCAGAAAGAT GATCATTGAA ATCTTGCGTG AGATCCTCAG CTTGCGAGCG
CGTCGACTGA GCCCTAGTGC CGCCTTTTCG GAACTCGGTC TGAACTCCCT ACGCGCCGTG
GAATTCCGCG GACGAATCCA GCAAATATTC AAGGTCTCCA TCTCGCTCGC CGCAATCCGG
GAGCATCCCA CAATCGCAGA ATTCAGTGAG TATATCGCCG AACTGCTGTA A
 
Protein sequence
MSKQRKSDDR LPRVSPDLAD PIRPVLVEQL TNSDITVTTT DDSANDAPVK KADQTIRADP 
SKGSGERTRD AGGNTSSAVA VIGIGCRLPG GVGSAASLWD FLRDGRDAVT DVPSERWSEA
SREASQAETD RNVLWRGGFL SEDVGAFDPE AFGIDPTEAE LIDPQHRLLL EVVQEGFEHA
GLPTDDLVGS STAVFVGMSN LDHMLHAHQL PSGGGPYFVP GNQAGPASGR ISHVFGLRGP
SMTVDTSCST GLATVYLACN SLRQDECDLA ITGAVNLLLS PRTFLAYNEL GVLSPTGRCF
SFDERADGYV RAEGCVVLVL KRLDDAMRDQ DRVLAVLRGV AVNHDGKTSP FTVPSEQAQE
EVFRTALSIA DVDPEEIGMI EAHGTGTIVG DPIEFRSLAA VYGRGRGRCA LGSAKTNFGH
AEPAAGMVGL LKAILAVYHG EVPASLHFRR WNPAINPSGT RLFVPTVTTP WPVTGGPRLA
AVSSYGVGGT NAHAIVEEPP DSSSPVAPRS SSSSDDSVLT FLLSEGSETA LQHSAIRLAD
WLIRSGATTP LKDIAHTLAV RRSHGPERLA VVARSRDDLV NRLRAYADGT HPRPEGMVND
YVHAEHATGP VWVFSGHGSQ WPGMGRDLFA TEPVFADTIA ALDPLIRAES GFSPEEVLRA
GDEVTRIDQV QPLIFVVQVA LARTLQSHGI HPAAVVGHSM GEIAAAVIAE ALTVEDGIRV
ICRRSRLCVP IAEARVAAMA VVELDAATVQ AEIDHLPDVA VAFFAAPRST VIGGTRVEVE
RLVESWTSRD VPAHMINVDV ASHCPLIHPV ADALTAELSD IRPRQPTIRF YTTVLPDPRQ
TPTFDAAYWG ENMRCPVRAV DATTAIVNDG HQLFQEISPH PVAIHPLILT LQAAGAPEAT
VVPTTDNRHD QATALRTSIA ALHCAGLDMN WRRWHGDGAI ADVPPTTWDR RTYLIKLLRN
LAPPTDSARS AHTEPTEQRP DTGTDADISA EISQATRDER LKIIRKMIIE ILREILSLRA
RRLSPSAAFS ELGLNSLRAV EFRGRIQQIF KVSISLAAIR EHPTIAEFSE YIAELL