Gene Franean1_7085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7085 
Symbol 
ID5675395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8648994 
End bp8652791 
Gene Length3798 bp 
Protein Length1265 aa 
Translation table11 
GC content70% 
IMG OID641245930 
ProductWD-40 repeat-containing protein 
Protein accessionYP_001511321 
Protein GI158318813 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGCCC CGGGCGCCCT TGATGATCCG ACTGGGACGG TCGTCTGCCC GTATCCGGGG 
CTTGCCGGAT TCGATGCTGC GAGTGAGCGG TGGTTCTTCG GTCGGGAGCG GATGGTTACT
GACCTGGTCG TGCGGGTGGG GGTGCGGCTG AGTGAGGGTG GTCCGCTGGT CTTGGTGGGG
GCGTCCGGGT CGGGTAAGTC GTCGTTGTTG CGTGCGGGCC TGTTGCCGGT TCTGGCACGT
GGGGTGGTCG CAGGGTCGGA CTCTTGGCCG CAGGTGTTGA TGACGCCGGG TGAACATCCC
TTGCGGAGTC TGATCGATCA GATCGCGGCT GCCGGAATCC CTGCGGCGGC GTTGCTCAAC
GACGGTACGG CTGTGGTCCC GGAGGTCCCG GACCGGCTCG CCGAGCTCTT GGCGTCCCAC
GCGGGCGGCA GTGGCCTGGT GATCGCGGTG GACCAGTTCG AGGAGGTCTT CACCTTGTGT
GACGATCTTT CGGAGCGGGA GGCATTCATT CGGGCACTGT GCGTGGCAGC GGCCTGTCGG
GGTGATCGGG GGCCGGCTGC TGTGGTCGTC CTTGGTCTGC GGGCCGATTT TTACGCCCGG
TGCGCTGCCT ATCCGGAGTT GGTCGAGGCG CTGCAGACTG GCCAGGTGGT GGTCGGTCCG
ATGACGGCCG CGGAGATCCG TGACGTCGTG GTCAAGCCGG CGAGCACGGC TGGTGTTGAT
GTGGAGCCGG GTCTGGTCGA GCTGCTGTTG CGTGATCTGG GGGCGGCGAC GGCGGCGTCG
CCGGACGGCG CGTTGAGTGG TGCGGGTGTG GTCACCGATC CCGGTTCGCT GCCGTTGCTT
GCGCATGCCT TGCGTTCGGC GTGGTTCGCT CGGGACGCAT CGGGGTTGAC GGTGGGGGCG
TATCTGCGGA TCGGCGGGTT GACCGGCGCG ATCGCGCAGA CGGCTGAGCG CGCCTATACG
GGGTTGGACG CCGCGGCGCA GCAGGCCGCA CGGCCGTTGT TGATGCGGAT GGTCCGCCTC
GGCGAGGGGG AACAGGACAG TAGGCGGCGG GTCCGGCGGG CGGACCTACT CGCCGCGGTG
CCCGTCCGCG AGGCGGCCGC CGTGCTGGAC GCGCTGATCG CGGCTTTCCT CGTGGAGGCG
GACGCCGATG GCGACAGCGA CAGCCTGCAG ATCGCGCACG AGGCGCTGTT GTGGTCGTGG
CCGCGGTTGC GGGAGTGGAT GGACCTCGAC CGGGCAAGCG CGGTCGCGCT GCAGCAGCTC
AACGACGCGG CCGAGATCTG GGAAGCGGGG GGCCGCGATC CCTCCTACCT GTTCGGCGGG
ACACGGTTGG CGGCGGCCCG TGAATGGGTG GAAGCCCACC CGGGCCGTGC GCCTATGTCC
CCGGTCGCTC GGGATTTCTA CGACGAGAGC CTGCGGGCTG ACAGCGCCGC CCAGCGGGCG
GTGGTGCGGC GGACCCGGCG TCTGCGCCGG CTCGTCGCCG CGTTGACGGT CCTGGTGGTG
GCGGCAGCGT CGCTGGCCGG GCTGGTCTAC CGGCAGAAGA CCACGGCGGC GGACGCGCGG
GACCGGGCGC TGTCCCAGCG GATCGCGAGC CAGGCCGATA TCGCCCGTGA TCGGAACCCG
GCGATCGCGG CACAACTCAG CCTGGTCGCC TACCGCACGG CGAACACACC GGAGGCACGG
GGCAGTGTGC TGTCGTCGTT CAACGGTGGT TCCGGGGTGC CAACGCGGTA TCTGTTTCAC
CGCAACGCCG TCGGCACGGT GACCTACAGC CCGAACGGGA AGCTGATCGC GACGGGCAGC
GACGACTGGA CCGCGGCGAT CTGGGACGCA GCCGACCCCC GCCGGTCGAC CCCGCTCGCG
GTCCTGCCGG GCCCGCAGGA CGGGGGTCAT AGCCGGGCGG TGAAGTCGGT GGCGTTCAGC
CGGGACTCGA GGCTGCTCGC CACTGGTAGC GGCGACCGGA CGGCCAAGAT CTGGGACGTC
TCGACGCCGA GCCGGCCCCG GCTGCTGGCG ACTCTGCCGG AGACGACCGG GGACGTCTAC
GGGCTGGCGT TCAGCCCGGT CGCGGACCGG CTCGCCGTCG CCGGGTACGG CAACTCCGCG
CGGATCTACG ATGTGTCCGA GCCGGGCCGG CCGGCCGTGC AGGGTGCGCT CCTGATCAAC
GGTGAGACGC CCTTGCACCA GGGACCGATC GACACACTCG CTTTCAGCCC GGACGGTTAC
ATTCTCGCCG CCGGTGACGA GGCCGCATCC ACGGTGCTCT GGTATGTCGG ACCAGCCCTG
GACATCATGC CGGTCGACCT GCTCAACGTG CCCGACGACG ATCCGACCGC GGACCGGGTC
GGCACGATCC GGGCTGTAGC GTTCGGTCCC GATGGCAGAA GCGTCTACAC CGCGGGGGTC
GCAGGCCGGA TCCGGGTATT CAGCGGACCG GATCTGCTCC ACCTGACCCA TACCGCGACC
ATGGGTGTTG GGAACGGGCC GATGGCCGCG CTCGCGGTCG CGCCGACCGG TGGGCTGGTC
GCCGTGGGTG GAAACCTGTT CAGCGGTGTT CCGCTGTTCG ATCCCGCCGC CGCCACACAG
CAAGCCGTCC TCAGCGATCC CTCCGACGGG GCGAGAGCGC TTACTTTCCT CAATGAGGGC
ACGGTGACCT GGACGGTCGC GTTCAGCCCC GACGGGCGCC ACCTCGCCAG CGGATCCGCG
GATGGCGCAC TGCGGGTCTG GGAGATCCCA GGGCCGGCGC TGATCGGCCG TCACGGCGAC
CAGAAGTCGG CCGAGGTCAA CCCGCGCACC GGCGACGTCG TCACAGTGAC CGACTCGGCG
GCGGAACTCT GGAACATCAC CGACCCGTAC CGTCCCACCT CGCTGTCGGT CATCACCGAC
GCCCTGGCCG ACGAGTACGA CCTCACGACT AGTGCGGCGT TCCATCCCGA CGGCCGCATC
CTGGCTGTCG GGACGGCGAA GCGGGTTTGT CTCTACGACA TCACGGATCC GAGGCGACCA
TTCATGATCA CTGCGTTTGA AGGGCCGGCC GGCGGGGTGT TCTCCCTGCG TTTCAGCCCG
GACGGCCGGA CTCTCGTCCT CGGTGGTCTC GGCTCGTCAC CAGCGCCTGC CTTTTCCGCC
AGTCTGGAGA CCTGGGACGT GCGTGACCCG CGGCATCCGG CCAGGCTTGC CAGCACGGTC
GCGCACCGCG GCAGCGTGCG GGACATCCAG TTCAGCGAGG ACGGACACGT CATGGCGTCC
GTAGCCGATC GCACCGTGCA GCTGTGGGAC GTCAGCGATC CGCGCCGGAT CGCCGCCCGG
GCGACCCTGC CGGACTTCCC CGGCGCGGCG CTGCGCGCTG CGTTCGCCCC GGACGGCCGG
ACCCTCGCGG TCGGCGGCGG AGGACCCTAC GCGACCCTGT GGAACATCAC CGACCCGGCG
CGGCCGACCC GGACCGCATC GCTGCCCGGC CACGTCAGCG AGGTCAACAC CGTGACGTTT
AGCCCCGACG GTCGCACGCT CGTGACCGGC AGCGGGGACA ACTCGGTGCG GGTCTGGGAC
GTCCGCCGTC CCTCGAATCC GAGGTTGGTC GAACGGTTGA GCCGTTCCGC CGGCACCAAC
AGCGGGATCG GCACGGTTGC CTTCTCCCGG GACGGATCCA CCCTGGTCGG TGTCGTCTTC
ACCGAACCAG GAGCACTGTG GGACCTGGAC GTGGACCGGG TCGCGCAACG CATCTGCGCG
CAGGCCGGGG TCGGCATCAC CAGGGATGAA TGGGCGCGTT TCCTTCCGGA CAGGGCCTAC
GATCCGCCGT GTGACTGA
 
Protein sequence
MPAPGALDDP TGTVVCPYPG LAGFDAASER WFFGRERMVT DLVVRVGVRL SEGGPLVLVG 
ASGSGKSSLL RAGLLPVLAR GVVAGSDSWP QVLMTPGEHP LRSLIDQIAA AGIPAAALLN
DGTAVVPEVP DRLAELLASH AGGSGLVIAV DQFEEVFTLC DDLSEREAFI RALCVAAACR
GDRGPAAVVV LGLRADFYAR CAAYPELVEA LQTGQVVVGP MTAAEIRDVV VKPASTAGVD
VEPGLVELLL RDLGAATAAS PDGALSGAGV VTDPGSLPLL AHALRSAWFA RDASGLTVGA
YLRIGGLTGA IAQTAERAYT GLDAAAQQAA RPLLMRMVRL GEGEQDSRRR VRRADLLAAV
PVREAAAVLD ALIAAFLVEA DADGDSDSLQ IAHEALLWSW PRLREWMDLD RASAVALQQL
NDAAEIWEAG GRDPSYLFGG TRLAAAREWV EAHPGRAPMS PVARDFYDES LRADSAAQRA
VVRRTRRLRR LVAALTVLVV AAASLAGLVY RQKTTAADAR DRALSQRIAS QADIARDRNP
AIAAQLSLVA YRTANTPEAR GSVLSSFNGG SGVPTRYLFH RNAVGTVTYS PNGKLIATGS
DDWTAAIWDA ADPRRSTPLA VLPGPQDGGH SRAVKSVAFS RDSRLLATGS GDRTAKIWDV
STPSRPRLLA TLPETTGDVY GLAFSPVADR LAVAGYGNSA RIYDVSEPGR PAVQGALLIN
GETPLHQGPI DTLAFSPDGY ILAAGDEAAS TVLWYVGPAL DIMPVDLLNV PDDDPTADRV
GTIRAVAFGP DGRSVYTAGV AGRIRVFSGP DLLHLTHTAT MGVGNGPMAA LAVAPTGGLV
AVGGNLFSGV PLFDPAAATQ QAVLSDPSDG ARALTFLNEG TVTWTVAFSP DGRHLASGSA
DGALRVWEIP GPALIGRHGD QKSAEVNPRT GDVVTVTDSA AELWNITDPY RPTSLSVITD
ALADEYDLTT SAAFHPDGRI LAVGTAKRVC LYDITDPRRP FMITAFEGPA GGVFSLRFSP
DGRTLVLGGL GSSPAPAFSA SLETWDVRDP RHPARLASTV AHRGSVRDIQ FSEDGHVMAS
VADRTVQLWD VSDPRRIAAR ATLPDFPGAA LRAAFAPDGR TLAVGGGGPY ATLWNITDPA
RPTRTASLPG HVSEVNTVTF SPDGRTLVTG SGDNSVRVWD VRRPSNPRLV ERLSRSAGTN
SGIGTVAFSR DGSTLVGVVF TEPGALWDLD VDRVAQRICA QAGVGITRDE WARFLPDRAY
DPPCD