Gene Franean1_2761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2761 
Symbol 
ID5671150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3268666 
End bp3272019 
Gene Length3354 bp 
Protein Length1117 aa 
Translation table11 
GC content70% 
IMG OID641241670 
ProductWD-40 repeat-containing protein 
Protein accessionYP_001507090 
Protein GI158314582 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.645221 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.818272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGTCAA GTGCCCGTGC GGGCAGAGGA AGTATCGCGG GGTCGTCCGG CGACGGCGGG 
GTGGAATTCA GGCGGGCCGT CGCTGCCTAT GCGGTCGTGG CTGGGCTCTC CGACACTCCG
CTGCGTGGTC TGGGCATCTC CGAGGTCGAC GCGCGGGTGA CCGCGGTGGT GCTGGAGACC
GAGGATGCGG TCGACGATCT GCGGGTTGAC TTCGCGACCG GTGGGACGGC GTACATCCAG
GCGAAGAGGA CCTTACGGGG AGGCACCGTC CTCGCGAAAG CGGTCGCGCA GTGGGCGCGC
GCGGGCGAGG GGCCACTGGA CCCGGCCCGC CATCGGTTGG TGATCGCAGC GGGTGAGCTG
TCGGACTCGA TGAAGGCGCT GCGCCGGGCT CTCGACGCGC GGCGGTTGTC GGTCCCAGGA
ACTCCGAGCC CGCAGGAAGC GGAACAACTT CACCGCGTCG AGGAGCTGTT GTCGCCGCTC
AACCCTGCGG CACGAGACAT CGTGCTTCAG TGCGCGGTCG TCTGGCATCT TGACGTCGAG
GAGACGGACC GCGCCGACGC GCAGAACGCC TTGCATCAGC TACGCGGGCT CGTCGCGGGC
GGAGACGCCC CGGCGGCAGG CAACGCGTGG TCGGCGCTGC TTGATGCTGC TGGGTATGCG
GCTCGCCGCC GCGGTGGCCA TAATCTCGAC GGTTGGCGAA CCGCGATGCG CGATCGCGGC
ATCGGTCTCG TCGAGCACAG CAACGCACCT GAACACTCTG GGAGCCAGTC CAGCCTCGCC
CGGCCATCCA GCCATCCGAT TGGGGACCAC GACCTGCGGG GCCGGCTCAC GCTGCAACCG
CCGGTCCGCG CGCCACGACT GAGGGGAAGC CTGGCGTCGG CGACCCGGCC TGGCAGGACG
AGGTGGCTAT GGGGGGCCGC CCTGGCGGCC GTTCTGATAA TGGGGGCGGG TACCGCGATC
TATTCGGTGC ATGACCGGCA GGCTTCCGCG ACTGACGAGC GGAACCTGGC TGACTCCCGC
CAGTTGGCAG CCGACGCGGA GAGCACGCGG GACAGCGACC CGAACCTGGC ACGCCAACTG
GCTGTGCTGG CATGGCGCAC CGCGCCGACC ATGGAGGCAT GGCGAGCACT GTTCGCGGTC
GCCGATCTGC CGACCGATGT CGACGTGGAC GATCAGGTCC TGGCGGTGGG CCGCGTCGGT
GAGCGGTTCG TCGCCGTCAC CGCGCTGTCC ACGAAGGCCC AGCTGTGGGA TGTCACCCGG
GTCAGCCGCA AGGCCGAACC ACTGGCCGTG CTGCCGGGCC AAGGTCGGGT CACCGCGGCG
GTCTTCAGCC CGGATGGTCG ACGGCTCGCG ACCATGGCAG TCGACGGCAG CGTCCTGGTC
TGGAACGTCG AGAGTGCCAC CGCCGGACCG CCCGGGAACA ATCCGGTGGA GCTCACAGCA
CTAGCGTCCC TGCCCGGGAA CGACTCCAAC TCGCCAGCAA TGGCCTTCAA CTCCGACGGC
ACCCTGCTCG CGGTCCCCGG CCCCGAAGGG TCGGTGGAGG TCTGGAACAC CCGGCTGCCG
GCCGGGAGCG CCTCCGCTCA GAACAGCACA GGCCCCGACC TCGAGGCCCT GCCGCTCCAA
ATCCCCGGCG TCGTCAGCGG CATCGACGAT CCCGGCGTGT CCGCGCTGGC GTTCACCCCC
GACAGCGGTC GGCTGGCCGT TGGAACGTCC GGGGGAACCT TCGCTGTCTT CGCTCTGGGG
CGGTCGGCGG AACCACTGAT GCGGCGGACC AACTATGCCG GCGCCATGAA CGCCGACGCG
GTCGAAGGCA CTGCGGGACA GGCGACGCCA CCGGACACGA TGGAAACCAG CCACCTCCCG
CCGCAGCGGC AACCCGTCAC GCGACTGCTG TTCCGGCCGG ACGACGGCGC GACCCTCGCC
GTGGCTGGAG GCGACGGCGT GGTGCGCCTG TGGGACGTCG CCGGGACCGA TCCCACGCTG
GCATCGCAGA ACGCCCCCGC GAACCTGCCC ACGAAGCCGA CCGCGGAACT CGGCGACGGC
GCGGCCGCCG TTGACGCCAT CGCCTACAGC GCCGACGGCC TCCACCTGGC CGTTGGCTTC
GGGGCCAGCG GTGGGACCGT CAACGTATGG GACGTCACCG ACGAGAGCAG CCCGGCCCAA
GGCGCCCCCA TCGAGCGGCT CAACGACTCG CCGACCTCCC TGACCTTCGA CCGGGCCGGT
CAGACCCTCG CGGTCGGGAC CAAGGACGGT GCTGTGCGGC TGTCGACCGT TGCGTCTCCC
GGGATGCCTC AGCCCAGACT CACGGTGCCG GACATCCTCG GGGACCATGT GGCGGTGAAC
GGCGCGGGAA CGACCGTCGC TGTATCGCTC GCTGTCGACG GCGTGGACGG CGCGATCGAG
ATCTGGGATC TCCGCGCCGG ATCGAGCGAG CCGACCGTCA CGTTCCCCGG CCGCGACGCG
ACGGTCTACT CGATCGCGAT CAGCCCGGAC GGCGCCCTGC TCGCCGCCGG ATACGACGAC
GGGGTCGTCA GGGTGTGGGA CCTGCGCGCT GCCCTCTCGA CACGCGCCGC CGCATCACCT
GTGGCCGACC TCGCCGAGCA CGGCGCGATC GTGCAGGGCC TCGCCTTCAC CAGCGACGGC
AGACTGCTCG CGTCCGGAGA CCAGGACGGG ACGATCAAAC TCTTCCGACT CGACGGCACC
CATCCCCCGA AGCGCATCGC CGAGGCGGCA TCCACCCACG GCTCGGTCCA GGGAATCACC
TTCAGCGCCG ACAACGGAAC ATTGGTGACT GGCGGTGATA GCGGTGTGGG CGTCTACGAC
GTCCCGGACG TCAGCACCGG CGGGCCGCTG ACCACGGTCA CCGACATCTC GGACGCAACC
ACTGGGAACC TCTCCGGAGG TGTCGCGCTC AGCCCCGACG GACGCACACT CGCCGTCGGC
GGCCACGGCA CCGTCACGAC CTGGGACATC ACCGGCCCGA CGCCACAGCA GCAGGGCACA
TCCCTCCCTG TCGGAAGCAA GGAGGCGGCG GTGAGCACGG TTGACTTCAG CCCGGACGGC
CGCCTGCTGA CGGCAGCCGG CAACGACGGC GACATCGCAA TCTGGGACAC CAGCCGCCCC
GGCCAGCCGA AGCCGCTCAC CAACCTCCCC GGCCACGACA ACGGAACCCG CCTCGCGGCG
TTCACCGACG ACAGCCGAAC ACTCGTGACT CTCGGCTCCG ACTACCTCGC GAAACAATGG
GACGTCGACC CCGAAACACT CGTCGGCCGC GCCTGCAACG GAGTCTCCCA ACCGATGACC
AAGACAGACT GGCGCACCTT CGTCAACAAC CGCGAGTACG CCGCGCCGTG CTGA
 
Protein sequence
MPSSARAGRG SIAGSSGDGG VEFRRAVAAY AVVAGLSDTP LRGLGISEVD ARVTAVVLET 
EDAVDDLRVD FATGGTAYIQ AKRTLRGGTV LAKAVAQWAR AGEGPLDPAR HRLVIAAGEL
SDSMKALRRA LDARRLSVPG TPSPQEAEQL HRVEELLSPL NPAARDIVLQ CAVVWHLDVE
ETDRADAQNA LHQLRGLVAG GDAPAAGNAW SALLDAAGYA ARRRGGHNLD GWRTAMRDRG
IGLVEHSNAP EHSGSQSSLA RPSSHPIGDH DLRGRLTLQP PVRAPRLRGS LASATRPGRT
RWLWGAALAA VLIMGAGTAI YSVHDRQASA TDERNLADSR QLAADAESTR DSDPNLARQL
AVLAWRTAPT MEAWRALFAV ADLPTDVDVD DQVLAVGRVG ERFVAVTALS TKAQLWDVTR
VSRKAEPLAV LPGQGRVTAA VFSPDGRRLA TMAVDGSVLV WNVESATAGP PGNNPVELTA
LASLPGNDSN SPAMAFNSDG TLLAVPGPEG SVEVWNTRLP AGSASAQNST GPDLEALPLQ
IPGVVSGIDD PGVSALAFTP DSGRLAVGTS GGTFAVFALG RSAEPLMRRT NYAGAMNADA
VEGTAGQATP PDTMETSHLP PQRQPVTRLL FRPDDGATLA VAGGDGVVRL WDVAGTDPTL
ASQNAPANLP TKPTAELGDG AAAVDAIAYS ADGLHLAVGF GASGGTVNVW DVTDESSPAQ
GAPIERLNDS PTSLTFDRAG QTLAVGTKDG AVRLSTVASP GMPQPRLTVP DILGDHVAVN
GAGTTVAVSL AVDGVDGAIE IWDLRAGSSE PTVTFPGRDA TVYSIAISPD GALLAAGYDD
GVVRVWDLRA ALSTRAAASP VADLAEHGAI VQGLAFTSDG RLLASGDQDG TIKLFRLDGT
HPPKRIAEAA STHGSVQGIT FSADNGTLVT GGDSGVGVYD VPDVSTGGPL TTVTDISDAT
TGNLSGGVAL SPDGRTLAVG GHGTVTTWDI TGPTPQQQGT SLPVGSKEAA VSTVDFSPDG
RLLTAAGNDG DIAIWDTSRP GQPKPLTNLP GHDNGTRLAA FTDDSRTLVT LGSDYLAKQW
DVDPETLVGR ACNGVSQPMT KTDWRTFVNN REYAAPC