Gene Franean1_4246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4246 
Symbol 
ID5672601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5056197 
End bp5059073 
Gene Length2877 bp 
Protein Length958 aa 
Translation table11 
GC content67% 
IMG OID641243119 
ProductWD-40 repeat-containing protein 
Protein accessionYP_001508536 
Protein GI158316028 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.713375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTTT CATCGAACTG GGGGCGAGCC GGCTCCGCAC ATCACTACGA TGCTTTCATC 
TCCTACACCG AGGCAGACGT CGCGATCGCC CGGGAACTAG AACGCGCAAT CGCCTCGCAT
GTTTCTCCGG ACCCCCGACT GAAGAGCCCG AGAGTCTTCC GGGACCGTAC CAAACTAACT
ACCGCTCCAC GGCTCGCCGC GGCCATCGAG CAGGCTCTCG ATCGCTCGGA ATACTTCGTG
CTGGTGGCCT CACCGGACGT CGTCCATTCC AGGTGGGTAC GGCTGGAGAT CGAGGCGTGG
CAGCGGAGGG ACTACCGGTT CGAGCGACTC CTTGTCGTCC TGGCCGGGGA GGATCTTCGC
ACGTCGCTCC CCGGCCCACT CCGTGCGTTC TACTTTCCCG AGCGCGTCGA CAGCACCGGG
GACGCCCGAC ACACACCCCT GGAGCTCTCC CCCAACTACC TGTCCGTCAA GCCAGTGATC
CGCTCACTGC ACAAAAAGCA CGGCAGACGC CCCAGCCGCC CGGGCCGACT GTCCCAGAAC
GCGACCGAAG AAGAGATAGG AGAGTTCGAG AGCCAGAGAC AGAAATGCGA GGAATGGGAC
CAGAATGCCT CGTCCCTAAT CACCAGCCAT GACGAATTCG CCGCGAGCGC GGGCGAGATC
ACGGCACGCC TTTATAATGT CGACAAGGAT GAGCTCGACC AGGCAAAGAT ACGCGAGGAG
AGGAAGAAGA AGCGGAAGCG GCGCTGGCTG AGAGTCATTT CGGCGACGGC CACGGCAGGT
ATTCTGAGCC TCGTCGTGAT CCTCTTGGCG CTACGGCAGG AGGCGGGCCG GCAGCGTTCG
GACGCGGACC AGAAGCGTGC GGTAGCACGC TCCGAGCAGC TTGCCCAGAC TGCCCGCGCC
CTGCTTGAGA CCTCGCCGAA CACGGCGAGA CAGATCAGCC TGGCCGCCTA CGACGTCGCT
CCGACAGCCA CCGCGCGGAG CGCCATGATG GCGGCCGGCC GGGAGCCGGG AACAATCGAT
GCCCGGTCGG TGGGGGGACT GCGCCTCAGT CCAGACGGTC GCCTCCTCGC GATCGTCGAG
ACCGCCACGA CCGACAACGT CGTCAGGCTC TGGAATCTTG CCACCCATCG TGTCGCCGCG
ACGGTCACCG GCCACCGGGG CAAGGTGAAC GCGGTCGCCT TCAGCCCCGA CGGCCGGCTT
TTCGCGACTG CGGGCGACGA TCATACGGCC CGGATCTGGT CGGTTGCCGA CGCCGTCGCC
GCGCGTGAGG TCAACGTTCT CCGGCCGGGC CTCGGGCCGC TCACGGCGCT GGATTTCGGA
GCCGCGAACC AGTTGGTGCT TGGTAGCGCC CAGATTCGCT CGGTCGCCTC CGATCCAACG
AGCCCCGGCA GAGAGATCGA CAGTGCTACG AGTTCGCTGC AGGCCTGGGT TGGCGTCGAG
GGGCGAAACC CCGTTGGGGG GCCTGTGCTG ACAATGGACG GCGGGAATGT GACCTCAGTC
GATGTCGCAC CCGAGGGCGG GCGTCTGGTC GTTCACGCAG GCGCCACCAT GCTGTGGGAC
GTCGGTCCGA CGAGCATGAT CACAGGCGGC CCGCTCACCC TGCCAGGGAC GCGCGACACC
GACATCTACG CCGATGACCG CCAGGATCTC GACGGGGACG GCTTCGCCGA CCGTGTCGCC
TGTTTCACCA CCGGCGGCCG GGTCGTCGTC TCGGGCCCGC AGGTCTTCGA TGTCGGTGCC
GGCACCCTGG TTCGGGCCGC GTCTCCGCTG GGTGCATCAC AGGACAGCCC GCTGGCAGCG
GCCTGTGACA GCGGAGTCGT CAGCGCCGCG CCGGACATCC GGGGTGTCCG TGTCTGGTCA
ACGCCCGCGA ACGACGCAGG TGCGCTGGTC GCCCGGCCGA TACCCGATCT GGGCGAGGCA
CACGGCGCGA CCGCAGTCCT GAGTACGGAG GGCGATCTGC TCGCTTTCGC GGAACCGGGC
GGCGCGGTCC GCCTTATCGA CATCCGTGAC CCGGCCCGGC TTGGCCGGGT CCAGATCGAG
CAGGCCGAGC CACGGAAGCC GGACCCGGGC ATGTCCTACC GATCGGTCAG ATCGGGTGAG
CGCCATTCGA GCCGGAACGG GACGCTGGCG GTGATAATGG ACGACAGTTC CACGACGGTG
AGTTTCGGGG TGTGGGACTA CTCGGACTCG GCGCATCCGC GCCGCACGGG AACGATCAAT
CTCCCGTCCA GGTCATCACC GGTGCCACCA TCGATCAGCG ATGACGGCAG GCGGCTGGCC
ATGTCGAACA CCGTCAACGC GATCGACATC TTCGGGTTGG ACGACCAGGA TGCACTGGTC
CACATCGGCC GATACGAGGC GTTCGGCGCG CAGATCCCGG GGGCCCCGGG CGCACGCTAC
GCGCCTATTC CCGGAACGTT CAGCGCGAAC GGGCGCATCT ACGCCGCACC GGTCACGGAC
GGCCCGCGGT CCATCCAGCT GTGGGAGGTG CACGAGGATC GCCTCACGGC AGTCTCCACC
GTCCCGACGG CCGGAGAGGT ACTAGCGCTC GACGACGAGG GCGACATGCT GGCCGTCGCG
GATACGGACG GCACCGTGCG GCTCTGGGAC CTGCGGGACA TCCAGGCTCC TGCCCTCTAC
GTTGAGTTCA CTGCCCTGCC CGACAGGCCT GTCACCGAGC TACGGTTCAC CGGCGGTGAC
GACGCCGGAA TGCGTGTCGG TGTCACGCAG GACGACACGG TCTCGTGGTG GGCTGTCAGT
CAGGATGCTC TGGTTGACGA TCTGTGCGCC GAGGTCGGTG ACCTGCTGAG CGCCGACGAA
TGGCGCCGAC TCATCGCCGA CGTGCCTGCG CGCCGGCCCT GCGCAGCGCC CGGCTGA
 
Protein sequence
MSVSSNWGRA GSAHHYDAFI SYTEADVAIA RELERAIASH VSPDPRLKSP RVFRDRTKLT 
TAPRLAAAIE QALDRSEYFV LVASPDVVHS RWVRLEIEAW QRRDYRFERL LVVLAGEDLR
TSLPGPLRAF YFPERVDSTG DARHTPLELS PNYLSVKPVI RSLHKKHGRR PSRPGRLSQN
ATEEEIGEFE SQRQKCEEWD QNASSLITSH DEFAASAGEI TARLYNVDKD ELDQAKIREE
RKKKRKRRWL RVISATATAG ILSLVVILLA LRQEAGRQRS DADQKRAVAR SEQLAQTARA
LLETSPNTAR QISLAAYDVA PTATARSAMM AAGREPGTID ARSVGGLRLS PDGRLLAIVE
TATTDNVVRL WNLATHRVAA TVTGHRGKVN AVAFSPDGRL FATAGDDHTA RIWSVADAVA
AREVNVLRPG LGPLTALDFG AANQLVLGSA QIRSVASDPT SPGREIDSAT SSLQAWVGVE
GRNPVGGPVL TMDGGNVTSV DVAPEGGRLV VHAGATMLWD VGPTSMITGG PLTLPGTRDT
DIYADDRQDL DGDGFADRVA CFTTGGRVVV SGPQVFDVGA GTLVRAASPL GASQDSPLAA
ACDSGVVSAA PDIRGVRVWS TPANDAGALV ARPIPDLGEA HGATAVLSTE GDLLAFAEPG
GAVRLIDIRD PARLGRVQIE QAEPRKPDPG MSYRSVRSGE RHSSRNGTLA VIMDDSSTTV
SFGVWDYSDS AHPRRTGTIN LPSRSSPVPP SISDDGRRLA MSNTVNAIDI FGLDDQDALV
HIGRYEAFGA QIPGAPGARY APIPGTFSAN GRIYAAPVTD GPRSIQLWEV HEDRLTAVST
VPTAGEVLAL DDEGDMLAVA DTDGTVRLWD LRDIQAPALY VEFTALPDRP VTELRFTGGD
DAGMRVGVTQ DDTVSWWAVS QDALVDDLCA EVGDLLSADE WRRLIADVPA RRPCAAPG