Gene Franean1_1060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1060 
Symbol 
ID5669474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1246309 
End bp1250343 
Gene Length4035 bp 
Protein Length1344 aa 
Translation table11 
GC content78% 
IMG OID641239989 
Producthypothetical protein 
Protein accessionYP_001505422 
Protein GI158312914 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.362047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCGG CGTTCGACGG CAGCGGCTAC CGGCGACGGG TCCTGGCGCC GCTGCGTGCC 
CGGACCCCGG TGGACACGGC CGACCCCTAC CTGGTCGCGG ACCTCGATCC GCTGCTGGAA
CACACCGACT CCGAGGTCGC CGCGCAGCTC GCCCGGGTGA TGGCGTTCCT GCAGCGTGAG
CGCAACTCGG CGAAGTACGC GGCGCTGGCC ACCGAGCTGG TGCGCCGCCG CGGCGAGTGG
GAGGCGCCGC TGCTCGACGG CGACGCCCGG GCCCGCCTGC GGCACACCGT CCTCGATGCC
CGGCGCAACG GCGACGCCGA GCGGCTCGCC AAGGTCGACG GCTACCTCGT CACACTGCGT
GATCGTTTCG GTGGCATCCC GGCCTCCAGG GTCGCCGGCC TGCGCCGCCT CGCGGCCGCG
GCCGGGGTGA CCGGGGCCGA GTTCGACGCC CGGCTGGGCC GCGAGGTCAT CATCGCGGAC
GGCGGCGGCG CCGGAGTCGA GGCGCTCGCC CCCGAGGTCC GCAGCCAGAT CCGGCAGCGG
CTGGAGGACC TGCGCGTCCT GCGCGGCGGA GACCGGGCCG GCACCGCCTC CCTGTGGGAC
TTCCTCGGTC TGCCGCCCGA CGCCGGGCCG GAGCGCATCC GCTCCGCGTG GGAGGCGGTG
GCGGCCGGCA ACGCCCGACG CCCGCACGAC CGCGAGAAGA CGCTGACGGC CGACCTGCTG
GCCATGGTGC GCTCCCGGCT GGTCGAGGGT GACCCGGCGG CGTACACCGC GGGTCTGCTC
GCCGATGTCG CCGACGAGCT CCGCCCGATC GTCGAGGAGC ACGTCGTGCT CGACGGCGAG
CTGACGGCAG TCGCCTACGA GGGGCTCGTC CGGGCCGCCC TGGCCGCCGG CCGCGGGCTG
GGCGCCGAGC AGGCGAAGAC GGTCATCCTG GGCATCGCCC GCAACCTGGG CGCCGCGGTC
AGCACCGGCG GCGCCGTCGA CTACGTGCTG TGCCCCGGCT GTGGACGTCC CGAGCCGGTC
GGCGGCTCGC GGACCTGCCG GTACTGCGAC GCGGGGCTCT ACACCACCTG CCCCGGGTGC
GCCTCGCTCA CCGAGGCCGC CGCCGTGACC TGCCGGCGCT GCGGGTACAG CCTGCGCCAG
GTCCGGGCCG CCGGCGACGC GCTCGCCGCC GTGCGCCAGG CGCTGGAGGC CGGCCGCCCC
CGCGAGGCGA GCGACGGCCT CGCCCGGGTG CGCGCGGCCG TCGCGGCCGC CGGCTCGGGG
GCCGGCAACG GGACGGCGGA GGCCGCCGAC GAGCTGGAGT CGCTCGTCCG GGCCGGGCTC
GGCGCCGCCG AGGCGGGCTG GCGGGCACTG GCGGAGGAAC GCTCCACCCT GCGCTCCGAC
GCCGCGGTGG AACGCGCCCG CTGGCTGGTC GCCCGCGCGG CGGACGTCCC CGGGCCGGAC
GGCCGCCCGC CGGCCGAGGT GCTCGGCGAG CTGACCGCGC AACAGGCCGT CATCCGGCGC
CGGGTCGAGG CCGCCCGCGG CCTGCCGCCC GAGCAGCAGG AGGCCGCGCT GGTCGCCGTG
CTCGCCACCG CCGTCGACAG CGCGGACGCG CTGCGCGCTC TCGCTGCTCT GCCGCTGCAG
CCACCGACCG ATCTGACCTC CGTGCTGGCC GACGACGCGG TCCTGCTGCG CTGGCGGCCG
TCCGCCTCGG CCGGCCCGGT CACCTACCGG GTGGAGCGGG TCGCCGTCGA TCCCGGCTCC
GGGCAGCTCA CCCGGCGCGG CCTGGGCACC ACCAGCTCCA CCGAGCTCGC CGACGCGGGC
GCCCCACCGT GGACGCCGGT CCGGCACGAG GTGACCGCGC TCTCCGGCGA GCGGCGCTCC
TGGCCGGTCA GCACCGCGCC GGTCATCGCC GTGCGGGACG TCGCCGACCT ACGGGCCGAG
GCGACCCCGA CCGGGGTCCG CCTCACCTGG CGGCCGAGCG GGCCGTCCGA CACCGTGACG
ATCGAGCGCA CGGTCGATCC CGACTCGTCG GTCTCCGCCC CGCCGCGCCG GGCCCGGGTG
ACCGGCGGGA GCTTCCTCGA CTCCGACGTG CTGCCCGGCG TGGGCTACCG CTACCGCGCG
TTCGTCGAGT ACACCGACGT CGACGGCAGC GCCGCCCGCA CCTCGGGGTC GCGGGCCGAA
TTCGGCCTGC TCACCCGGCC GCGACCGGTC ACCGACCTGG TCGTCGGCGC CGAGGACGGG
CAGGTCGCCC TGCGGTGGAC GCCGCGTTCC GGGGCCGAGG TGCGGGTCTA CGCGACGGCC
GTCCCGCCGT CCGGCGGCGC CGTCGGCGCT CTCCACCCGG GTGGTCCCGG TGCGGACGGG
TCGGGCTCCG GTGCGAACGG GGCCGGCGGG TATGGCGCGG GTGCGTACGG TGCCGGTGGG
CAGGGTGCCG GTGGGCAGGG TGCCGGTGGG CACGGCACCG GGGTGCTTGG CGCCGGCGCG
GTGTCGGTCG GGGCGGGGGA GTTCGGGCGG GGGCCCGAGC CGAGCTCCGG CCCGCTGGCG
CTGCTGGGCG GCGAGGGCGC CGAGGTCCCG CTCGCCGCGC TGACACCACC GCTGCGCCTG
GTCGGCGCGA GCCGGCAGGG TCACCTGCGG GACGCAGCGG TTCCGCTTGC TCCCGGCACC
GGCGAGCTGA TCTACACCCC GGTCACGGTC GTCGGTGGTC TTGGTGTGCT CGGCCGCTCG
GCTCCGCACC GCTTCCCGGT GGTGACGCAC GACGTCTCCG AGTTCACCGC CGGCATCGCC
GACTTCCCGA CCGGCATCGC CGACTTCCCG ACCGGCATCG CCGACTTCCC GACCGGCAAC
GGTGGTCACG GTCCCGGCAA CGGTGGTCAC GATCCCGCCA GCGCCGGGCT CGGTCCCGCG
CAGTCCGCCC CGCCCGGCCA CGGGGGTACC GCTCCCGGCC ATGGCGGTGG CCCGCCGGCC
GGCGGGCCAC CGCTGCCCGC GGTGTCCGCG GGCGCCCGGC CGGTCATGAT CGACGTCATG
CCGTCCCCGG AGACCGTCCG GGCGCCCGGG GGCGCTCCGC TCGTAGCTCC CGTACCTGTC
GGTCCGTCGT CCCCAGGTGG TCCGTCGTCC CCGCTCGGTC CGTCCTCTCC GATCGGTCCG
CCTGTATCGG GTGCCCCGCC GCTTCCGGGT GCCCCGCCGT TTTCGGGTGC CCCGCCGCTT
TCGGGTGCGC AGCCCGCTCC CGGCGCGTCG CCGGTGCCGG GCCCTCCGCC GGCGACAGGT
GCCCTCCCGA TTCCCGGTGG TCCGGCGGGG CCCGGCGGAC CGAGCGCCGG AGGCGGCGCG
CCGGGGACTG GCGGGGTCCC TGTGGGCCCG CCGACGGTGG GAATGCCGCT GCCCGTGGCG
TCGGACGAGT CCGGGGCGGA CCAGCGTCCG GAACACGAGG AGTGGTCAGC CGCCGTCCCG
CGGCCCGCGG CTGACCCGGT TGACATGCCG GACCCGATGG GCGCCGCCGC ACCCGGGTCG
GGACCGGCGG CCGCCACGTC CGGGGCGCAC CCCGGTATGG GGACGGGTCC AGCGACGCCC
GACGTCGGCG GGCACTCCGT GGCGCCGGCC GTTCCCGGCG CACCCGGCCC CGGCACGCTC
GCCGCGCCGC CCCCCGCGCC CGCCCAGACC CAGGGCCACG CCCTGGTTCC CGGGCCGCCC
GCGCCGCTCG GTTCTCCGGC CCAGCTCCAG CCGCCGGTGC CGGTGCCACC GCCGGCGGAG
CTGACGTCCG TCACCTATTC GGTGTCGAAG GCGGGGTGGC GCCGGCGGAC CCTGCGCGTC
CAGGTGCGGG CGACCGGGCC GGCCCCCAGG CTGGTGCTGT TGGCGCGGCC CGGCGAGGAG
CCGCCCGGGT CGCCGGCCGA GGGACAGGTG CTGGCCGAGT TGCAGCCGGC TCCGAGCTCG
GGGTCGTGGA CCATGGAGGT CACCCTCGAG GGGGCGCAGC TCCCGTGGGG GGTCCGGCTC
CTGCCGGTGG TCACACCGGG TGCTCCGGCG GTGTGGATCG ACCATCCTGA GGACCCGATG
CTCGTCGTCC GCTGA
 
Protein sequence
MSAAFDGSGY RRRVLAPLRA RTPVDTADPY LVADLDPLLE HTDSEVAAQL ARVMAFLQRE 
RNSAKYAALA TELVRRRGEW EAPLLDGDAR ARLRHTVLDA RRNGDAERLA KVDGYLVTLR
DRFGGIPASR VAGLRRLAAA AGVTGAEFDA RLGREVIIAD GGGAGVEALA PEVRSQIRQR
LEDLRVLRGG DRAGTASLWD FLGLPPDAGP ERIRSAWEAV AAGNARRPHD REKTLTADLL
AMVRSRLVEG DPAAYTAGLL ADVADELRPI VEEHVVLDGE LTAVAYEGLV RAALAAGRGL
GAEQAKTVIL GIARNLGAAV STGGAVDYVL CPGCGRPEPV GGSRTCRYCD AGLYTTCPGC
ASLTEAAAVT CRRCGYSLRQ VRAAGDALAA VRQALEAGRP REASDGLARV RAAVAAAGSG
AGNGTAEAAD ELESLVRAGL GAAEAGWRAL AEERSTLRSD AAVERARWLV ARAADVPGPD
GRPPAEVLGE LTAQQAVIRR RVEAARGLPP EQQEAALVAV LATAVDSADA LRALAALPLQ
PPTDLTSVLA DDAVLLRWRP SASAGPVTYR VERVAVDPGS GQLTRRGLGT TSSTELADAG
APPWTPVRHE VTALSGERRS WPVSTAPVIA VRDVADLRAE ATPTGVRLTW RPSGPSDTVT
IERTVDPDSS VSAPPRRARV TGGSFLDSDV LPGVGYRYRA FVEYTDVDGS AARTSGSRAE
FGLLTRPRPV TDLVVGAEDG QVALRWTPRS GAEVRVYATA VPPSGGAVGA LHPGGPGADG
SGSGANGAGG YGAGAYGAGG QGAGGQGAGG HGTGVLGAGA VSVGAGEFGR GPEPSSGPLA
LLGGEGAEVP LAALTPPLRL VGASRQGHLR DAAVPLAPGT GELIYTPVTV VGGLGVLGRS
APHRFPVVTH DVSEFTAGIA DFPTGIADFP TGIADFPTGN GGHGPGNGGH DPASAGLGPA
QSAPPGHGGT APGHGGGPPA GGPPLPAVSA GARPVMIDVM PSPETVRAPG GAPLVAPVPV
GPSSPGGPSS PLGPSSPIGP PVSGAPPLPG APPFSGAPPL SGAQPAPGAS PVPGPPPATG
ALPIPGGPAG PGGPSAGGGA PGTGGVPVGP PTVGMPLPVA SDESGADQRP EHEEWSAAVP
RPAADPVDMP DPMGAAAPGS GPAAATSGAH PGMGTGPATP DVGGHSVAPA VPGAPGPGTL
AAPPPAPAQT QGHALVPGPP APLGSPAQLQ PPVPVPPPAE LTSVTYSVSK AGWRRRTLRV
QVRATGPAPR LVLLARPGEE PPGSPAEGQV LAELQPAPSS GSWTMEVTLE GAQLPWGVRL
LPVVTPGAPA VWIDHPEDPM LVVR