Gene Franean1_1989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1989 
Symbol 
ID5670390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2388960 
End bp2393006 
Gene Length4047 bp 
Protein Length1348 aa 
Translation table11 
GC content76% 
IMG OID641240910 
Producthypothetical protein 
Protein accessionYP_001506332 
Protein GI158313824 
COG category[S] Function unknown 
COG ID[COG4717] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.49043 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTCC GCGCGCTCAG CCTGGATCGC TACGGTGCCT TCGAGGACCG GCGGATCGAG 
TTCGGCCGTG GGCTGACGGT GGTCGTCGGC GCGAACGAGG CCGGCAAGTC GACGACGCTC
GACGCCCTGT CGGACCTGCT GTGGACCTTC CGCGGCACCC GGCGGAGCTT CCAGTTCAGC
CAGGGCGCGC TGAGCCTGAC GGCCGACCTC GAACTGCCCG CGACGCATCC GGCCGAGCTC
CCGCCGCCGT CCCAGGAGAC GGGGCCGGCC CGGCAGGCGC CGCCGGCCAG GGTCGAGGTC
AGGCGGCGCA ACAGCGGTCT GCAGACCGTT GACGACGGCG CGGTCTTCCT CCCCCCATGG
GGTTCCGGTG GCGCCGACGC GCGTCGGCGC TGGCGGCAGG CGTTCGGGCT GTCCCACGCC
GAGCTGCGGG CAGGTGGCGC GAGTGTCTTC GAGGGCACCG GCGACCTGGC CGAGCTCGTC
TTCACCGCGC GCAGCGGGCG GGCGGTCCGC GGCCTGCTCG ACACGCTCGC CGCGGAGATG
GACTCCCTGT ACAAGTCCCA CCGCAACAAC AAGAGCGTCC GGGTCCGCAC CGCGCTGGCG
GACTACGAGC GGCTGGCCGA GCAGGCGGCG TCCGCCATGA CCCGCGCGGC CCATGTCGAC
GATGCCCGCC GGGAGCAGGC GCTGCGCCGC CGCGACGCGC AGGCGGCCGC GGCCGCGACC
CGAGCCGCCG CCGACCGCCG CTCCCGCCTG GAACGGTGGG TGCGCGCAGC CCCACATGCC
CGGGAGCTGG TCGTGCTCCG GGAACGTCAC GCCGCGCTGC TCGCCGCGGG CGTGGCCCTC
ACCGCCGAGG AGACGGCGCT GTTCGACTCC AGCGTGCGGG AGGCCGCGAC GGCCGAGGCC
GACCTCGACC GGCTGACCGC CGATCTGCTC GACCGCCGGG CCGCGCGGGA GGCGCTGACG
ACCGACGAGT CCGTGCTCGC CGACGGCGAG ATGATCACCC GGCTGGAGCA GTCCCGCGTG
GCGCGGCTCG GCGACGGCGC CGACGCCCGC GCGCTCGAGA ACGAGGCCGC GCGCCACGCG
GACTCCGCCC GTGCGCTCCT GCTCGACCTG GCTGGTCCGG GCGACCCGCG CACCGCCGCC
GAGCTCCTGG CCGACCTGCA CGTGCCGCGT GACCTCGCGG CCCAGCTCGA CGCGCTCGCC
GTGACGGTCG GCGAGCTGAC CGACGAGCTG CGCCGCGCCG AGGACGCGCT CGCCGCCGCC
CGCCTGCGCC GCGACGGCAC CGACCAGTCC CCGGGCCACG ACCCGGCGTC GATCAGCCAG
CTGAAGGCGG TAATCGGCGC GATAGCCGGC GAGGGCTCGG CGACCGCGCT GACCAGGGCG
GCCGTCGACG AGGCGGCCCG CGCTGTCCGC GACCGCCGGG AGGCGCTGCA CCGCGCCGGC
GCCCGCGACC CGGACGGCCC GCCGCCCGGG ATGCCCGGGC GCGACGAGAT CCGGCTGGCG
CGCGACCGGC TCGACGCCGC AGAGGTGGCC CTCGCCCGCC GCGAGGAGGA GCTCGTCGGC
GCCACCCGGA ACCTCGAGCT CGCGCAGAGC CGGGTCGCCG ACGCCGACGG CCGCGACCTC
CCGGACCAGG CCACGCTCGA CCGGGCCCGC GCGACCCGCG AGGAGCTCTG GAACCTGCTG
GTCGACGCGG CCGGCGGCCT GCCGCCAACG ACCGGCGGCC GGACTCCCGC CCCGGCCGTT
GGCGGGGTGC CACCAGTCGG CGGCCGGCTG ACGCCGGAGC GGGCTCGCGA GCTGCTGCCG
GTCATCGCCG CCGGAATCGC CCGCGCCGAC GAGGTCGCGG ATCAGCTCAT CCGCCACGCC
GATCTGGCCG CCCGGCGGGC CCAGCTCCAC CGGGACGCCG ACCTCGCGCA CGAACGGGCC
ACCGCGGCAC TCGCCGCCAC CGCGGCCGCC GCCGAGGCAG CCGATCAAGC CCGCGCGGCG
TGGGAGGGCC ACTGGCACAA GCCAGGCCTG GCCGTGCCCT CTCGGGCGGA CGCGGACGAC
GTCCAGCGGG CTGTTGACGA GGCGCACCGG GCGCACGCGG AGCTGCTCGC CGCGGAGACC
CGCATCGCCC GGCTGGGTGC CCAGGCCGAC GCCCAGCGGA CCGCGCTCGC CCAGGCCCTC
GCCGAGGTCG GCCTCGCCCG GGTGGACGCC GACCTCGACA GTCTTCTCAC CTGCGCGCAG
ACCGTGCTCG CGGACGACGA CCTCGCACAG GTCCGCCGGG CTGACGCCGC CCACTTCGGG
CGGGCCGTCG AGGAGGCCGA ACGCGAACGG GACAAGCGGC GCGAGGAGCT GCTCGGCGCC
GAGGGCGACT GGGACAACCT CGTCCGGGCG GCCGGGCTGG CTTCGGTCAC CGACCCGCGG
GGCTGGACCG AACGGCGGGG CGTGCTCGCG CAGGCGGTGG CTCTGCACGA GCAGGCCGAC
CGCGCCGCGC GGGACGCCGA GCGTGCCGCC GGCCGCCATC AGGCGTTCGC CGCGGACATC
CGCGAGCTCT CGGCCCGTCA TGGCCTCCTG CACCGCTCGC GCGGCCAGCG CCCCGACACC
GACGCCAGCG CCGGGGCCGG GAGTGGGGCT GAGGGTGGGG CTGGGAGTGA AACCGGAACT
GGCGGCGAAC GTCTCGCGGC GGGCCCAGTC GCGCCCGACG GGCCGGGCGA TGAGGCCGAC
GAGCTGGCCG ATCTGCTCGA TCAGCTCAGC GCACGGCACC AGGCGAGCAT CGAGAGCCGG
ACCCGCCGCG CGGAGATCGA CCAGGCGGTC GCCACCCTGG GCACCCGGGT CACCGCCGCG
ACGCTGCTGC GGGACAGCGC TGTCGGCCGC CTCGACGAGT TGCGCGCGCA GGTCGTCCTC
GCGCCCGGTC AGCAACTGAC CGACGCGGCG GACCGCGGCC GGGACCTCGC CGGGATCACG
GCGGCGATGA CCGCCGCCGA AGGCCTGCTG CGCGCCGCGG CACCCGGCGA CGAGCTCGAA
GCCGTCGTCC GCGCGCTGGC CGCGAGCACC GACGAGGACC TCGCCGCCGA CCTCGCCGAC
GCGCGCGACG AGCACGAGCG CCGCACGGCC GCGCAGACAG AGGCGTGGAC GGCCGTCGGG
ACGGTCGAAC GGCATCTGCG GGATCTCGAG AGCGGCGCGG CCACCGGCGA GCTGCACGCC
CGTGCGCAGG AGTCCCTGGC GCTCGTAGCC GAGACGGCGG AGCGCTACGT CATCGCGCGC
ATCCAGTACG AGACGCTTGG CCGTGAGCTG GAGTCCTACG AGCGCCGTCA CGCGTCCCCC
CTGCTCGCGG ACGCGGGACG CCTGCTCGAA CGGCTCACCG AGGGCCGCTA CGTCGCCCTG
CGGGCCATCG ATCGCGGCGA CGGCGCCCGG ACGCTGCGGG TCGTCCGCGC CGACGAGGAC
GAGCTCGGCC CGGGCGAGCT TTCCGAGGGC ACGGCCGACC AGGTGTTCCT GGCCCTGCGC
CTGGCGGCGA TCGACCAGCT CCAGCGGGAG CGGACCAGCC GCGGCGAACC CACGCTGCCG
GTCGTGCTCG ACGACGTCCT GATGACGTTC GACGACACGC GCGCCGAGGC GGCACTTCGC
GTGCTGGCCG AGATCGCCGG GCGCTGGCAG ATCATTCTGC TGACGCATCA CGAGCATCTC
ACCGACCTGG CACGAGTGGT GGATGCGCAG CTGCGCGCCG ACGGACCAGG ACAGGCGGAC
GGCGCCTTGA CCCCACCTGG CGACGCCGAG CTGGTCACGA TCAGCTATCT GCCGGGGGCG
AATGTGCTCA CGCCCACCAG GGATGCGGAG CAGATCCGGA CACTGGCCAC GCACGTCGTC
CCGCCCGACC CCGGCGCCGG TGAGCCCGGC GGATCCCCGG CTCTCGCCGT GACGGCAGCG
AGCGGGGGTG GTAACGGAAG CGCGGCCGCC GGCCGGGACG CCGGGAGGAT CCGCGCCTGG
GCCCGCCAGA ACGGATTCGA GGTCGGCGAC CGTGGCCGCA TTCCCCGGGA GATCCTCGAC
GCGTTCGCCG ACGCGCACTC CGCCTGA
 
Protein sequence
MRVRALSLDR YGAFEDRRIE FGRGLTVVVG ANEAGKSTTL DALSDLLWTF RGTRRSFQFS 
QGALSLTADL ELPATHPAEL PPPSQETGPA RQAPPARVEV RRRNSGLQTV DDGAVFLPPW
GSGGADARRR WRQAFGLSHA ELRAGGASVF EGTGDLAELV FTARSGRAVR GLLDTLAAEM
DSLYKSHRNN KSVRVRTALA DYERLAEQAA SAMTRAAHVD DARREQALRR RDAQAAAAAT
RAAADRRSRL ERWVRAAPHA RELVVLRERH AALLAAGVAL TAEETALFDS SVREAATAEA
DLDRLTADLL DRRAAREALT TDESVLADGE MITRLEQSRV ARLGDGADAR ALENEAARHA
DSARALLLDL AGPGDPRTAA ELLADLHVPR DLAAQLDALA VTVGELTDEL RRAEDALAAA
RLRRDGTDQS PGHDPASISQ LKAVIGAIAG EGSATALTRA AVDEAARAVR DRREALHRAG
ARDPDGPPPG MPGRDEIRLA RDRLDAAEVA LARREEELVG ATRNLELAQS RVADADGRDL
PDQATLDRAR ATREELWNLL VDAAGGLPPT TGGRTPAPAV GGVPPVGGRL TPERARELLP
VIAAGIARAD EVADQLIRHA DLAARRAQLH RDADLAHERA TAALAATAAA AEAADQARAA
WEGHWHKPGL AVPSRADADD VQRAVDEAHR AHAELLAAET RIARLGAQAD AQRTALAQAL
AEVGLARVDA DLDSLLTCAQ TVLADDDLAQ VRRADAAHFG RAVEEAERER DKRREELLGA
EGDWDNLVRA AGLASVTDPR GWTERRGVLA QAVALHEQAD RAARDAERAA GRHQAFAADI
RELSARHGLL HRSRGQRPDT DASAGAGSGA EGGAGSETGT GGERLAAGPV APDGPGDEAD
ELADLLDQLS ARHQASIESR TRRAEIDQAV ATLGTRVTAA TLLRDSAVGR LDELRAQVVL
APGQQLTDAA DRGRDLAGIT AAMTAAEGLL RAAAPGDELE AVVRALAAST DEDLAADLAD
ARDEHERRTA AQTEAWTAVG TVERHLRDLE SGAATGELHA RAQESLALVA ETAERYVIAR
IQYETLGREL ESYERRHASP LLADAGRLLE RLTEGRYVAL RAIDRGDGAR TLRVVRADED
ELGPGELSEG TADQVFLALR LAAIDQLQRE RTSRGEPTLP VVLDDVLMTF DDTRAEAALR
VLAEIAGRWQ IILLTHHEHL TDLARVVDAQ LRADGPGQAD GALTPPGDAE LVTISYLPGA
NVLTPTRDAE QIRTLATHVV PPDPGAGEPG GSPALAVTAA SGGGNGSAAA GRDAGRIRAW
ARQNGFEVGD RGRIPREILD AFADAHSA