Gene Information |
Locus tag | Franean1_1989 |
Symbol | |
ID | 5670390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2388960 |
End bp | 2393006 |
Gene Length | 4047 bp |
Protein Length | 1348 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641240910 |
Product | hypothetical protein |
Protein accession | YP_001506332 |
Protein GI | 158313824 |
COG category | [S] Function unknown |
COG ID | [COG4717] Uncharacterized conserved protein |
TIGRFAM ID | |
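
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive 1-based coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2388960
end_bp = 2393006
gene_length_bp = 4047
protein_length_aa = 1348

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS should contain a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is a stop codon and is not translated.
expected_protein_length = codons - 1

print(span == gene_length_bp)
print(remainder == 0)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions, the reported gene length and protein length agree with the coordinates.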
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.49043 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGTCC GCGCGCTCAG CCTGGATCGC TACGGTGCCT TCGAGGACCG GCGGATCGAG TTCGGCCGTG GGCTGACGGT GGTCGTCGGC GCGAACGAGG CCGGCAAGTC GACGACGCTC GACGCCCTGT CGGACCTGCT GTGGACCTTC CGCGGCACCC GGCGGAGCTT CCAGTTCAGC CAGGGCGCGC TGAGCCTGAC GGCCGACCTC GAACTGCCCG CGACGCATCC GGCCGAGCTC CCGCCGCCGT CCCAGGAGAC GGGGCCGGCC CGGCAGGCGC CGCCGGCCAG GGTCGAGGTC AGGCGGCGCA ACAGCGGTCT GCAGACCGTT GACGACGGCG CGGTCTTCCT CCCCCCATGG GGTTCCGGTG GCGCCGACGC GCGTCGGCGC TGGCGGCAGG CGTTCGGGCT GTCCCACGCC GAGCTGCGGG CAGGTGGCGC GAGTGTCTTC GAGGGCACCG GCGACCTGGC CGAGCTCGTC TTCACCGCGC GCAGCGGGCG GGCGGTCCGC GGCCTGCTCG ACACGCTCGC CGCGGAGATG GACTCCCTGT ACAAGTCCCA CCGCAACAAC AAGAGCGTCC GGGTCCGCAC CGCGCTGGCG GACTACGAGC GGCTGGCCGA GCAGGCGGCG TCCGCCATGA CCCGCGCGGC CCATGTCGAC GATGCCCGCC GGGAGCAGGC GCTGCGCCGC CGCGACGCGC AGGCGGCCGC GGCCGCGACC CGAGCCGCCG CCGACCGCCG CTCCCGCCTG GAACGGTGGG TGCGCGCAGC CCCACATGCC CGGGAGCTGG TCGTGCTCCG GGAACGTCAC GCCGCGCTGC TCGCCGCGGG CGTGGCCCTC ACCGCCGAGG AGACGGCGCT GTTCGACTCC AGCGTGCGGG AGGCCGCGAC GGCCGAGGCC GACCTCGACC GGCTGACCGC CGATCTGCTC GACCGCCGGG CCGCGCGGGA GGCGCTGACG ACCGACGAGT CCGTGCTCGC CGACGGCGAG ATGATCACCC GGCTGGAGCA GTCCCGCGTG GCGCGGCTCG GCGACGGCGC CGACGCCCGC GCGCTCGAGA ACGAGGCCGC GCGCCACGCG GACTCCGCCC GTGCGCTCCT GCTCGACCTG GCTGGTCCGG GCGACCCGCG CACCGCCGCC GAGCTCCTGG CCGACCTGCA CGTGCCGCGT GACCTCGCGG CCCAGCTCGA CGCGCTCGCC GTGACGGTCG GCGAGCTGAC CGACGAGCTG CGCCGCGCCG AGGACGCGCT CGCCGCCGCC CGCCTGCGCC GCGACGGCAC CGACCAGTCC CCGGGCCACG ACCCGGCGTC GATCAGCCAG CTGAAGGCGG TAATCGGCGC GATAGCCGGC GAGGGCTCGG CGACCGCGCT GACCAGGGCG GCCGTCGACG AGGCGGCCCG CGCTGTCCGC GACCGCCGGG AGGCGCTGCA CCGCGCCGGC GCCCGCGACC CGGACGGCCC GCCGCCCGGG ATGCCCGGGC GCGACGAGAT CCGGCTGGCG CGCGACCGGC TCGACGCCGC AGAGGTGGCC CTCGCCCGCC GCGAGGAGGA GCTCGTCGGC GCCACCCGGA ACCTCGAGCT CGCGCAGAGC CGGGTCGCCG ACGCCGACGG CCGCGACCTC CCGGACCAGG CCACGCTCGA CCGGGCCCGC GCGACCCGCG AGGAGCTCTG GAACCTGCTG GTCGACGCGG CCGGCGGCCT GCCGCCAACG ACCGGCGGCC GGACTCCCGC CCCGGCCGTT GGCGGGGTGC CACCAGTCGG CGGCCGGCTG ACGCCGGAGC GGGCTCGCGA GCTGCTGCCG 
GTCATCGCCG CCGGAATCGC CCGCGCCGAC GAGGTCGCGG ATCAGCTCAT CCGCCACGCC GATCTGGCCG CCCGGCGGGC CCAGCTCCAC CGGGACGCCG ACCTCGCGCA CGAACGGGCC ACCGCGGCAC TCGCCGCCAC CGCGGCCGCC GCCGAGGCAG CCGATCAAGC CCGCGCGGCG TGGGAGGGCC ACTGGCACAA GCCAGGCCTG GCCGTGCCCT CTCGGGCGGA CGCGGACGAC GTCCAGCGGG CTGTTGACGA GGCGCACCGG GCGCACGCGG AGCTGCTCGC CGCGGAGACC CGCATCGCCC GGCTGGGTGC CCAGGCCGAC GCCCAGCGGA CCGCGCTCGC CCAGGCCCTC GCCGAGGTCG GCCTCGCCCG GGTGGACGCC GACCTCGACA GTCTTCTCAC CTGCGCGCAG ACCGTGCTCG CGGACGACGA CCTCGCACAG GTCCGCCGGG CTGACGCCGC CCACTTCGGG CGGGCCGTCG AGGAGGCCGA ACGCGAACGG GACAAGCGGC GCGAGGAGCT GCTCGGCGCC GAGGGCGACT GGGACAACCT CGTCCGGGCG GCCGGGCTGG CTTCGGTCAC CGACCCGCGG GGCTGGACCG AACGGCGGGG CGTGCTCGCG CAGGCGGTGG CTCTGCACGA GCAGGCCGAC CGCGCCGCGC GGGACGCCGA GCGTGCCGCC GGCCGCCATC AGGCGTTCGC CGCGGACATC CGCGAGCTCT CGGCCCGTCA TGGCCTCCTG CACCGCTCGC GCGGCCAGCG CCCCGACACC GACGCCAGCG CCGGGGCCGG GAGTGGGGCT GAGGGTGGGG CTGGGAGTGA AACCGGAACT GGCGGCGAAC GTCTCGCGGC GGGCCCAGTC GCGCCCGACG GGCCGGGCGA TGAGGCCGAC GAGCTGGCCG ATCTGCTCGA TCAGCTCAGC GCACGGCACC AGGCGAGCAT CGAGAGCCGG ACCCGCCGCG CGGAGATCGA CCAGGCGGTC GCCACCCTGG GCACCCGGGT CACCGCCGCG ACGCTGCTGC GGGACAGCGC TGTCGGCCGC CTCGACGAGT TGCGCGCGCA GGTCGTCCTC GCGCCCGGTC AGCAACTGAC CGACGCGGCG GACCGCGGCC GGGACCTCGC CGGGATCACG GCGGCGATGA CCGCCGCCGA AGGCCTGCTG CGCGCCGCGG CACCCGGCGA CGAGCTCGAA GCCGTCGTCC GCGCGCTGGC CGCGAGCACC GACGAGGACC TCGCCGCCGA CCTCGCCGAC GCGCGCGACG AGCACGAGCG CCGCACGGCC GCGCAGACAG AGGCGTGGAC GGCCGTCGGG ACGGTCGAAC GGCATCTGCG GGATCTCGAG AGCGGCGCGG CCACCGGCGA GCTGCACGCC CGTGCGCAGG AGTCCCTGGC GCTCGTAGCC GAGACGGCGG AGCGCTACGT CATCGCGCGC ATCCAGTACG AGACGCTTGG CCGTGAGCTG GAGTCCTACG AGCGCCGTCA CGCGTCCCCC CTGCTCGCGG ACGCGGGACG CCTGCTCGAA CGGCTCACCG AGGGCCGCTA CGTCGCCCTG CGGGCCATCG ATCGCGGCGA CGGCGCCCGG ACGCTGCGGG TCGTCCGCGC CGACGAGGAC GAGCTCGGCC CGGGCGAGCT TTCCGAGGGC ACGGCCGACC AGGTGTTCCT GGCCCTGCGC CTGGCGGCGA TCGACCAGCT CCAGCGGGAG CGGACCAGCC GCGGCGAACC CACGCTGCCG GTCGTGCTCG ACGACGTCCT GATGACGTTC GACGACACGC GCGCCGAGGC GGCACTTCGC GTGCTGGCCG 
AGATCGCCGG GCGCTGGCAG ATCATTCTGC TGACGCATCA CGAGCATCTC ACCGACCTGG CACGAGTGGT GGATGCGCAG CTGCGCGCCG ACGGACCAGG ACAGGCGGAC GGCGCCTTGA CCCCACCTGG CGACGCCGAG CTGGTCACGA TCAGCTATCT GCCGGGGGCG AATGTGCTCA CGCCCACCAG GGATGCGGAG CAGATCCGGA CACTGGCCAC GCACGTCGTC CCGCCCGACC CCGGCGCCGG TGAGCCCGGC GGATCCCCGG CTCTCGCCGT GACGGCAGCG AGCGGGGGTG GTAACGGAAG CGCGGCCGCC GGCCGGGACG CCGGGAGGAT CCGCGCCTGG GCCCGCCAGA ACGGATTCGA GGTCGGCGAC CGTGGCCGCA TTCCCCGGGA GATCCTCGAC GCGTTCGCCG ACGCGCACTC CGCCTGA
|
Protein sequence | MRVRALSLDR YGAFEDRRIE FGRGLTVVVG ANEAGKSTTL DALSDLLWTF RGTRRSFQFS QGALSLTADL ELPATHPAEL PPPSQETGPA RQAPPARVEV RRRNSGLQTV DDGAVFLPPW GSGGADARRR WRQAFGLSHA ELRAGGASVF EGTGDLAELV FTARSGRAVR GLLDTLAAEM DSLYKSHRNN KSVRVRTALA DYERLAEQAA SAMTRAAHVD DARREQALRR RDAQAAAAAT RAAADRRSRL ERWVRAAPHA RELVVLRERH AALLAAGVAL TAEETALFDS SVREAATAEA DLDRLTADLL DRRAAREALT TDESVLADGE MITRLEQSRV ARLGDGADAR ALENEAARHA DSARALLLDL AGPGDPRTAA ELLADLHVPR DLAAQLDALA VTVGELTDEL RRAEDALAAA RLRRDGTDQS PGHDPASISQ LKAVIGAIAG EGSATALTRA AVDEAARAVR DRREALHRAG ARDPDGPPPG MPGRDEIRLA RDRLDAAEVA LARREEELVG ATRNLELAQS RVADADGRDL PDQATLDRAR ATREELWNLL VDAAGGLPPT TGGRTPAPAV GGVPPVGGRL TPERARELLP VIAAGIARAD EVADQLIRHA DLAARRAQLH RDADLAHERA TAALAATAAA AEAADQARAA WEGHWHKPGL AVPSRADADD VQRAVDEAHR AHAELLAAET RIARLGAQAD AQRTALAQAL AEVGLARVDA DLDSLLTCAQ TVLADDDLAQ VRRADAAHFG RAVEEAERER DKRREELLGA EGDWDNLVRA AGLASVTDPR GWTERRGVLA QAVALHEQAD RAARDAERAA GRHQAFAADI RELSARHGLL HRSRGQRPDT DASAGAGSGA EGGAGSETGT GGERLAAGPV APDGPGDEAD ELADLLDQLS ARHQASIESR TRRAEIDQAV ATLGTRVTAA TLLRDSAVGR LDELRAQVVL APGQQLTDAA DRGRDLAGIT AAMTAAEGLL RAAAPGDELE AVVRALAAST DEDLAADLAD ARDEHERRTA AQTEAWTAVG TVERHLRDLE SGAATGELHA RAQESLALVA ETAERYVIAR IQYETLGREL ESYERRHASP LLADAGRLLE RLTEGRYVAL RAIDRGDGAR TLRVVRADED ELGPGELSEG TADQVFLALR LAAIDQLQRE RTSRGEPTLP VVLDDVLMTF DDTRAEAALR VLAEIAGRWQ IILLTHHEHL TDLARVVDAQ LRADGPGQAD GALTPPGDAE LVTISYLPGA NVLTPTRDAE QIRTLATHVV PPDPGAGEPG GSPALAVTAA SGGGNGSAAA GRDAGRIRAW ARQNGFEVGD RGRIPREILD AFADAHSA
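
The sequences above are displayed in space-separated blocks of ten characters and wrapped across lines. Before using them in a tool, it is usually necessary to strip that whitespace. The sketch below shows one way to do so and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence shown above.
prefix = clean_sequence("ATGCGGGTCC GCGCGCTCAG CCTGGATCGC")
prefix_gc = gc_fraction(prefix)

print(len(prefix), round(prefix_gc, 3))
```

Passing the full gene sequence field to the same two functions would give a value to compare against the reported GC content, subject to how the database rounds that figure.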