Gene Information |
Locus tag | Franean1_3791 |
Symbol | |
ID | 5672155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4494953 |
End bp | 4497442 |
Gene Length | 2490 bp |
Protein Length | 829 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242670 |
Product | hypothetical protein |
Protein accession | YP_001508090 |
Protein GI | 158315582 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.805476 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCGTCA GGGGAATGCT GGCGCGGCGC CAGTCCACAT CTCCTCCCCC TGCTGGCGCG AAGCCGGATG CCGGCCCGGG GGCGAGCATC CGGCTTCGGA ACGCGTCTGG CGTCCCGCAG GTCCAGCGCG AGGACGTCCC TGCCGTGACG ACGACCCCGA AGGAGCTCAT TTCGAAGTAC ACGACGCTCC TCATGCTCGA CGAGGCGGGG CTCGGGAAGG ATCTCGCCGC GCGCGTGGTG GAAAGACGGG ACGTCATGCT CACTCACGGG GTGTTCGACA ACCTCAGGAC CACCGACCGC GACGATGTCG CCGAGGAACT GGCCCGGTCT GCCGGCAGCA GGCTCCGGGA GGTGGACGAG GGACTCCGGA TCCGCCTCAT CCGGGAGATG CTCGACACCG TCGTCGACGA GGACGCGGAA AAGCAGGTCA CGGCGGTCTG GCTGAGCTTC GAGAACGAAG GGAAGCTGGG AACGGTCATC ACCAACCAGC GCGCCCTGTG GGATCGCTCG CTCAGCGAGT GCACGCCGCT GAGCGACCGT TTCGCCGCCT ACACGGCAGC GTTCGGCATG GATATCCTGA AGGCAGCCGG CGTCTATCTC GACGAGAACC GAAAGATCAC GGAGAAGGAA GCCAAGGGGT TCGGGCTTAA TCTCGGTGGT GCGGCCGAGA CGGCACCGCC GGAGCAGGCG GACGCCTATC TCGACGAGGT GCGCAAGGCG GCCAGGGTCG TGGTGCAGCT GCAGGCCCTG ACGTATGAGC TGCGGCTGGT GCCTGTCGGC TATCGCAGCA CCCCCGAGCT CCAACAGAAG CTTTCGGCGG ACGCCAGGAC AGACGCGGAG GCCTACAAGG CCGTCACCGC GCCGCCCCCA GACACGCTAC GGAGGTACGG GCAGCCGGAG CTCTTCGATC CCGCCCGGCG CCCCGACTAC CCGCCGACCG GCAAGGAGGA GCCGGAGATG GCGCAGTGGA CGGAGGCGGA GTCACACCAC AGCAAGATCG GGGCGCTGAT CGCCGAGTAC GCCCGAAACT ACCCCGCCGT CTACGCGTCG ATCACGCAGG GAAACATCGG GGAGCTGGCC GAGGCGACGG ACGCAGGCAA GGCCCGCGGA CTGGTGGAGA ACATTCTCCG GAAGACGCTT ACGGCGATAG AAGAGACCAG AAGCAGACTC GGGACGGGTA TCACACACTA CGATCTCGCC CCGATTCAGC AGCAGCTGTT TACGAACACC CTCGCCACGC CGGCCAAAGC GTCCGTCAAC TGGCAGGATC CGCTGTACGC GGCGCTCGGG CGCATGGATC TGGAAAAGCA GAAGGCGAAG GACTTCTGGA CCGACCTGGG ACTCAATATG GTGTCGGCGT TCGCGCTGAT CGCCGCGCCG TTCACCGGAG GGATGACGGC CGCCGTTCTG GTCGGAGCCG GCCTCGCCGT CGGCGCCGGC ATGGCCGCCG CGAGCTGGGA CCGCTATCTC CAGCTCCGCC CACTCGAGAA CGCCGCGATC AGGGACGACC TGTCCTTCGT CCAGAAAGGA GCCGTCGACG CGGCGCTCCT CGAGGCGACG GTCGCGACGG TCGGCGTGTT CCTGGACGCT CTGGGTGTGC ATGGGGACAT GAAGCGGGCC GCCGGCGTCA ACCGGGTGAA GGCACTGGCG GACCTCCACG GGGCGGTCGA GGCGCAGGAG GCCGCCGCGC AGGCGCGCAT GAAGATGCAG GCGGAGGGTC TGAAGGACGC TGGCGCGGCC ACCGCGGGAG CGGCGGCCGC GATCGGGGCC CACGAGCTCG AGGACGCCAT CCCGGAGCCC 
GACATCGAGG TGAGGGCGGG CGGGCTGGAG GTCGATCTGC CCTCGGTCGC AGCCCCGGCA CAACGTAGCG TGATCAGCAC CAGAAGTGTT CAGCGCGCGC CGAACAAGCA GCTCGCAGCC GTCACTGCGA AAATGGCCGC GATCGATACC GCGAGAGATG TTCCGCGCGA CTGGGGAAAC CGCTTCGAGC TGTCGGTGGT GGCGAGCGTT CTCCGCGGCG AGGTCCCGGA GATGTCCGGT GTGGTCCACG CGTTTCAGGC GCAGCACAAC GCGAGTGGAC ACGGGATCGA CATCATCGCC GTCGGGACCG GCTCCAGGGG AAGGCTCAAG TTCTGGCAGA TCGAGTGCAA GTGGGCCGGG CCGGAATCGG GCTACCCGCG GCACCTCGGC GGATCACGCG CCGGCATCCA GACCAGCGCC GGATGGACCA AGGACAACTT CGTCAGATGG TGGGAGGCCG CTCCCCCGGG GGAGAAACGG CAGCTCCTCA ACGCCGTGAA GGCAGCGAAC GGCGGCCGCG CGATCGAGGT GGAGAAGCTG ACGGATCTGA TCAGCAGGGC TGAGGTGATC ATAGCCGCGC CGCTCGGCGC CGGAGCCGCC GGCGTGATGC GCAGGATATG GGGCGAGATG GGTGCGCTCA CGCGGTTCGG CGGCCGGAAG ATGAGCTACC GGGAGTTCCG ACCGAGATGA
Protein sequence | MAVRGMLARR QSTSPPPAGA KPDAGPGASI RLRNASGVPQ VQREDVPAVT TTPKELISKY TTLLMLDEAG LGKDLAARVV ERRDVMLTHG VFDNLRTTDR DDVAEELARS AGSRLREVDE GLRIRLIREM LDTVVDEDAE KQVTAVWLSF ENEGKLGTVI TNQRALWDRS LSECTPLSDR FAAYTAAFGM DILKAAGVYL DENRKITEKE AKGFGLNLGG AAETAPPEQA DAYLDEVRKA ARVVVQLQAL TYELRLVPVG YRSTPELQQK LSADARTDAE AYKAVTAPPP DTLRRYGQPE LFDPARRPDY PPTGKEEPEM AQWTEAESHH SKIGALIAEY ARNYPAVYAS ITQGNIGELA EATDAGKARG LVENILRKTL TAIEETRSRL GTGITHYDLA PIQQQLFTNT LATPAKASVN WQDPLYAALG RMDLEKQKAK DFWTDLGLNM VSAFALIAAP FTGGMTAAVL VGAGLAVGAG MAAASWDRYL QLRPLENAAI RDDLSFVQKG AVDAALLEAT VATVGVFLDA LGVHGDMKRA AGVNRVKALA DLHGAVEAQE AAAQARMKMQ AEGLKDAGAA TAGAAAAIGA HELEDAIPEP DIEVRAGGLE VDLPSVAAPA QRSVISTRSV QRAPNKQLAA VTAKMAAIDT ARDVPRDWGN RFELSVVASV LRGEVPEMSG VVHAFQAQHN ASGHGIDIIA VGTGSRGRLK FWQIECKWAG PESGYPRHLG GSRAGIQTSA GWTKDNFVRW WEAAPPGEKR QLLNAVKAAN GGRAIEVEKL TDLISRAEVI IAAPLGAGAA GVMRRIWGEM GALTRFGGRK MSYREFRPR
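The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the gene sequence ends in TGA, which is consistent with that). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 4494953
END_BP = 4497442
GENE_LENGTH_BP = 2490
PROTEIN_LENGTH_AA = 829


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions both checks pass: 4497442 - 4494953 + 1 = 2490 bp, and 2490 / 3 - 1 = 829 aa.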