Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4896 |
Symbol | |
ID | 5673236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5876429 |
End bp | 5878345 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243751 |
Product | hypothetical protein |
Protein accession | YP_001509167 |
Protein GI | 158316659 |
COG category | [S] Function unknown |
COG ID | [COG5305] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.303518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0951056 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAGCG CAGACGACCT GCCGACCGGC CAGGTCACGG GCACTACCGC GGAGCCGGCA GCCGCGGAGC CGGCAGCCGC GGCTGTGGCG CCTCGGCCGC GGTCGGCGCA GGTCCCACCG AACGACCATG CGGTGCTGAC CGGCCGGCCC CCGCGCACCG AGCTGCCCGC GGAGACCCAG CCGGACGCGG AGACCCAGGC GGACGCGCAG ACCATGACCG CTGCGGAGAC CGAGCCGAGC GCGGAAGCGC CGCCCGCCGC GGAGGCGGAG CCGGGTGAGC AGACCGTGCC CGCCCAGCGG ACCGATTACC CATCGCGGGC CGAGCCGGCG CGGCTGATCG CCCGGTTCCG CCGGGCCCGG GACTGGACCG TCGCGGCCTC GTACCAGTTC TTCCTGGCCG GGACGCTCCT GGCGGTCACC GTCGGTGCCG TGCTGCGGTT CGCCACCCCG AGCCATCTGT GGCTCGACGA GTCGCTCACC GTCGGCATCG CGTCCCGGCC GGTGCCGGAC CTGCTGCAGG CGCTGCGTCA TGACGGGTCG CCCCCGCTCT ACTACCTGCT CTTGCACCTC TGGATCAGCG TGTTCGGTGA CGGGGACATC GCCGTCCGGG CGCTGTCGGG CGTGCTCTCC CTCGCCACAC TCCCGCTGGC CTGGCTCGCC GGGCGGTACG TGGGCCGGGC GGCGACCGCC CTCGGCAACG GCGGGCCGTC GGCCGAGCAG CGGGTCGGGC TCGCCGCGCT CCTGCTGTTC GCCTGCTCGC CGTACGCGAT CCGCTACGGC AGCGAGACGC GGATGTACTC GCTGGTCGTC CTGCTCGTGC TGGTGTTCGG CTTCGCCGCC GTCCGGGCGC TGAACCGGCC GGACCTGCCA CGGCTCGCCG CGCTGACGCT GGCGACGGCC GCCCTGGTCT ACACGCACTA CTGGACGTTC CTGGTGGTGT TCACCGTGGC GGCGTTCCTG CTCCTGCAGG CGCGGCGGCG GGAGCACTAC CGCCGCCCGG CGCTGCGGGC GTTCGTCGCG ATGGCCGCCT CGGCGGTGCT GTTCGCGCCC TGGATGCCGG TGTTCCTCTT CCAGATGCTG CACACGGGGA CACCCTGGGC GCCCCGGGTG CAGGCCCAGG TGCTGCTCGA CACGGTCTTC GACTGGGCCG GCCCGCAGTC GACCGGCGCG CTGCTGGGCA TCATCCTGCT CGGCGGGGCG CTGATCGGGC TGACGGCCCG TCCGCTCGGC GGGGAGCTGC ACGTCAAGAT CTCCGGGCGG GCGCCGGGCA GGTACCTCGC CGCGATCTGG CTCGCGCCGC TGGTGCTGGC GTACTTCGTC AACATGTTCG GCGGCAGCGC CTACGCGGAG CGCTACACCG GGATCGCGCT GCCCGCCTGC CTGCTGCTCG CGGCGCTCGG GATAGCGCAA CTGCCGGTGC ACCGGGCGTT CGTGGCGGTG GTCGTGGTCG CCTCGATCAG CGGCCTGCTC GGTGGGTACC AGCTCGCCCG GACGGAGCGG ACCCAGGCGG GCGAGATCGC CAACCGGATC GCCGACCTCG CCCGGCCGGG GGACGTCGTC GCCTACTGCC CGGACCAGCT CGGCCCGGCC GTGCACCGGG CGATCCAGCG CCGGGGCGGC ATCGACGTCC GGGAGATCGT CTACGCGGAC GAGGCCGGGC CGGCGCTGGT CGACTGGGTC GACTACGCCG ACCGGATGAA ACGCGCGAAC GGTGCGGCCT TCGCGGCCGA GGTCAACGAC CTGGCCGGGC CCGATCACGC CGTCCTGCTC GTCCGGGCGG ACGGGTACCG CTTTCTGGAG GGCGCGTGCG CGGTGCTCTC CGACCAGCTG GCGTCCCTAC GGGACCGGGC GTTGCAGGTC GAGAAGCGCG ATCTTTACGA GGGTGCCTCC CTCGAGCGGT TCTCCACCCT GCGCTGA
|
Protein sequence | MVSADDLPTG QVTGTTAEPA AAEPAAAAVA PRPRSAQVPP NDHAVLTGRP PRTELPAETQ PDAETQADAQ TMTAAETEPS AEAPPAAEAE PGEQTVPAQR TDYPSRAEPA RLIARFRRAR DWTVAASYQF FLAGTLLAVT VGAVLRFATP SHLWLDESLT VGIASRPVPD LLQALRHDGS PPLYYLLLHL WISVFGDGDI AVRALSGVLS LATLPLAWLA GRYVGRAATA LGNGGPSAEQ RVGLAALLLF ACSPYAIRYG SETRMYSLVV LLVLVFGFAA VRALNRPDLP RLAALTLATA ALVYTHYWTF LVVFTVAAFL LLQARRREHY RRPALRAFVA MAASAVLFAP WMPVFLFQML HTGTPWAPRV QAQVLLDTVF DWAGPQSTGA LLGIILLGGA LIGLTARPLG GELHVKISGR APGRYLAAIW LAPLVLAYFV NMFGGSAYAE RYTGIALPAC LLLAALGIAQ LPVHRAFVAV VVVASISGLL GGYQLARTER TQAGEIANRI ADLARPGDVV AYCPDQLGPA VHRAIQRRGG IDVREIVYAD EAGPALVDWV DYADRMKRAN GAAFAAEVND LAGPDHAVLL VRADGYRFLE GACAVLSDQL ASLRDRALQV EKRDLYEGAS LERFSTLR
|
| |