Gene Information |
Locus tag | Franean1_4681 |
Symbol | |
ID | 5673023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5590958 |
End bp | 5592436 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243538 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001508954 |
Protein GI | 158316446 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3507] Beta-xylosidase |
TIGRFAM ID | |
|
|
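The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5590958
END_BP = 5592436
GENE_LENGTH_BP = 1479
PROTEIN_LENGTH_AA = 492


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 5592436 - 5590958 + 1 = 1479 bp, and 1479 / 3 - 1 = 492 aa, which matches the Gene Length and Protein Length fields.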
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.026787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCA ACCCGATCCT GCCCGGTTTT CACCCGGACC CGTCGATCTG CCGGGTGGGC GACGATTACT ACCTGGTGAC CTCGAGCTTC GAGTACTTCC CCGGGGTGCC GATCTTCCGC AGCACGGACC TCACCGGCTG GGAACAGATC GGCAACGTGC TCGACCGCCC CACCCAGCTG AACGTGACCC CGGGCCTGGA GTCGGCCAGC ACAGGGATCT TCGCGCCGAC CCTGCGCCAC CATGACGGCA GATTCTGGCT CGCCACGACG AACTTCACCG ACGTCCGTAA GGGTCACCTC ATCGTCCAGG CGGCCGACCC GGCCGGCGCA TGGACCGACC CGGTCCACAC AGGCGGGGGA ACGGTCGGGA TTGACCCCGA CCTGGCGTGG GACGAGAACG GTACCTGTCA TCTGACGTGG TGCTTTCCCG GGCAGATCAT GCAGGCCGCC GTCGACCCGG AGAGCGGGAG GCTGCTCTCC GAGCCCCGTG GGCTGTGGAG CGGGACGGGG CGTGCCCACC CTGAAGGCCC GCACCTGTTC AGCCGGGAAG GCTGGTGGTA CCTGGTCATC GCCGAGGGCG GCACCGACGG TGGCCACGCC GTGTCGATCG CGCGTTCCCG CTCGATCACC GGACCCTTCA CCGGCAACCC CGCCAATCCG ATCCTCACCA GGAGCGGCAC CGAACACCCG GTCCAGAGCA CCGGCCACGC CGACTTCGTC GAACTTCCCG GGGGTGAGTG GGCGATGGTT CACCTGGGGG TCCGCCCGCG CGGCACGTTC CCCAAGTTCC ACGTCAACGG CCGGGAGACC TTCCTGACCG GCATTACCTG GGCCGACGGC TGGCCGATCG TGGTGGAGGA CCGGTTCACG GTGCCAGTCC GGGACAACTC CTTCGTCGAC GAGTTCCGCA CGCCGACACT GCATCCCCGG TGGGTCTCCC CGGGGACCGA CCCACGGACT TTCACCCGCC ACCGACCCGG CGGAGTCGTC CTGGCCGCCG GCCGGGCACC CGACGCGGGC GAGGCCAGGC GCCTGCTCGC TGTGCGCGCC CAGGACCCGC AATGGCAGGT AACCGCGGTC ATCCCCGACG GCGACGCCTG CCTGACCGTC CGGATGGACG ACGCCCACTG GGTCGCCGTC GAACGCCGTG GGGAGATGCT GGCGGCCAGG ATGGTGCTCG GCCCACTTGA CCAGACCCTC GCCACCGCCG CCGGGATCGG CCCGGGCGAC GCGCTCGCCG TCCGCGCCGT GACGCATGCC GAGGCCGCCA GCTTCCGTGC CGGCCCCGAC CAGCTCGAGC TGGGCCACCT CACCGACGGC GAGTTCCGGC TGCTCGCCAC GGTCGACGGG CGCTACCTCT CGACCGAGGT CGCCGGCGGC TTCACCGGGC GCGTCGTCGG AGTCGAGGCG ATCGGCACCG ACGCCACCCT GTCACGATTC GAATACCTGG CCTTGGATGC CGCCGCGCCC CAGGGCTAG
|
Protein sequence | MNANPILPGF HPDPSICRVG DDYYLVTSSF EYFPGVPIFR STDLTGWEQI GNVLDRPTQL NVTPGLESAS TGIFAPTLRH HDGRFWLATT NFTDVRKGHL IVQAADPAGA WTDPVHTGGG TVGIDPDLAW DENGTCHLTW CFPGQIMQAA VDPESGRLLS EPRGLWSGTG RAHPEGPHLF SREGWWYLVI AEGGTDGGHA VSIARSRSIT GPFTGNPANP ILTRSGTEHP VQSTGHADFV ELPGGEWAMV HLGVRPRGTF PKFHVNGRET FLTGITWADG WPIVVEDRFT VPVRDNSFVD EFRTPTLHPR WVSPGTDPRT FTRHRPGGVV LAAGRAPDAG EARRLLAVRA QDPQWQVTAV IPDGDACLTV RMDDAHWVAV ERRGEMLAAR MVLGPLDQTL ATAAGIGPGD ALAVRAVTHA EAASFRAGPD QLELGHLTDG EFRLLATVDG RYLSTEVAGG FTGRVVGVEA IGTDATLSRF EYLALDAAAP QG
|
| |
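The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks might be joined, how a GC fraction could be computed for comparison with the GC content field, and how the start and stop codons could be checked. The helper names are hypothetical. The start codon set is an assumption covering common bacterial starts only, not the full set allowed by translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def looks_like_complete_cds(
    seq: str,
    start_codons=("ATG", "GTG", "TTG"),  # assumption: common bacterial starts
    stop_codons=("TAA", "TAG", "TGA"),   # stop codons in translation table 11
) -> bool:
    """True if length is a multiple of 3 with a start and a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in start_codons
        and seq[-3:] in stop_codons
    )


if __name__ == "__main__":
    # First two blocks of the gene sequence field, used as a small example.
    example = clean_sequence("ATGAACGCCA ACCCGATCCT")
    print(len(example), round(gc_fraction(example), 2))
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the reported GC content. The sequence as listed begins with ATG and ends with TAG, which suggests it is shown in coding orientation even though the gene lies on the minus strand.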