Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1657 |
Symbol | |
ID | 5670059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1980773 |
End bp | 1982260 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641240575 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001506001 |
Protein GI | 158313493 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0958307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.376991 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGCGG CTCGCCCCAT GAGGAATCCC GCCGCCCGGC CGCGCACCGT CGAATACGGC TTCGGCGGCG ACGCCGCCGC TGGCGCGGAC ACCGATCACC AGACATCGAT TCACCCGGTG ACCACTCATC CGGAGACCAA TCACCCGGAG ACCAATCACC CGGCGAGCGC TCACCGGACC ACGACTCGCC CGGCCACGAG TCACTCAACC GCGAGGCGCC CGGCCGCGGC ACGCTCGGCC GGCGCGCGCT CTAGCGGCGC GCACGGATCG CGGCGGCCGC CGCTCGATGG CCGGCTCGCC CGCACGGCGT TCGCGGTCAC CGCGGCGCTG ACGCTACTCG GTGGGGTCGG TCTCGGCCGG GGCCTCGCCG GCCCCGACGG CTCCGACTCC GCGGTGGACG CGCTCGCGGC GACCGTGGCC GGCCGCGTGC CCCCGCCCGC CGCCGTCCAC ACGACCGGGC CCACCGCCAC CGTCACCGCA CCCCCACGGC CGGCTGCCTC CGGGCTCACC GCGGTACAGG TGCCCACGGC CGCGCTCACA AGCGCTCCGC CGCCCGCCGC CGCGCAGGCC CTGCCGGTGG CGGCGCCGGC GGCCGCCGCC GCGGCCGCGG GGAACCCGTT CGCCGGCGCC CGGTTCTACA TCGATCCCGC GGACCAGGTC GCCGCCGCGA TCAACGCGCT GCGCGGCGGG AATCCCTCCG CCATCGCTGC GCTGGAGAAG ATTCTGCGTG GCGCGCACGC GGACTGGTTC GGATATGCCG ATCCGGCCAC GACCCGGCGC AACGTGGCCG GCCGGGCCAG CACCATCAGG GGTAACGGCG CGCTGCCCGT CTTCGTGGCC TACGCGATTC CGAACCGCGA CTGCGGAAGC TATTCCGCCG GCGGCGCCGG CGGCGCGCAG GGTTATCGCG ACTGGATCGC CGCCTTTGCC GCCGGGCTCG CCGGCGGGCC GGCCGCGGTC GTGCTCGAGC CCGACGCCAT CGCCCAGATC GATTGCCTCT CCCCCGCCGA CCAGCAGACG CGCTACGGGA TGCTGTCGAA CGCGGTCGAC GTCCTGAACG CCGCCGGGGC GACCGTCTAC CTCGACGCCG GCAACGCCGG CTGGCACAGC GCCGCCACCA TCGCCGCCCG GCTGAAGTCG GCGGGCGTCG ACCGGGCGCG CGGATTCGCA CTGAACGTGT CGAACTTCGG TACCACCGCC AGCGAAGTCG CCTTCGGTGA CGCGGTCAAC GCCGCGCTGG GCGGCGGAGC CCACTTCGTC GTGGACACCA GTCGCAACGG GCTGGGCCCG GCGCCGGACA ACGCCTGGTG CAACCCGCCC GGCCGCGCGC TCGGAACGCC GCCCACCGCC GCGACGGGCG ACAGCGACGT CGACGCATTC TTCTGGGTGA AGATCCCCGG GGAGTCGGAC GGCACCTGCA ACGGCGGCCC CGCCGCCGGC CAGTTCTGGC CGGACTACGC CGTCGGCCTG GGCAGCCGGG CCGGCTGA
|
Protein sequence | MPAARPMRNP AARPRTVEYG FGGDAAAGAD TDHQTSIHPV TTHPETNHPE TNHPASAHRT TTRPATSHST ARRPAAARSA GARSSGAHGS RRPPLDGRLA RTAFAVTAAL TLLGGVGLGR GLAGPDGSDS AVDALAATVA GRVPPPAAVH TTGPTATVTA PPRPAASGLT AVQVPTAALT SAPPPAAAQA LPVAAPAAAA AAAGNPFAGA RFYIDPADQV AAAINALRGG NPSAIAALEK ILRGAHADWF GYADPATTRR NVAGRASTIR GNGALPVFVA YAIPNRDCGS YSAGGAGGAQ GYRDWIAAFA AGLAGGPAAV VLEPDAIAQI DCLSPADQQT RYGMLSNAVD VLNAAGATVY LDAGNAGWHS AATIAARLKS AGVDRARGFA LNVSNFGTTA SEVAFGDAVN AALGGGAHFV VDTSRNGLGP APDNAWCNPP GRALGTPPTA ATGDSDVDAF FWVKIPGESD GTCNGGPAAG QFWPDYAVGL GSRAG
|
| |