Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0431 |
Symbol | |
ID | 5668854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 508969 |
End bp | 510324 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641239363 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001504802 |
Protein GI | 158312294 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3507] Beta-xylosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGC CGATCATTGC GGGCTTCTAT CCCGACCCGA CGGTCTGCCG AGTCGGTGAC GACTATTACC TGGCCCATTC CAGCTTCGAG TACTTTCCCG GCGCGCCGAT CTGGCACAGC CGCGACCTGA TCCGCTGGAC GCAGATCGGC CACATCCTGA CCCGCCGCAG CCAGTTCGCC CGCGGCGACT GCCGCCCGTC CTCCGGGCTG TACGCCGGGA CGCTGCGGCA TCACGACGGG CGTTTCGTCT ACGTCACCAC GAACGTCAGC GACTTCGACG GCGGCCAGCT GATCGTCACC GCCGACGACC CGGCCGGCCC GTGGAGCGAG CCGGTTCGGG TGCCCGAGGC GATCGGCATC GACCCGGACC TCGCCTGGGA CGACGACGGC ACGTGCTACC TGAGCTGGAA GGCGATGAGC CTCGCCGACG GCGAGATCGG CATCCGCCAG GCCCCGCTCG ACATCTCGTC GGGCAAGTTC CTCGCCCCGG CGTACCCGAT CTGGCAGGGC AGCGGACTCG CCGCGGCCGA AGGGCCGCAT CTGTACGCCA TCGACGGCCT CTGGTACCTG CTGCTCGCCG AGGGCGGCAC CGAGCGCGGG CACGCCGTCA CCGTCGCGCG CGGCCCGCAC CCGTCCGGGC CCTTCGAGGG CTGTGCGGCC AACCCGATCC TGAGCCACCG CAGCACCATC CACCCGGTGC AGAACACCGG CCACGCCGAC CTGGTCCGCG CGCCGGGCGG CGGCTGGGCC GCCGTCTATC TGGGTACCCG GCCGCGCGGC TGGACGCCGG GCTTCCACGT CCTGGGCCGG GAGACGTTCC TGGCCGGCAT CGACTGGGTG GACGGCTGGC CGGTTTTCGA CGAGGACCGC TACCAGGTCC CACCGCTCGA CACCGGCTTT ACGGAGGTGT TCGGTGACTC ACCGCCGCAT CTGCGCTGGG TCGCCGACGG CAGCGGCCTG CGGTGCACCC GGGTGCGCGA TCTGCGCTGG TCCGCCGAAG CGACGCTCGA CGGCCCGGGC CGCTTCGAGG TGCGCATCGA CGACCGGCAC GCGTACGGCC TGACCCGCCA CCACGACCGG GTCGAGGCGA CGGCCCGCGT CGGTGACCTC GGCGCCGTCC TCGCCACGCT CCCCGTCGCG GATGGTCACA CGGTCCTGCG GATCGAGGCG GTGCCACCGC CGTCACGCGA CGCCGGGCCC GACGAGATTG TCCTCTCCGC CGGCTCGCAC GAACTCGCCC GCCTCGACGG GCGCTACCTC TCCACCGAAG TCGCGTCCGG GTTCACCGGC CGCATGCTCG CCACCGCCGG ACCGCACGTC CGGTCGGTGA CCTACCGCCC CCAGGAGCAG CCATGA
|
Protein sequence | MPEPIIAGFY PDPTVCRVGD DYYLAHSSFE YFPGAPIWHS RDLIRWTQIG HILTRRSQFA RGDCRPSSGL YAGTLRHHDG RFVYVTTNVS DFDGGQLIVT ADDPAGPWSE PVRVPEAIGI DPDLAWDDDG TCYLSWKAMS LADGEIGIRQ APLDISSGKF LAPAYPIWQG SGLAAAEGPH LYAIDGLWYL LLAEGGTERG HAVTVARGPH PSGPFEGCAA NPILSHRSTI HPVQNTGHAD LVRAPGGGWA AVYLGTRPRG WTPGFHVLGR ETFLAGIDWV DGWPVFDEDR YQVPPLDTGF TEVFGDSPPH LRWVADGSGL RCTRVRDLRW SAEATLDGPG RFEVRIDDRH AYGLTRHHDR VEATARVGDL GAVLATLPVA DGHTVLRIEA VPPPSRDAGP DEIVLSAGSH ELARLDGRYL STEVASGFTG RMLATAGPHV RSVTYRPQEQ P
|
| |