Gene Information |
Locus tag | Franean1_4659 |
Symbol | |
ID | 5673002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5559360 |
End bp | 5561327 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641243517 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001508933 |
Protein GI | 158316425 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
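The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    last codon is a stop codon that contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(5559360, 5561327)
print(length_bp)                  # 1968
print(protein_length(length_bp))  # 655
```

Under these assumptions the results agree with the Gene Length (1968 bp) and Protein Length (655 aa) fields.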
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.398765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.788478 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATCCGTT CATCCTTCAA TGACGGCTGG TCCTTCCGCC GGAAGACCGA CGCGTTTCTG GAGATCCTCG GCCGGGCCGA CGCCCCCTGG CAGGACGTCC GGCTGCCCCA CGACGCGATG GTGGGGCTCG AACGCGACAG GGCCGATACC GAAGCGGGTC AGCGCGGCTA CTACCCAAGC GGGGAATACC AGTACAGAAA GACGTTCTTC GCGCCGGAGG AATACCGGAA CCGGCGAGTA ACCCTTGAGT TCGAGGGTGT CTACCGGAAT GCCCGGGTGT TCATCAACGG CGATTTCGCG GCGCAGCGCG CCTACGGCTA CTCCAATTTC TACGTCCACA CGGACCATCT ACTGAAGTAC GGCAGCAACA ACGAGATCGT CGTGGAGGCG CGCAGTGGCA ACGACACCCG CTGGTACTCG GGTGGCGGCG TCTACCGCAA CACGAAACTG ATCGTCGGCG ATCTGGTTCA CCTCGCACTG GACGGGGTGA GGATCACCAC TCCGGCCGTG GACGACGACG GCACGGTCGT CGCCGGGGAC ACCGCTCCGG TCACCACGTT CATCGGCGAC ACACTCACCC TGCGCCAGCG GCTGCCCGTG CCACGACCGG AGCTGTGGGG GGTGGAGCGA CCCTATCTCT ACCTCTGCCG GACCAGCGTC ACGGCGGACG GTGAGCTACT GGACCAGGAG ACCACGCGCT TCGGCATCCG TACGCTCACC GTTGATCCGC TGCGGGGCCT GCGCATCAAC GGCGAGACGG TGAACCTGCG CGGCGCCTGC ATCCACCACG ACAACGGGGT GATCGGCGCC GCGACCATCG ACCGCGCCGA ACAGCGCAGG GTCGAGATCC TCAAACAGGC CGGTTTCAAC GCCATCCGCA GCGCCCACCA TCCGATGAGC AAGGCGCTCC TCGACACCTG CGACCAGCTC GGCGTTCTCG TGATCGACGA GCTGTTCGAC GCGTGGACCC GCTCCAAGGT GAGCCAGGAC TACGCCCTCG ACTTCGCCAC CTGGTGGGAG TCCGACGTGC GGGCGATGGT CGACAAGGAC TTCAACCATC CCAGTGTCAT CCTCTACTCG ATCGGGAACG AGATCCCGGA GACGGGCACC GCCGCCGGCG CGGCGATCAG CCGCCGGCTC GCCGAGACGA TCCGCTCGAT CGACAGCACC AGGTTCGTCA CGAACGGCGT CAACGGACTC CTCGCCGGCG GCCCCGACCT GCTCGCCTCC TTCGCCGGCG AATCCCGGAA GAAGGACAGC GAGAGCCTCG ACGTCAACGG CTTCATGACG CGGTTCCGTG AGTTCATGCC AATCCTCATG AGCTCCGAGG TCGTCGGTTC GAAGACCGCC GAGTCGATAG CCTGCCTCGA TGTCGCCGGC TACAACTACC TCGAATCACG GTACGGGCAG GACGGGGCGC AGTTCCCCAA CCGGGTGATC GTCGGAACCG AGACCTATCC CACCGACATC GACACGAACT GGCGGCTCGA GATCGGTTTC GTCTACCCCG ACCTCGAGGT CGTCAAGCAG TACGTCGACA TCGACCACGG CGACTACCAG GCCACCTTCC AACCGGCCAT CCTGGCCGGC GCCGCCGCTC TCGGGCTCGT CGTCGACTTC ACCGAGCCGG CCAACGCCGC GTGCATCGCC ACCCTTGAGG CCGCGCTGCC AGCACTCACC GGCAAGCTGG TCGACCCGGC GACCGTGCCC TCAGGCCAGC CCACCCCCGG AACATCCGAA AGCGCCGCCT GCCGCTACCT GACCCTGTTC CAGGCGATCG CCGAGAAGGC CGGCAAGGAC 
CTCACCTACC AGTCCTTCCA GCAGGCCGCG TTCTCTCTCG GTTCCTTCCA GGTCCCCACC CTGCGGGACA AGGCCACCTA CAGCCGCGAG ACACCCCACG GCGCCGTCCC CCCGCGCCTG TTCACGTTCG ATCCCGCGAA GAAGAACTTC TTCCCCGCCG GGAGCTGA
|
Protein sequence | MIRSSFNDGW SFRRKTDAFL EILGRADAPW QDVRLPHDAM VGLERDRADT EAGQRGYYPS GEYQYRKTFF APEEYRNRRV TLEFEGVYRN ARVFINGDFA AQRAYGYSNF YVHTDHLLKY GSNNEIVVEA RSGNDTRWYS GGGVYRNTKL IVGDLVHLAL DGVRITTPAV DDDGTVVAGD TAPVTTFIGD TLTLRQRLPV PRPELWGVER PYLYLCRTSV TADGELLDQE TTRFGIRTLT VDPLRGLRIN GETVNLRGAC IHHDNGVIGA ATIDRAEQRR VEILKQAGFN AIRSAHHPMS KALLDTCDQL GVLVIDELFD AWTRSKVSQD YALDFATWWE SDVRAMVDKD FNHPSVILYS IGNEIPETGT AAGAAISRRL AETIRSIDST RFVTNGVNGL LAGGPDLLAS FAGESRKKDS ESLDVNGFMT RFREFMPILM SSEVVGSKTA ESIACLDVAG YNYLESRYGQ DGAQFPNRVI VGTETYPTDI DTNWRLEIGF VYPDLEVVKQ YVDIDHGDYQ ATFQPAILAG AAALGLVVDF TEPANAACIA TLEAALPALT GKLVDPATVP SGQPTPGTSE SAACRYLTLF QAIAEKAGKD LTYQSFQQAA FSLGSFQVPT LRDKATYSRE TPHGAVPPRL FTFDPAKKNF FPAGS
|
| |
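The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11 (bacterial, archaeal and plant plastid code), where TTG is one of the permitted alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation, not the pipeline used to produce this record; the function name `translate_table11` is hypothetical, and the input is assumed to be an uppercase, in-frame DNA string.

```python
# Amino acid assignments for the standard code, with codons ordered over
# bases T, C, A, G. Table 11 uses the same assignments as the standard
# code and differs only in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
START_CODONS_TABLE11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = "".join(dna.split())  # drop the spaces used to group bases in blocks of 10
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE11:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 18 bases of the gene sequence in this record.
print(translate_table11("TTGATCCGTT CATCCTTC"))  # MIRSSF
```

The output matches the first six residues of the protein sequence listed in this record.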