Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3193 |
Symbol | |
ID | 5671569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3768724 |
End bp | 3771201 |
Gene Length | 2478 bp |
Protein Length | 825 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242087 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001507507 |
Protein GI | 158314999 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.360698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATCCGTT CATCCTTCAA TGACGGCTGG TCCTTCCGCC GGAAGACCGA CGCGTTCCTG GAGATCCTCG GCCGGGCCGA CGCCCCCTGG GAGGACGTCC GGCTTCCCCA CGACGCGATG GTGGGACTAG AGCGCGACGG AGCCGATACC GAGGCTGGTC AGCGCGGCTA CTACCCGAGC GGGGCATACC AGTACCGGAA GACCTTCTTC GCGCCGGAAA AATACCGGAA CCGGCGAGTA ACCCTTGAGT TCGAGGGTGT CTACCGGAAT GCCCGGGTGT TCATCAACGG CGATTTCGCG GCGCAGCGCG CCTACGGCTA CTCCAACTTC TACGTCCACA CAGATCATCT ACTGAAGTAC GGCAGCAACA ACGAGATCGT CGTGGAGGCG TACAGCGGCA ACGACACCCG CTGGTACTCG GGTGGCGGCG TCTACCGCAA CACGAAACTG ATCGTCGGCG ATCTGGTTCA CCTCGCACTG GACGGGGTGA AGATCACAAC TCCGGCCGTC GACGGCGACC TCGCGGTGGT CGCGGTGGCG ACCGAGTTAC AGAACGAGTC TCCCGTCACC CGGTCGCTCG AAGTGCTCAC CGAGATCGTG GACGCCGACG GCACGGTCGT CGCCCGGGAC ACCGCTCCGG TCACCGCCTT CATCGGCGAC CGGCTCACCC TGCGCCAGCG GCTGTCCGTG CCACAGCCGG AGCTGTGGGG GGTGGACCGG CCCTATCTCT ACCTCTGCCG GACCAGCGTC ACGGCGGACG GTGAGCTACT GGACCAGGAG ACCACGCGCT TCGGCATCCG TACGCTCACC ATCGACCCGC GGCGGGGCCT GCGCGTCAAC GGCGAGGTGG TGAACCTGCG TGGCGCCTGC ATCCACCACG ACAACGGGGT GATCGGCGCC GCGACCATCG ACCGCGCCGA ACAGCGCAGG GTCGAGATCC TCAAACAGGC CGGCTTCAAC GCCATCCGCA GCTCCCACAA CCCGATGAGC AAGGCGCTCC TCGACACCTG CGACCAGCTC GGCGTTCTCG TGATCGACGA GCTGTTCGAC GCGTGGACCC GCTCCAAGGT GAGCCAGGAC TACGCCCTCG ACTTCGCCAC CTGGTGGGAG TCCGACGTGC GGGCGATGGT CGACAAGGAC TTCAACCATC CCAGTGTCAT CCTCTACTCG ATCGGGAACG AGATCCCGGA GACGGGCACC GCCGCCGGCG CGGCGATCAG CCGCCGGCTC GCCGAGACGA TCCGCTCGAT CGACAGCACC AGGTTCGTCA CGAACGGCGT CAACGGACTC CTCGCCGGCG GCCCCGACCT GCTCGCCTCC TTCCCCGGCG AATCCCGGAA GAAGGACAGC GAGAGCCTCG ACGTCAACGG CTTCATGACG CGGTTCCGTG AGCTCATGCC AATCCTCATG AGCTCCGAGG TCGTCGGTTC GAAGACCGCC GAATCGATGG CCTGCCTCGA TGTCGCCGGC TACAACTACC TCGAATCACG GTACGGGCAG GACGGGGCGC AGTTCCCCAA CCGGGTGATC GTCGGAACCG AGACCTATCC CGCCGACATC GACACGAACT GGCGGCTCGT CCAGGACCAC AGCCATGTCA TCGGTGACTT CACCTGGACC GGCTGGGACT ACCTCGGCGA GCCGGGGACC GGCCGGATCG AGTACGAGGG CGACGACGAG ACCACGAACT CCGCGCTGAA CCACCAGGGC GGCTACCCGT GGCTGACGGC GTGGTGCGGT GACATCGACA TCACCGGCCA CCGCCGGCCC GCCTCCTACT ACCGCGAGAT CGTGTTCGGC CTGCGCGGCA AGCCCTACAT CGCCGTCCAC CGTCCCGAGC GCTTCGGTCG GGCGGTCCGG GTGGCGATGT GGTGGTCGTG GAGCGACTCG GTCTCCAGCT GGAGCTGGAA CGGCCACGAG GACAGGCCGG TGCGGCTCGA GGTGTACTCC GCGGCGGACG AGATCGAGCT CCTGGTCAAC GGCCGGCGGG TCGGAACCGC CCCGGCGGGT GAGAAGAACC GGTTCAAGGC CGAGTTCGAG ACTGTCTACG AGCCCGGCGA GATCGTGGCC GTCGCCTACA CCGGGGGCCG CGAGACCGGC CGCACCTCGC TGCGCTCCGC GACGGGCGAG GTCCGCCTCG ACGCCGCGGC CGACCGCACC CGCATCACCG CCGACGACAC CGACCTCGCC TACGTCGCCC TCACGCTCGT CGACGGGGCC GGCAACCTCT ACAACACCGC CGACCGCAAG GTGGCCGTCG AAGTGACGGG GCCCGCCGTG CTCCAGGGCT TCGGCAGCGC GGACCCCAGG CCGACCGAGA ACTTCTTCGA CACGGTCCGC ACGACCTTCG ACGGCCGGGC GCTCGCCGTC ATCCGCCCGA CGGCCCCCGG CTCGATCACC GTCACCGCCA CCGCGGAGGG CTGCGAGCCG TCCACCGTGC GCATCGAAGC CGAACCCTCG GGCCCCTCCG CCGAATGA
|
Protein sequence | MIRSSFNDGW SFRRKTDAFL EILGRADAPW EDVRLPHDAM VGLERDGADT EAGQRGYYPS GAYQYRKTFF APEKYRNRRV TLEFEGVYRN ARVFINGDFA AQRAYGYSNF YVHTDHLLKY GSNNEIVVEA YSGNDTRWYS GGGVYRNTKL IVGDLVHLAL DGVKITTPAV DGDLAVVAVA TELQNESPVT RSLEVLTEIV DADGTVVARD TAPVTAFIGD RLTLRQRLSV PQPELWGVDR PYLYLCRTSV TADGELLDQE TTRFGIRTLT IDPRRGLRVN GEVVNLRGAC IHHDNGVIGA ATIDRAEQRR VEILKQAGFN AIRSSHNPMS KALLDTCDQL GVLVIDELFD AWTRSKVSQD YALDFATWWE SDVRAMVDKD FNHPSVILYS IGNEIPETGT AAGAAISRRL AETIRSIDST RFVTNGVNGL LAGGPDLLAS FPGESRKKDS ESLDVNGFMT RFRELMPILM SSEVVGSKTA ESMACLDVAG YNYLESRYGQ DGAQFPNRVI VGTETYPADI DTNWRLVQDH SHVIGDFTWT GWDYLGEPGT GRIEYEGDDE TTNSALNHQG GYPWLTAWCG DIDITGHRRP ASYYREIVFG LRGKPYIAVH RPERFGRAVR VAMWWSWSDS VSSWSWNGHE DRPVRLEVYS AADEIELLVN GRRVGTAPAG EKNRFKAEFE TVYEPGEIVA VAYTGGRETG RTSLRSATGE VRLDAAADRT RITADDTDLA YVALTLVDGA GNLYNTADRK VAVEVTGPAV LQGFGSADPR PTENFFDTVR TTFDGRALAV IRPTAPGSIT VTATAEGCEP STVRIEAEPS GPSAE
|
| |