Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6988 |
Symbol | |
ID | 5675299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8509551 |
End bp | 8511170 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245834 |
Product | lysozyme |
Protein accession | YP_001511225 |
Protein GI | 158318717 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.14763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCCTT TGCGAGGTCG GCACCGACGC CGTCGTCGTC AGATCATTTC GGGTACCGCG GCCCTGCTCA CCGTTGGCGG TGCGGTCGGC CTGGCCGTGG TGGTGCCCGG CACCGCGGAC GCCGCCGGCC TCGACGCCGC GTACAGCCGC ACCAACGACT GGGGCACCGG CTACTCCGCC CAGTATCAGG TCACCAACTC CGCGGACTCG CCGGACGGCT TCACCCTCGA GTTCGACCTG CCGGACGGTG CGACCCTCAC CTCGCTGTGG AACGCCGCCT ACCAGGTGGA CGGCCGTCAC GTGACCGTCA CCCCGCCGGC CTGGCAGACC ACACTCGCGC CACGGGAATC GGTCGACGTC GGCTTCGTCA TCGCCGCTCC GGGCGGCGCC ACCGACCCCC TCGGATGTCG GATCAACGGC GAGGACTGCA CGCCCGGCTC CGGCAACGGC GACCCGGGCC CGGAGCCTGA ACCCTCGGCC ACCGGGCCGG CCGCACCGTC GTCCCCGCCG CCGAACGCGT CGCCGACCGA CCCGGCCACG CCACCCGACA CGGCGCCGCG GCCCACCGCG GGCCCGGGCA CCGGCCAGCC GAGCGGCGCT CCGACGAGCA CCCCGAGCAC GCCGACCACG CCGCCCGCTT CGACCACCGC GCCGCCGGCA CCGCCGGCAC CACCGGCACC GCCGTCGAAC GGCTCCAGCG GATCGGGCGG GTTCGCGCCG TACGTCGACA CCTCGCTGTA CCCGCCGTTC GACCTGGTCG CGGCGGCGCG GACCGCCGGC CTGCGTGACG TCACGCTGGC CTTCGTCGTG GCCGGTGGAG GCGGCTGTAC GCCGAAGTGG GGCGGGGTCA GCGACCTCAC CATGGACGGC GTGCCCGGCC AGATCGGCCG GTTCCGTGAG CTGGGCGGCG ACGTCCGGGT GTCGTTCGGC GGGGCGTCCG GAACCGAGCT CGCCAGTGCC TGCGGCAGCG CGGGCGACCT GGCGGCCGCG TACCGCAAGG TGGTCGACGT CTACGGGGTG ACCCGGCTCG ACTTCGACGT CGAGGGCGGC ACGTTGCCGG ACGTCGCCGC GAACACCCGG CGTGCCCAGG CGATCGCCCG GCTTCAGCGG GAGGCCGCGG CCGGAGGCCG GCCACTGGAG GTCTCGTTCA CGCTGCCGGT GCTGCCGTCC GGCCTGACCC AGGCCGGCGT GGACCTGCTG GCCAACGCCC GGGAGAACGG CGTGACGGTG AACGCCGTCA ACATCATGGC GATGGACTAC GGCGACGGCG CCGCGCCGAA CCCGGCGGGC CGGATGGGCC AGTACGCCAT CGACGCCGCC ACCGCGACCC AGGCCCAGGT CAAGGGCGTG TTCGAGCTGT CCGACGCGCA GGCGTGGGGG CGGGTGGCCG TCACCCCGAT GATCGGTGTG AACGACGTCG CCAGCGAGGT GTTCACCCTG GCCGACGCGC GGCGGCTGGT GCGGTTCGCG TCCGAGGTCG ACCTCGCCTG GCTGTCGATG TGGTCGCTGA CCCGCGACCA GCCCTGCCCC GGTGGGCCGG TGCCGTACGC GCAGCCGACC TGCGGCGGCA TCGAGGCGCA GCCGTTCGAT TTCACCCGCG CCTTCAACGC CGCCCAGTGA
|
Protein sequence | MSPLRGRHRR RRRQIISGTA ALLTVGGAVG LAVVVPGTAD AAGLDAAYSR TNDWGTGYSA QYQVTNSADS PDGFTLEFDL PDGATLTSLW NAAYQVDGRH VTVTPPAWQT TLAPRESVDV GFVIAAPGGA TDPLGCRING EDCTPGSGNG DPGPEPEPSA TGPAAPSSPP PNASPTDPAT PPDTAPRPTA GPGTGQPSGA PTSTPSTPTT PPASTTAPPA PPAPPAPPSN GSSGSGGFAP YVDTSLYPPF DLVAAARTAG LRDVTLAFVV AGGGGCTPKW GGVSDLTMDG VPGQIGRFRE LGGDVRVSFG GASGTELASA CGSAGDLAAA YRKVVDVYGV TRLDFDVEGG TLPDVAANTR RAQAIARLQR EAAAGGRPLE VSFTLPVLPS GLTQAGVDLL ANARENGVTV NAVNIMAMDY GDGAAPNPAG RMGQYAIDAA TATQAQVKGV FELSDAQAWG RVAVTPMIGV NDVASEVFTL ADARRLVRFA SEVDLAWLSM WSLTRDQPCP GGPVPYAQPT CGGIEAQPFD FTRAFNAAQ
|
| |