Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4669 |
Symbol | |
ID | 5673011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5574344 |
End bp | 5576833 |
Gene Length | 2490 bp |
Protein Length | 829 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243526 |
Product | alpha-L-rhamnosidase |
Protein accession | YP_001508942 |
Protein GI | 158316434 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.170324 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCCTC GCTGGAACGC CGCGTTCATC GCCGCCCCGT CCGATCCGGC CGGGCGCGGT CCCGCCCCGG CGCCGTACCT GCGCCGCGAG TTCACCGTGG GCAATGGCCT GCGCTCCGCC ACCCTGCACG TCACCGCGGT GGGCCTGATC GAGGCTCACC TGAACGGGGC CCCGGTGGGC GACGAGGTCC TCGCCCCGGG CTGGACGTCC TACCGGCATC GCCTGGTCGT GAGCAGTCAC GACGTCACCG GCCTGCTGGT GGAGGGCGCC AACGCCCTCG GCGCCGTCCT CGGTGAGGGC TGGGCGGTGG GCAGGCTCAC CTGGGAGAAG GACAAGCGGG CCGTCTGGGC GGACCACCCG GCCGGCTTCC TGCAGCTCGA CCTCGACTAC GGCGACCGGG TGGACGTGAT CAGAAGCGGT GCCGACTGGC GGGCGGGCAC CGGGGCGACC CTGACGGACA GCATCTACGA CGGCGAGACC CATGACGCGC GCCTGGAGCC GGCGGGCTGG GCAGAGCCCG GCTTCGACGA CGAGCACTGG TCGCCAGTGG AGGTCGTCGC ACGCGATCTG ACCACGCTGA TCGCGCCGAG CGCGCCGCCG ATCCGACGGG TGCAGGAGCT GCCCGCCGTC GACATCCTGA CGACGCCGGC GGGCCGGACG GTCGTCGACT TCGGGCAGAA CCTGTCGGGC TGGGTCCGCC TGACCGTCCG CGGGGAGGCC GGCACCACGA TCGTCCTGCG GCACACCGAG ACGTTGATCG ACGGCGAGGC GGACTTCCGG CCCAACCGGA CGGCGCTGGC GACCGACTGC TACGTCCTGC GGGGCGGTGA TCCGGAGACG TGGGAGCCTC GGTTCACCTT TCACGGTTTC CGTTACGTCG AGATGGAGGG GTGGCCCGGC AGCCTCGACG CCGACGCGAT GACCGCGGTG GTGGTGCACA GCGACATGCG CCGGACCGGG TGGTTCGAGA CGTCCGACGA ACTGCTCAAC CAGCTGCATC GCAACGTCGT CTGGTCGATG CGCGGCAACT TCGTCGGCGT GCCCACCGAC TGCCCGCAGC GCGACGAGCG GCTCGGCTGG ACCGGCGACA TCAACGCCTT CGGCCCCACC GCCGCCTTCC TCTACGACGT GCGCGGCGTG CTGGGCTCGT GGCTCACCGA CCTCGCCGTC GAGCAGCGCG CGCAGGGGCA CGTGCCGCTG GTCGTCCCGG ACGTGGGGGG CATGCCGATC ACGGCGCCCA CCGCGCTGTG GGGAGACGTC GCGGTCAGCC TGCCCTGGAC GCTCTACCAG GAGTACGGCG ACCGGGAGCT GCTCGCCGAC CAGTACGAGT CCATGACGGC CTTCATCGAC AGCGTGGAGG GCCTGCTGGA CGAGCGGGGG CTGTGGAACT CCGGTTTCCA GTTCGGTGAC TGGCTCGATC CGGACGCGCC GCCGAAGAAC CCGGCCGGCG GCAGGACGGA CGCCTACCTG GTCGCCAGCG CCTTCTTCTG CCACACGACC CGCCAGCTGG CGCAGGCCGC CGAGGTCCTG GGCCACACCG GCGACGCCGC CCGGTACACG GCCCTGCACC AGCGTGTCCG CGCCGCCTTC CGCGACGAGT GGGTCACCCC GTCCGGCCTG GTCGCGAACG ACACGGCGAC CGCCTACGCC CTGGCCATCT GCTTCGACAT CCTCGACCCG GCCCAGCAGG CGCGCGCCGG GCGCCGGCTC GCCGACCTGG TCAGCAAGGC CGACCACAGG ATCAGCACGG GCTTCGCCGG CACGCCGCTG GTCGCACACG CGCTGAGCCG CACCGACCAG CTCGACACCG CCTACCGGTT GTTGCTGCAG ACCGAGTGCC CGTCGTTCCT CTACCCGGTG ACGCGGGGCG CGACGACGAT CTGGGAGCGG TGGGACGCGA TCCGGCCCGA CGGATCACTG CACGACACCG GCATGACCTC GCTCAACCAC TACGCCCTGG GCGCCATCGC GGACTGGCTG CACCGCGTCG TCGGCGGCCT CGAACCCGTC GAGCCCGGCT ACCGGCGGAT GCGGATCGCG CCGCGGCCCG GCGGCGGCCT CACCCACGCC ACGCTCACCC ACGACACCCC GCACGGGAGG GTCCGTGTCG CCTGGCGCCG GCAGCCCGAC AGCCGGATCA CCGTCGAGGT CGACGTCCCG CCGGGCACGG CCGCCGACGT CGTCCTGCCG GGCCACCCCG ACAGGCTGAG CGTCCCCGTC GGGCCGGGCA GCCACCGCTG GGAGTACGAC GTCCCGACGC CGGACCGGCC CGATTACAGC CTCGACACCC CACTCAGGCA GGTCTTTCGA GATTCGGCGC TGTGGGCCGA GCTCCAGTCC GTCCTTCGCC GGCACCTGCC CCAGTTCGCG GACGCCGACA GCGGGACGGA ACCATCCCTG CCGAGCCTGC GCGCCCTGCT GGGGTACTTC CCGGCGCAGG CCCCGGCTCT GGAGGCCGAC CTGGTCGCCG TGCTTGGGAC GCGAACCTAG
|
Protein sequence | MPPRWNAAFI AAPSDPAGRG PAPAPYLRRE FTVGNGLRSA TLHVTAVGLI EAHLNGAPVG DEVLAPGWTS YRHRLVVSSH DVTGLLVEGA NALGAVLGEG WAVGRLTWEK DKRAVWADHP AGFLQLDLDY GDRVDVIRSG ADWRAGTGAT LTDSIYDGET HDARLEPAGW AEPGFDDEHW SPVEVVARDL TTLIAPSAPP IRRVQELPAV DILTTPAGRT VVDFGQNLSG WVRLTVRGEA GTTIVLRHTE TLIDGEADFR PNRTALATDC YVLRGGDPET WEPRFTFHGF RYVEMEGWPG SLDADAMTAV VVHSDMRRTG WFETSDELLN QLHRNVVWSM RGNFVGVPTD CPQRDERLGW TGDINAFGPT AAFLYDVRGV LGSWLTDLAV EQRAQGHVPL VVPDVGGMPI TAPTALWGDV AVSLPWTLYQ EYGDRELLAD QYESMTAFID SVEGLLDERG LWNSGFQFGD WLDPDAPPKN PAGGRTDAYL VASAFFCHTT RQLAQAAEVL GHTGDAARYT ALHQRVRAAF RDEWVTPSGL VANDTATAYA LAICFDILDP AQQARAGRRL ADLVSKADHR ISTGFAGTPL VAHALSRTDQ LDTAYRLLLQ TECPSFLYPV TRGATTIWER WDAIRPDGSL HDTGMTSLNH YALGAIADWL HRVVGGLEPV EPGYRRMRIA PRPGGGLTHA TLTHDTPHGR VRVAWRRQPD SRITVEVDVP PGTAADVVLP GHPDRLSVPV GPGSHRWEYD VPTPDRPDYS LDTPLRQVFR DSALWAELQS VLRRHLPQFA DADSGTEPSL PSLRALLGYF PAQAPALEAD LVAVLGTRT
|
| |