Gene Information |
Locus tag | Franean1_2256 |
Symbol | |
ID | 5670655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2695649 |
End bp | 2696824 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241176 |
Product | arabinogalactan endo-1,4-beta-galactosidase |
Protein accession | YP_001506597 |
Protein GI | 158314089 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | |
|
|
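The coordinate and length fields above are consistent with one another, which a short calculation can confirm. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2695649
end_bp = 2696824
reported_gene_length = 1176     # bp
reported_protein_length = 391   # aa

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions both computed values match the reported ones, and the gene length is a whole number of codons.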
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.020335 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGGCG GCACCGTGCG CGTGATCCAC ATCGTGAGCC TGATCGCCGC CGCCGGTACG GTGGCGGCGA TGGTCACCGT CCTGCTCGGG CGCGACAGGC CGTCACGCGA GGCCCTCGGC GTCCGGGGCG CAGACATCTC GTTCACGCTC CAGGAGGAGG CCGCTGGGAC CAGCCTCTCC GATGGCGGGC GACGCCTCCC CGTCGAGCGG ATCCTCGCGG CGCACGGCGC GAACTACGTC CGGCTGCGGG TCTGGGTCGA TCCGCCCGCC GGATACAGCG ACGAGCAGTC TGTCCTCACC CTGGCACGCC GCGCGACGGA CGCGGGGCTC AGAATCATGC TTAATCCGCA CTATTCCGAC TTCTGGGCCG ACCCGCATTC CCAGGAGATC CCGGCGGCGT GGCGTGGCGC CGATCTCGTG ACGACGGCAG CGAAGGTCCG GGAATACACA CGCGGTCTGG TCGCCAAACT GGCGGCGCAG GGAACGCCCG TCGACATGGT TCAGATCGGC AACGAAATCA CCAACGGGAT GCTCTGGCCG CTCGGCGATG TTCGGGACGG CACGCGAACA CAGTGGTCAC GCCTCGCGGA ACTGGTCAAC AGCGCGATCG AGGGTGCCCG CGAAGGCGCC TCGGGCCGGC CCGTGGACAT CGTCCTCCAC GTCGACAGCG GGGGCGACCT CGGCCGCTCA GAGTATTTCT TCGGCAACCT CATGCAGGCG GGCGTCACCG CGTTCGACGT CATCGGGGTG AGTTACTACC CGTACTGGAA CGGCTCGCTG GCGACGCTGC GGGCGACGCT CGACGGCCTG GCACGGCAGT ACCACCGGGA CATCCTCATC GTGGAGACCG CGTATCCGTG GACGTTGGGT GATCCCGGGC CGGGCGACTG GGTCACGTCC CCCGACCAGC TACCGGACGC CGTGACAGTG CCCGCGACGC CGGCCGGGCA GGCAGCCTTC TTCGCCGAGT TGCGGCAGAT CCTCCACGAC GTGCCGGACG GTCGAGGCCT GGGGTTCCTC GCCTGGGAGC CCGGCTGGCT CACAGTGGCC CCGAAACCCG GCGAGCCGAT CCCCGGAGCG AACCTCGCGA TGTTCGACCG GGACGGAGTC GGGCTGCCCA GCCTCGCCGC GTTCGCACCC CCCGACGCCG CCGGCTCGGC GCTGGCGGGT CGATGA
|
Protein sequence | MRGGTVRVIH IVSLIAAAGT VAAMVTVLLG RDRPSREALG VRGADISFTL QEEAAGTSLS DGGRRLPVER ILAAHGANYV RLRVWVDPPA GYSDEQSVLT LARRATDAGL RIMLNPHYSD FWADPHSQEI PAAWRGADLV TTAAKVREYT RGLVAKLAAQ GTPVDMVQIG NEITNGMLWP LGDVRDGTRT QWSRLAELVN SAIEGAREGA SGRPVDIVLH VDSGGDLGRS EYFFGNLMQA GVTAFDVIGV SYYPYWNGSL ATLRATLDGL ARQYHRDILI VETAYPWTLG DPGPGDWVTS PDQLPDAVTV PATPAGQAAF FAELRQILHD VPDGRGLGFL AWEPGWLTVA PKPGEPIPGA NLAMFDRDGV GLPSLAAFAP PDAAGSALAG R
|
| |
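The gene sequence as listed starts with ATG and ends with TGA, so it can be read directly in coding orientation. The sketch below shows, under stated assumptions, how the protein sequence relates to the gene sequence and how a GC fraction can be computed. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The codon table is the standard one, whose amino acid assignments are shared by translation table 11. Only the first three 10-base blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon table (same amino acid assignments as translation table 11),
# built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-character blocks in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGCGGGGCG GCACCGTGCG CGTGATCCAC"
print(translate(first_blocks))  # MRGGTVRVIH, the first block of the protein sequence
print(round(gc_fraction(first_blocks), 2))
```

Passing the full gene sequence to the same functions would allow the whole protein sequence and the reported GC content to be checked in the same way.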