Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2221 |
Symbol | |
ID | 5670620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2657317 |
End bp | 2659206 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241141 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_001506562 |
Protein GI | 158314054 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.47464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0451069 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCATCCC TGATCGAGGA TTACGCGCTG ATAGGAGACA CTCACTCGGC GGCGCTGGTC TCCCGCGACG GCTCGATCGA CTGGCTGTGC CTGCCTCGGT TCGACTCCCC GGCCTGCTTC GCCGCGCTGC TCGGGGACAA CGAGTCCGGC CACTGGAAGA TCGCGCCGGT CGAGCCGGTG ATCGGGGTGA GCCGCCGCTA CCGCGGCGAC ACCCTGGTGC TCGAGACCGA CATGACCACC GCGTCGGGCA TCGTCCGGAT CGTCGACGCG ATGTTCCCCC GGGCGGGGGC GCACGTCGTC CTGCGGCTGG TGGAGTGCCT GCAGGGCACC GTCCGGCTCC GGTCGGAGAC CCGCTTCCGG TTCGACTACG GCTCGATCGT GCCGTGGGTG CGCCGGCTCG ACGAGCACTC GGTCGCCGCG ATCGCCGGCC CCGACTCGGT GACGCTGTGG ACCACCGCCC CGATGGAGGG GCACGACATG GCCACCTACG CGGAGTTCAG CGTCTCGGCC GGGCAGTCCG TCCCGTTCTC GCTCACCTGG CGGCCCTCGC ACGAACCGGC GCCAGCCCCG CAGGATGTCC GGCGGATGAT CTCGCAGACC GAGGCATGGT GGTCGGACTG GATGGCCGGC TGCACCTACG ACGGCCAGTG GCAGCCGGCG GTCCGCCGGT CGCTGATCAC CCTGAAGGCG CTCACCTACG CCCCGACGGG CGGCATCGTC GCCGCGGTCA CCACCTCGCT GCCCGAGCAC ATCGGCGGCG TCCGCAACTG GGACTACCGC TACTGCTGGC TGCGCGACGC CACGATCACC CTGCTGGCGC TGCTCGACGC CGGCTTCACC AGCGAGGCGA CGGCCTGGCG GGAATGGCTG CTGCGCGCCG TGGCCGGCGA CCCGTCCCGG GTGCAGATCA TGTACGGCGT GGCCGGGGAG CGGCGGCTGC CCGAGTACGA GGTCCCGTGG CTGCCGGGCT ACGAGAACTC CTCCCCCGTC CGGGTGGGAA ACGCCGCCGT CGACCAGTTC CAGCTCGACG TCTACGGGGA GGTCCTCGAC GCCCTGCACG TGGCCCGCGT CGCGGTGGCG AACCGGCGTC CCGGCTCGGC CCGTGACGGT TTCCTCGCGG GGATCCACTC CTCCGAGGAC GACGGCCGGG ACGGCTCCTG GCAGCTGCAG ACCAAGCTCA TGGACTTCCT CGAGACCGGG TGGCGCAAGG CCGACGAGGG CATCTGGGAG GTCCGCGGGC CGCGCCGGCA TTTCGTCCAC TCGAAGGTCA TGGCCTGGGT GGCCGCCGAC CGGGCCGTCC GGGGGATCGT GGAGTCCGGG CTACCCGGCC CGGTCGAGCG GTGGTCGGCG CTGCGGGACG AGATCCACCA CGAGGTGTGC GCCCGCGGGT TCGACTCCGA CCGCAACACG TTCACCCAGT TCTACGGCTC GAAGGAGCTC GACGCGGCCC TGCTCTACAT GTCACTCGTC GGCTTCCTGC CGGCGACCGA CCCACGGGTC GTCGGCACCG TCGCCGCCAT CGAGCGCGAG CTGATGGAGG ACGGCTTCGT CATGCGCTAC CCGACGGCCG AGGACGGCGC CGTCGACGGC CTGCCGGCCG GGGAGGGTGC CTTCCTCGCC TGCACCTTCT GGCTGGCGGA CAACTACGCG CTGTCCGGGC GGGTGCACGA GGCGCAGGAG CTGTTCGAGC GGCTGCTGGC GCTGCGCAAC GACGTCGGGC TGCTGGCCGA GGAGTACGAC CCGCGGCTGG GCCGGATGAC CGGCAACTTC CCGCAGGCGT TCAGCCACGT CCCGCTGGTC AACACCGCGC GCACGCTCAC CGATGCGCTG CGCGGCCGGC CGCGCTCGCG CACCGACCGC GCGCACCCGC CGGGCCACTT CTTCGGCTGA
|
Protein sequence | MPSLIEDYAL IGDTHSAALV SRDGSIDWLC LPRFDSPACF AALLGDNESG HWKIAPVEPV IGVSRRYRGD TLVLETDMTT ASGIVRIVDA MFPRAGAHVV LRLVECLQGT VRLRSETRFR FDYGSIVPWV RRLDEHSVAA IAGPDSVTLW TTAPMEGHDM ATYAEFSVSA GQSVPFSLTW RPSHEPAPAP QDVRRMISQT EAWWSDWMAG CTYDGQWQPA VRRSLITLKA LTYAPTGGIV AAVTTSLPEH IGGVRNWDYR YCWLRDATIT LLALLDAGFT SEATAWREWL LRAVAGDPSR VQIMYGVAGE RRLPEYEVPW LPGYENSSPV RVGNAAVDQF QLDVYGEVLD ALHVARVAVA NRRPGSARDG FLAGIHSSED DGRDGSWQLQ TKLMDFLETG WRKADEGIWE VRGPRRHFVH SKVMAWVAAD RAVRGIVESG LPGPVERWSA LRDEIHHEVC ARGFDSDRNT FTQFYGSKEL DAALLYMSLV GFLPATDPRV VGTVAAIERE LMEDGFVMRY PTAEDGAVDG LPAGEGAFLA CTFWLADNYA LSGRVHEAQE LFERLLALRN DVGLLAEEYD PRLGRMTGNF PQAFSHVPLV NTARTLTDAL RGRPRSRTDR AHPPGHFFG
|
| |