Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5316 |
Symbol | |
ID | 5673650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6403091 |
End bp | 6404875 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244173 |
Product | alpha amylase catalytic region |
Protein accession | YP_001509580 |
Protein GI | 158317072 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0561676 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.278605 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCCG TTGTCGCATA CCGTGCGGGC GAACCCGGGC AGCCCGGCGG CCGCTCCGGC CCGGCCGCGC CACCAGCGGG AACCGAGGAG ACCGATTACT CGGCGTGGTG GCACGACGCC GTGATGTACG AGGTCTACGT CCGCAGCTTC GCCGACGCCG ACGGCGACGG GGTCGGCGAC ATCGAGGGCA TCCGCCGGCG CCTGCCCGAT CTCGCCGACC TGGGCGTCGA CGGGATCTGG GTCAGCCCCT TCTACCGCTC CCCCATGGCC GATCACGGCT ACGACGTGGC CGACCACACC GACGTCGATC CCTTGTTCGG CACTCTCTCG GACATCGACG CGCTGCTGCG CGACGCCCAC GAGGCCGGCC TGAAAGTCGT CGTCGACCTG GTCCCGAACC ACTCCAGCAG CGCGCACCCC GCCTTCCAGG CGGCGCTCGC GGCCGGGCCG GACGCGCCGG AGCGGGACCT CTACCTTTTC CGGGACGGGC GCGGCCCGGA CGGTGCGCTC CCCCCGAACA ACTGGATCTC GGTGTTCGGT GGCCCCGCCT GGACGAGGGT TCCGGACGGC CAGTGGTACC TGCACCTGTT CGCCCCGGAG CAGCCGGACT GGAACTGGCA GCATCCGCGG GTGCGGGCGG CGCACGCCGA GATCATCCGA TTCTGGCTCG ACCGGGGGGT CGACGGCTTC CGGATCGACG TCTCGCACGG CCTGGTCAAG GACGGCGAGC TGCGCGACCA TCCGACCGGC GCGCTCCCCA CGCCGGAGAC CGGCTTCCGG GAGGAGATCG AGCCGCACGC GTGGGATCAG GACGGCGTCC ACGAGATCTA CCGCGAGTGG CGGGCGATCG TCGACGCCCA CGACCGGCGC GACGGCCGAC AGCGGGTGCT CGTCGGCGAG ACCTGGGTCG CCGACCCGGG GCGCCTCGCC CGGTACGTCC GTCCCGACGA GCTGCACCTG ACCTTCACGT TCTCGCTGCT GTACGCGCCG TTCTCCGCCC CGGCGTGGCG GGCGGCGATC GATGCCGCCC GGGCCGCGAC AGCCGCCGTC TGCGCGCCGC CGACCTGGGT GCTGGCCAAC CACGACGTCG TCCGTCCGGT CAGCCGCTAC GGCGGCGGCG AGACCGGCCT GCGCCGGGCC CGGGCGGCGC TGCTGACGCT GCTGGCGCTG CCCGGCACGG TCTACCTGTA CCAGGGCGAC GAACTCGGCC TGCCGCAGGT CGACATCCCG CCCGAGGCCC GCCAGGACCC GGTCTGGGAA CGGTCCGGCC ACACCTCGCC CGGCCGGGAC GGCGCACGCG TGCCGCTGCC GTGGTCCGGC GACGCGCCTC CGTACGGGTT CAGCGCCGGG GCAGCCGAGC CGTGGCTCCC GCAACCGCCC GACTGGGCGA CGCTGACCGC GTCCGCCCAG TCCGTGGACC CGATGTCGAC CAGGGTGCTC GTCCGCGGCG CGCTGGCGCT ACGCCGCGCG CTCCCCTTCC TCGGCGGACC GGCCGGGCCG GGCAGCGCGG CCGAGCCGGC CGAGCCGGGG CACTCGGGCG AGCCGGGGCG GCCGGCAGGC ACGCGGCCCG GGTTCCGCTG GCGGGACGAT CTGCCCGCGG ACTGCCTGGC CTTCGACCGG ACGTCCGCCG CCGGCGCCCT GACCTGTGTG ATGGCCACGC GGAGCGAGAT ACGCCTGGAG ATCGCCGGCC GGCTGGTGCT GGCGAGCGGG CCAGTCGGCT ACGACGGCGC GACGCTCGTC CTGCCACCGG ACACCACCGC GTGGGTCATC CCCCGTTCGG GTTGA
|
Protein sequence | MDPVVAYRAG EPGQPGGRSG PAAPPAGTEE TDYSAWWHDA VMYEVYVRSF ADADGDGVGD IEGIRRRLPD LADLGVDGIW VSPFYRSPMA DHGYDVADHT DVDPLFGTLS DIDALLRDAH EAGLKVVVDL VPNHSSSAHP AFQAALAAGP DAPERDLYLF RDGRGPDGAL PPNNWISVFG GPAWTRVPDG QWYLHLFAPE QPDWNWQHPR VRAAHAEIIR FWLDRGVDGF RIDVSHGLVK DGELRDHPTG ALPTPETGFR EEIEPHAWDQ DGVHEIYREW RAIVDAHDRR DGRQRVLVGE TWVADPGRLA RYVRPDELHL TFTFSLLYAP FSAPAWRAAI DAARAATAAV CAPPTWVLAN HDVVRPVSRY GGGETGLRRA RAALLTLLAL PGTVYLYQGD ELGLPQVDIP PEARQDPVWE RSGHTSPGRD GARVPLPWSG DAPPYGFSAG AAEPWLPQPP DWATLTASAQ SVDPMSTRVL VRGALALRRA LPFLGGPAGP GSAAEPAEPG HSGEPGRPAG TRPGFRWRDD LPADCLAFDR TSAAGALTCV MATRSEIRLE IAGRLVLASG PVGYDGATLV LPPDTTAWVI PRSG
|
| |