Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1052 |
Symbol | |
ID | 5669466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1235451 |
End bp | 1237469 |
Gene Length | 2019 bp |
Protein Length | 672 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641239981 |
Product | alpha amylase catalytic region |
Protein accession | YP_001505414 |
Protein GI | 158312906 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.776114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGATCG GACGTTCAGT AGGCCGGGTT GTCATCACTG ACGCGCGCCC GGTCGTCTCC TGTGGCCAGT GGCCGTCGCG GGCGGTGGAG GGGGAAACCC TCACCGTCAG CGCGACGGTC TTCCGTGAGG GGCATGACCT CATCGGGGCC AATGTCGTTC TTTCCGGCCC CGACGGCCAG GGAGTTCCGT TTACGCGGAT GAAACTCGCC GGCGCGGGCA CGGACCGCTA CGAGGCCGAT GTCGTCATGG GCCGTGAGGG CCTGTGGGGC TATCGCGTGG AGGCCTGGGC TGATCCGCTG GCCACCTGGC GGCACGGCAT CGAGCTCAAG GTCGGTGCCG GCCAGACGGT GGACGAGCTG GCGGTCGACT TCGAGGACGG CGCGCGGCTG CTGCTGCGCG CGCTGCCCGG AGTGCCCGAG CCCCGCCGGG TCGACATCGC GCGGGCCGTC GCGGTCCTGC GCGACGACGA CGTGACGGAT CCGCACGAGC GCATCGCCGT CGCCTGCTCC CCGGAGCTGG CCGGCCTGCT CGACGAGCAC CCGCTGCGCG AGCTCGTGAC CCGCACCCCG TTGTACCGGG TGTGGGTGGA CCGCGAGCGG GCGCTCTACG GCAGCTGGTA CGAGCTGTTC CCCCGGTCCG AGGGCGCGAG CCTCGACCCA CCGCGATCGG GCACCTTCCT CACCGCCGCG GAACGACTGC CGGCCGTCGC CGCGATGGGC TTCGACGTGG TCTATCTGCC ACCGATCCAC CCGATCGGCG AGATCAACCG CAAGGGCCCG AACAACACCC TCACGCCCGG CCCGGAGGAC CCCGGTTCAC CGTGGGCCAT CGGCAGCTCG CAGGGCGGCC ACGACGCCGT CCACCCGGAT CTGGGGACCC TGGACGACTT CGACCTCTTC GTCGCGCGGG CCCGCTCGCT CGGCATGGAG GTCGCGCTCG ACCTGGCGTT GCAGTGCGCG CCGGACCACC CCTGGGCGAA GCACCATCCG GAGTGGTTCG TCGTGCGCAG CGACGGCTCG ATCGCGTACG CGGAGAATCC GCCGAAGAAG TACCAGGACA TCTATCCCCT GAACTTCGAC GCCGACCCGA TCGGTCTATA TCACGAGATT CTGCGGGTGG TCCGGTTCTG GACGGCCCGC GGCGTCCGGA TCTTCCGGGT GGACAATCCG CACACGAAGC CGGTCGAGTT CTGGGAGTGG CTCATCGGCC AGGTCAAGGC GACCGAGCCG GACGTGCTGT TCCTGGCCGA GGCGTTCACC CGCCCGGCGA TGATGCACAC GCTCGCCAAG ATTGGCTTCA CCCAGTCCTA CACCTACTTC ACCTGGCGCA ACGAGCGCCG TGAGCTGGAG GAGTACGCGC AGGAGCTGGT CGACGCGGCG CACTACATGC GGCCCAACTT CTTCGTCAAC ACCCCGGACA TCCTGCCCGG GTTCCTGCAG ACGGGCGGGC CGGCGGCGTT CCGCATCCGG GCGGTGCTCG CCTCGATGCT CTCCCCGACC TGGGGAGTGT ACGCGGGGTA CGAGCTGTAC GAGAACTCCC CCGTGCGGGC CGGGAGCGAG GAGTACCTCG ACTCGGAGAA ATACCAGTAC AAGCCGCGGG ACTGGGCCGG CGCGGAACGC GCCGGGGCCT CGCTGGCGCC CTACCTCACC CGGCTCAACC AGATCCGGCG CGACCACCCG GCCCTGCACT GGATGCGCAA CCTGCACATC CACGAGTCCG CGACGCCCGA GATCACAGTC TTCTCCAAAC GGCACACCAC GGCCCGGGCG GGCGGACCCG CCCTCGGCCG CCTCCGCCCC GACGACGACC TCGTCATCGT CGTCGTCAAC CTTGACCCGC ATTCGGCCCG GGAGACCACC GTGCGACTGG ACATGCCCGC TCTCGGGCTT GACTGGGGCG ACCGCTTCGA AGTGCACGAC GAGATGACCG GCGTCACCTA CCAGTGGGGC CGTGAGAACT ACGTCCGCCT GGAGCCGACC GAGCCCGCCC ACATCCTGAC CGCGCGGCGC CTGCCGTGA
|
Protein sequence | MMIGRSVGRV VITDARPVVS CGQWPSRAVE GETLTVSATV FREGHDLIGA NVVLSGPDGQ GVPFTRMKLA GAGTDRYEAD VVMGREGLWG YRVEAWADPL ATWRHGIELK VGAGQTVDEL AVDFEDGARL LLRALPGVPE PRRVDIARAV AVLRDDDVTD PHERIAVACS PELAGLLDEH PLRELVTRTP LYRVWVDRER ALYGSWYELF PRSEGASLDP PRSGTFLTAA ERLPAVAAMG FDVVYLPPIH PIGEINRKGP NNTLTPGPED PGSPWAIGSS QGGHDAVHPD LGTLDDFDLF VARARSLGME VALDLALQCA PDHPWAKHHP EWFVVRSDGS IAYAENPPKK YQDIYPLNFD ADPIGLYHEI LRVVRFWTAR GVRIFRVDNP HTKPVEFWEW LIGQVKATEP DVLFLAEAFT RPAMMHTLAK IGFTQSYTYF TWRNERRELE EYAQELVDAA HYMRPNFFVN TPDILPGFLQ TGGPAAFRIR AVLASMLSPT WGVYAGYELY ENSPVRAGSE EYLDSEKYQY KPRDWAGAER AGASLAPYLT RLNQIRRDHP ALHWMRNLHI HESATPEITV FSKRHTTARA GGPALGRLRP DDDLVIVVVN LDPHSARETT VRLDMPALGL DWGDRFEVHD EMTGVTYQWG RENYVRLEPT EPAHILTARR LP
|
| |