Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3678 |
Symbol | |
ID | 3905362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4411051 |
End bp | 4413069 |
Gene Length | 2019 bp |
Protein Length | 672 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637881004 |
Product | alpha amylase, catalytic region |
Protein accession | YP_482759 |
Protein GI | 86742359 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.763851 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGATCG GACGTTCAGT AGGCCGGGTT GTCGTCTCCG ATGTGACACC GACCGTCTCG TGTGGCCAGT GGCCGGCGCG GGCGGTCGCG GGCGAAATTC TCACCGTCGG TGCAACGGTG TTCCGCGAGG GTCATGACCT CATCGGCGCG AACGTCGTGC TGTCAGGTCC CGACGGGCAG GGAACTCCGT TCATCCGGAT GCGCTCCGCC GGCCCCGGTA CCGACCGTTA TGAGGCGGAG ATCACCGCCG GGACCGAAGG CCTGTGGGGA TATCGGGTCG AGGCGTGGGC CGATCCGGTC GCCACCTGGC GGCACGGTAT TGCCCTCAAG GTCGGCGCGG GCCAGAGCAC GGACGAACTC GCGGTGGACT TCGAGGACGG TGCGCGCCTG CTGCTGCGGG CCCTGCCCGC AGTCCCCGAA CCGCGCCGGG CGGAGATCGC TCTCGCCGTG GCCGCGCTGC GCGACGACGA CTGCACCGAC CCCCGCGACC GGATCGCCGC GGCCCTCGAT CCCCAGCTGG TGAGCCTGCT GGACGCCTGT CCGCTGCGCG AACTGGTGAC CCGCTCACCG CTGTACCGGC TGTGGGTGGA TCGTCGGCGC GCCCTCTACG GCAGCTGGTA CGAGATGTTC CCGCGCTCGG AGGGCGCGAG CCTCGACCCG CCCCGGTCCG GGACCTTCCT CACCGCGGCC GAACGCCTGC CCGCGGTCGC GGCGATGGGC TTCGACGTGG TGTACCTGCC GCCGATCCAT CCGATCGGCG AGGTCAACCG CAAGGGTCCC AACAACACCC TCACCCCCGG TCCGACGGAC CCCGGCTCGC CGTGGGCCAT CGGCAGCGAA CACGGCGGCC ACGACGCCGT GCATCCCGAC CTCGGCACGA TCGACGACTT CGACCTGTTC GTCGCCCGGG CACGCTCGCT GGGCATGGAG ATCGCGCTGG ACCTCGCCCT GCAGTGCGCG CCGGACCATC CATGGGCGAA GCATCACCCG GAGTGGTTCG TCGTGCGTAG TGACGGCTCC ATCGCCTACG CGGAGAATCC GCCGAAGAAG TACCAGGACA TCTATCCGCT GAACTTCGAC GCCGACCCGA CCGGGCTCTA TCAGGAGATC CTGCGCGTCG TCCGGTACTG GACTGCACAC GGAGTACGAA TCTTCCGTGT CGATAATCCG CATACAAAGC CCGTCGAGTT CTGGGAATGG CTCATCGCCC AGGTGAAGTC GACCGAACCA GATGTGCTCT TCCTCGCGGA GGCATTCACC CGGCCGGCGA TGATGCACAC GCTCGCCAAG GTCGGTTTCA CCCAGTCATA TACCTATTTC ACCTGGCGCA ACACGAAGTG GGAGCTCGAG AAGTACGCGC GCGAACTGGT GTCGGCCGCG CACTACATGC GGCCGAACTT CTTCGTCAAC ACCCCGGACA TCCTGCCGGA GTACCTGCAG CACGGCGGCC CGGCGGCGTT CCGGATCCGG GCGGTGCTCG CTGCGACGCT GTCACCGACC TGGGGCGTCT ACTCCGGGTA CGAGTTGCGC GAGAACACCC CGGTCCGACC GGGCAGCGAG GAGTACCTGG ACTCCGAGAA GTACCAGTAC CGGCCACGCG ACTGGGCCGC GGCGGAGCGT GCGGGCCAGT CGCTCGCGCC GTACCTGACC AGACTCAACC AGATCCGCCG TGCCCACCCC GCCCTGCAGT GGTTGCGCAA CCTGCACTTC CACCATGCCG ACGGGGACGA GATCATGGTC TTCTCCAAGC GGGTGGACTC CCTGCGGGCG GACGGCACGG ATCCCGGGGA CACGGCCGCC GCCGACACCG TGCTCATCGT CGTCAACCTC GACCCGCACG CTCCCCGGGA GACCACCGTG CGGCTCGACA TGCCGGCCCT CGGCCTCGGC TGGGAAGACT CCTTCGAGGT CACCGATGAG ATCACTGGTG CCACCTACGC GTGGGGCAAG CAGAACTACG TGCGGCTGGA CCCGGCGGTC GAGCCCGCGC ACGTCTTCGC TGTGCGGGCC CGGTCGTGA
|
Protein sequence | MMIGRSVGRV VVSDVTPTVS CGQWPARAVA GEILTVGATV FREGHDLIGA NVVLSGPDGQ GTPFIRMRSA GPGTDRYEAE ITAGTEGLWG YRVEAWADPV ATWRHGIALK VGAGQSTDEL AVDFEDGARL LLRALPAVPE PRRAEIALAV AALRDDDCTD PRDRIAAALD PQLVSLLDAC PLRELVTRSP LYRLWVDRRR ALYGSWYEMF PRSEGASLDP PRSGTFLTAA ERLPAVAAMG FDVVYLPPIH PIGEVNRKGP NNTLTPGPTD PGSPWAIGSE HGGHDAVHPD LGTIDDFDLF VARARSLGME IALDLALQCA PDHPWAKHHP EWFVVRSDGS IAYAENPPKK YQDIYPLNFD ADPTGLYQEI LRVVRYWTAH GVRIFRVDNP HTKPVEFWEW LIAQVKSTEP DVLFLAEAFT RPAMMHTLAK VGFTQSYTYF TWRNTKWELE KYARELVSAA HYMRPNFFVN TPDILPEYLQ HGGPAAFRIR AVLAATLSPT WGVYSGYELR ENTPVRPGSE EYLDSEKYQY RPRDWAAAER AGQSLAPYLT RLNQIRRAHP ALQWLRNLHF HHADGDEIMV FSKRVDSLRA DGTDPGDTAA ADTVLIVVNL DPHAPRETTV RLDMPALGLG WEDSFEVTDE ITGATYAWGK QNYVRLDPAV EPAHVFAVRA RS
|
| |