Gene Information |
Locus tag | Francci3_1889 |
Symbol | |
ID | 3906838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2221126 |
End bp | 2223012 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637879227 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_480994 |
Protein GI | 86740594 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
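The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; the function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2221126, 2223012)
print(length, protein_length(length))  # 1887 628
```

Both values match the Gene Length (1887 bp) and Protein Length (628 aa) rows of the record.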
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.510271 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCATCCC TGATCGAGGA CTACGCGCTG ATCGGCGACA CCCACTCCGC GGCGCTCGTG TCGCGCACCG GGTCGATCGA CTGGCTGTGC CTGCCCCGAT TCGACTCGCC GTCGTGCTTC GCCGCCCTGC TCGGCGACTC CGAGGCGGGC CACTGGAAGA TCGCTCCCGT CGAGCCGGTC CTGGGCGTCA GCCGGCGCTA CCGGGGCGAC ACGCTGGTCC TGGAGACGGA CATGACGACC GCGTCCGGCA CCGTGCGCAT CGTCGACGCG ATGTTTCCCC GCGCAGGCAC GCACACCGTG CTCCGCCTGG TCGAATGCCT CGAAGGCGCG GTCCACCTGC GATCGGAGAC CCGGTTTCGC TTCGACTACG GCTCCATCGT GCCGTGGGTC CGCCGGGTGG ACGAGCACAC CATGTCGGCG GTGGCCGGGC CGGATGCGGT CACCCTGCGG ACGACCGCCC CGATGGAGGG GCACGACATG GCCACCTACG CCGACTTCCA CGTCGCCGCC GGCCAGTCGG TACCGTTCTC ATTGACCTGG ACCCCCTCCC ATCAGACTCC CCCGCCGTCC CACGACGTCC GGCGCATGAT CACCCTCACC GAGGCGTGGT GGTCGGACTG GATGGCGGGC TGCACCTACG ACGGCCAGTG GCAGCCGGCC GTCCGCCGGT CGCTGATCAC CCTCAAGGCG CTCACCTACG CCCCGACCGG AGGGATCGTC GCCGCCGTCA CGACCTCGTT GCCGGAGCAC ATCGGCGGCG TGCGCAACTG GGACTACCGG TACTGCTGGC TTCGGGACGC GACGATCACG CTGCTCGCCC TGCTCGACGC CGGGTTCACC AGCGAGGCGA CCGCATGGCG GGAGTGGCTG CTGCGCGCGG TCGCCGGTGA CCCCTCCCGG GTACAGATCA TGTACGGCGT TGCCGGGGAA CGCCGGCTGC CCGAATACGA GATCCCGTGG CTACCGGGGT ACGAGAACTC CGCCCCGGTG CGGGTCGGCA ACGCCGCCGT CGACCAGTTC CAGCTCGACG TCTACGGCGA GGTCCTCGAC GCCCTGCATG TCGCGCGGGT CGCGGTCGCC AACCGTCGCC AGACCGTGCC TGGGCTGGCG CTCCCCGGCG GGCAGACCAT CACCGAGAGC CATGCGGACG ACTCCTGGCC GCTGCAGACC AAGCTGATGG ACTTCCTCGA GACCGGCTGG CGGAAGACCG ACGAGGGCAT CTGGGAGGTG CGCGGCCCCC GCCGCCACTT CGTCCACTCG AAGGTGATGG CCTGGGTCGC GGCCGACCGG GCGGTACGCG GGATCGTTGA GTCCCGGCTA CCGGGCCCTG TCGACCGCTG GTCGGCGCTG CGGGACGAGA TCCACGCGGA GGTCTGTACC CGTGGGTTCG ACTCCGAACG CAACACCTTC ACCCAGTTCT ACGGCTCCAA GGAACTCGAC GCGGCGCTGC TGTACATGCC GCTGGTGGGG TTCCTGCCCG CCACCGACCC CCGCGCCGTG GGAACAGTCG CCGCCATCGA GCGGGAGCTG ATGGAGGACG GGTTCGTCCT GCGGTATCCG ACGGCCGAGG ACGGCGCGGT CGACGGACTG CCCGCGGGCG AGGGCGCCTT CCTGGCCTGC ACCTTCTGGT TGGCCGACAA CTACGCCCTG TCCGGGCGGG TCCACGAGGC TCAGGAACTG TTCGAACGCC TGCTGTCGTT GCGTAACGAC GTCGGGCTCC TCGCGGAGGA GTACGACCCC AGGCTGGGCC GGATGACGGG CAACTTCCCG CAGGCGTTCA GCCACGTCCC CCTGGTCAAC 
ACCGCGCGGA CGCTCACCGA CGCGCTGCGC GGCACACCGC GCTCGCGCAC CGACCGGGCC CACCCGCCCG GCCACTTCTT CGGCTGA
|
Protein sequence | MPSLIEDYAL IGDTHSAALV SRTGSIDWLC LPRFDSPSCF AALLGDSEAG HWKIAPVEPV LGVSRRYRGD TLVLETDMTT ASGTVRIVDA MFPRAGTHTV LRLVECLEGA VHLRSETRFR FDYGSIVPWV RRVDEHTMSA VAGPDAVTLR TTAPMEGHDM ATYADFHVAA GQSVPFSLTW TPSHQTPPPS HDVRRMITLT EAWWSDWMAG CTYDGQWQPA VRRSLITLKA LTYAPTGGIV AAVTTSLPEH IGGVRNWDYR YCWLRDATIT LLALLDAGFT SEATAWREWL LRAVAGDPSR VQIMYGVAGE RRLPEYEIPW LPGYENSAPV RVGNAAVDQF QLDVYGEVLD ALHVARVAVA NRRQTVPGLA LPGGQTITES HADDSWPLQT KLMDFLETGW RKTDEGIWEV RGPRRHFVHS KVMAWVAADR AVRGIVESRL PGPVDRWSAL RDEIHAEVCT RGFDSERNTF TQFYGSKELD AALLYMPLVG FLPATDPRAV GTVAAIEREL MEDGFVLRYP TAEDGAVDGL PAGEGAFLAC TFWLADNYAL SGRVHEAQEL FERLLSLRND VGLLAEEYDP RLGRMTGNFP QAFSHVPLVN TARTLTDALR GTPRSRTDRA HPPGHFFG
|
| |
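The gene sequence starts with GTG while the protein sequence starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine at the first position. A minimal sketch of that translation, plus a GC-fraction helper, follows. It assumes the sequence is shown in coding orientation (it begins with a start codon and ends with TGA even though the strand is "-"), and the helper names are hypothetical rather than part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in the first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First ten codons of the gene sequence from this record.
print(translate("GTGCCATCCC TGATCGAGGA CTACGCGCTG"))  # MPSLIEDYAL
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_fraction` on the same input can be compared against the GC content row.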