Gene Information |
Locus tag | Francci3_4442 |
Symbol | |
ID | 3907418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 5308495 |
End bp | 5310402 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637881774 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_483517 |
Protein GI | 86743117 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
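The coordinate and length fields above are mutually consistent. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
START_BP = 5308495
END_BP = 5310402


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1908 635"
```

Both values agree with the Gene Length (1908 bp) and Protein Length (635 aa) rows.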
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.807083 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.234797 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCACATT CACCGGTGCG GCGCAGCGGG GGCAGTCCCT TCCCGCCGAT CGCCGAGTAC GGATTCCTGT CCGACTGTGA GACGTCCTGT CTCGTTGCCC CCAGCGGCAA CGTCGAGTGG ATGTGTGTGC CCCGGCCCGA TGCCCCGAGT GTCTTCGGGT CGGTGCTGGA CCGGTCGGCC GGGGGCTTCC GTTTCGGTCC CGAGCGTACG CAGATCCCGG CCGGCCGGCG CTACCTCCCC GGGACGAACA TTCTCGAGAC CACCTGGCAG ACGCCGAACG GCTGGCTCAT CGTCACCGAT TGCCTGGTGG TCGGGCGCTG GCATCGCACC CACAAGCGTT CGAACACGCA CCGTCGGACG CCGAGCGACT GGGACGCCGA CCACGTGCTG CTCCGGCTGG CTCGCTGTGA GCACGGCTCG GTGGATCTCA GCCTGGTGTG CGAGCCGAAC TTCGACTACG GCCGTGAGCC CGAGTCCTGG CAGTACGAGG GAGAGGATTA CTCCTCCGGC GTGATCAGCC ATGCGGGCAC CAACGTCACG CTGCGGCTGC GTACCGACCT GCGGCTCGGT TTCGACGGCC GGCGGGCACT GGCTCGCACC ACGCTGCGGG AGGGGGACAC CGCGTTCGTC GCGCTCACCT GGCGCCCGGA GGAGCCCCTG CTTCCGGACA CCTACCTGCA GGCGTGCGTC GCCGTCGACC GGACAACGGA GTTCTGGCGC CAGTGGCTGT CGCGGGGAAT GTTTCCCGAC CATCCCTGGC GGCGCCATCT GCAGCGCAGC GCCCTCGCGC TGAAAGGACT GACCTACGCG CCGACCGGTG CTCTGCTCGC CGCCTCGACC ACCTCGTTGC CGGAGACCCC GCGCGGCGAA CGGAACTGGG ACTACCGATA CAGCTGGATC CGCGATTCCA CCTTCGCGCT GTGGGGCCTG TACACCCTCG GGCTGGACTA CGAGGCCAAC GATTTCTTCT CGTTCATCGC CGACGTCGCC GAGAACGATG ACGACGACAT CCAGGTGATG TACCGGGTGA GCGGCGAGCC GAAGATCGAC GAGGAGGTCC TCGGTCACCT GTCCGGCTAC GAGGGAGCCT ACCCGGTACG GATCGGCAAC GCAGCGGCAC TGCAACGTCA ACACGACGTC TGGGGTGTCG TTCTCGACTC CGTGTACCTG CACACCAAGT CCCGTGACTA CCTCTCCGAA CGGCTCTGGC CGGTGCTCGT GCGCCTCGTC GAGGCGGCGG CCACGCACTG GCGGGAGCCG GACCGCGGGA TGTGGGAGGT GCGCGGGGCG CCGCAGCACT TCACCGTGTC CAAGATGATG TGCTGGGTTG CCCTGGACCG GGGTCGGCGG CTGGCGCAGA TGCGGGGGGA CGCCAAGACG GCGGCGCGCT GGCGGGCCGT GGCGGAGGAG ATCCACGCCG AGGTGTGCGA GAAGGGTGTG GATCATCGCG GTGTGTTTAC CCAGTACTAC GGATCGAAGG CGCTCGACGC CTCACTGCTC CTGATTCCGC TGCTCGGGTT CCTGCCGGCG GCCGACGAAC GCGTGAAGGC CACCGTGCTC GCGATCGCCG ACGAGCTGAC CGTTGACGGG CTGGTCCTGC GCTACCGCAC GGAGGAGACC GACGACGGGG TCTCCGGCAC CGAGGGCGCC TTCCTGATCT GCTCGTTCTG GCTGGTCTCC GCCCTGGTGG AGATCGGCGA GCTGACCCGT GCCCGGCAGC TGTGTGAACG GCTGCTGAGC CTCGCCAGCC CGCTGGATCT CTATGCCGAG GAGATCGATC CGGTCAGTGG CCGCCATCTC 
GGAAACTTCC CCCAGGCGTT CACCCATCTG GCCCTGATCA ACGCGGTGAT GTACGTGATC AGGGCCGAGG ACGCCGAGGC CTACGCCCGG CCGTCCCCCT CGACCTGA
Protein sequence | MAHSPVRRSG GSPFPPIAEY GFLSDCETSC LVAPSGNVEW MCVPRPDAPS VFGSVLDRSA GGFRFGPERT QIPAGRRYLP GTNILETTWQ TPNGWLIVTD CLVVGRWHRT HKRSNTHRRT PSDWDADHVL LRLARCEHGS VDLSLVCEPN FDYGREPESW QYEGEDYSSG VISHAGTNVT LRLRTDLRLG FDGRRALART TLREGDTAFV ALTWRPEEPL LPDTYLQACV AVDRTTEFWR QWLSRGMFPD HPWRRHLQRS ALALKGLTYA PTGALLAAST TSLPETPRGE RNWDYRYSWI RDSTFALWGL YTLGLDYEAN DFFSFIADVA ENDDDDIQVM YRVSGEPKID EEVLGHLSGY EGAYPVRIGN AAALQRQHDV WGVVLDSVYL HTKSRDYLSE RLWPVLVRLV EAAATHWREP DRGMWEVRGA PQHFTVSKMM CWVALDRGRR LAQMRGDAKT AARWRAVAEE IHAEVCEKGV DHRGVFTQYY GSKALDASLL LIPLLGFLPA ADERVKATVL AIADELTVDG LVLRYRTEET DDGVSGTEGA FLICSFWLVS ALVEIGELTR ARQLCERLLS LASPLDLYAE EIDPVSGRHL GNFPQAFTHL ALINAVMYVI RAEDAEAYAR PSPST
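The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA) even though the gene lies on the minus strand, so it can be translated directly. The sketch below shows one way to strip the 10-base grouping, estimate GC content, and translate codons. It assumes the amino acid assignments of the standard code, which translation table 11 shares (the two tables differ only in permitted start codons); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; drop trailing stop codons."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First and last blocks of the gene sequence above.
print(translate("ATGGCACATT CACCGGTGCG"))  # prints "MAHSPV"
print(translate("CCGTCCCCCT CGACCTGA"))    # prints "PSPST"
```

The two fragments reproduce the first six and last five residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content row.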