Gene Information |
Locus tag | Franean1_0130 |
Symbol | |
ID | 5668555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 155425 |
End bp | 157335 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641239058 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_001504503 |
Protein GI | 158311995 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
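
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against one another. The following is a minimal Python sketch, assuming that the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. Neither assumption is stated in the record, although both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 155425
end_bp = 157335
listed_gene_length_bp = 1911
listed_protein_length_aa = 636

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_aa = codons - 1

print(gene_length_bp == listed_gene_length_bp)          # True
print(remainder)                                        # 0
print(expected_protein_aa == listed_protein_length_aa)  # True
```

The same check applies to any unspliced CDS record with these four fields. A nonzero remainder or a mismatch would suggest a partial gene, a frameshift, or a different coordinate convention.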
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.322371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.176249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCACA CACCGACGCG GCGTGGCGGG GGCAGCCCGT TCCCGCCGAT CGCCGAGTAC GGGTTCCTCT CCGACTGCGA GACGAACTGC CTCGTGGCGC CCAGCGGCAA CGTCGAGTGG ATGTGCGTGC CACGCCCGGA CGCGCCGAGC GTGTTCGGTG CCGTCCTCGA TCGCTCGGCC GGCGGGTTCC GCTTCGGACC GGACCGGACC TTCATTCCGG CCGGCCGGCG GTACCTGCCC GGCACGAACG TCCTTGAGAC GACCTGGCAG ACGCCCACGG GCTGGCTGAT CGTCACCGAC TGCCTGGTCG TGGGCCGCTG GCACCGGACG CACCGGCGCT CCAAGACGCA CCGGCGCACC CCGGGCGACT GGGACGCCGA CCACGTGCTG CTCCGGCTGG CCCGCTGCGA GCACGGCGAC GTCGACCTCA GCCTCGTCTG CGAGCCCAAC TTCGACTACG GCCGCAGCCC GGCGTCCTGG CGGTACGAGG ATGAGGACTA CTCGACCGGG ATCATCACCC ACGACTTCCC GGCCGGCGAC CCGGCAGCGG GCGTCGCGCT GCGGCTGCGC ACCGACCTGC GGCTCGGGTT CGACGGCCGC CGCGCGCTGG CCCGGACGAC GTTGCGGGAG GGCGACACCG CGTTCGTGGC GATGACCTGG CGGGCCGAGG ACCCGCTGCT GCCCGGTAAC TATCCGCAGG CCTGCGCCGC CGTCGACAGC ACCACCGAGT TCTGGCGGCA GTGGCTTTCG CGTGGGCGGT TCCCCGACCA TCCGTGGCGC CGGCACCTGC AGCGCAGCGC ACTGGCGCTC AAGGGCCTCA CCTACGCCCC GACCGGGGCG CTGCTCGCCG CCGCGACCAC CTCGCTGCCG GAGACGCCGC AGGGCGAGCG CAACTGGGAC TACCGCTACA GCTGGATCCG GGACTCCACG TTCGCCCTGT GGGGCCTGTA CACCCTCGGG CTCGACTACG AGGCCAACGA CTTCTTCTCC TTCATCGCGG ACGTCGCCGA GCACGCCGAC GACATCCAGG TGATGTACCG GGTCGGTGGC GAGCCGAAGA TCGACGAGGA GATTCTCGGG CATCTGTCCG GCTATGACGG CGCCGTCCCG GTGCGGGTCG GTAACGAGGC GGCGAAGCAG CGCCAGCATG ACGTTTGGGG GGCCGTTCTC GACTCGGTCT ACCTGCACAC CCGGTCGCGG GACTATCTCT CGGAGCGGCT GTGGCCGGTG CTGGTCCGGC TGGTGGAGGC GGCGGCCGCG CACTGGCGGG AGACGGACCG CGGCATGTGG GAGGTCCGGG GCGAGCCGCG GCATTTCACC TCGTCGAAGA TGTTCTGCTG GGTCGCGCTG GACCGGGGGC GCCGCCTCGC GCAGATGCGC GGTGACCTGC GAACCGCGGG CCGCTGGGAC GACATCGCCG ACGAGATCCA CGCCGACGTG CTCGCGAACG GCGTCGACCA CCGCGGTGTC TTCACCCAGT ACTACGGCTC GACGTCGCTG GACGCCTCGG TGCTGCTGAT GCCGCTGCTG GGTTTCCTGC CGTCGACGGA CGACCGGGTG AAGGCGACCG TGCTCGCCAT CGCCGACGAG CTGACGGTGG ACGGCCTGGT GCTGCGCTAC CGGACGGACG AGACCGATGA CGGGGTCGAG GGCGAGGAGG GCGCCTTCCT CATCTGCTCG TTCTGGCTCG TCTCCGCTCT GGTGGAGATC GGTGAGCTCA CCCGGGCCCG GCAGCTGTGC GAGCGGCTGC TGAGCCTGGC CAGCCCGCTG GACCTCTACG CCGAGGAGAT CGATCCGGCC 
GACGGCCGGC ACCTGGGCAA CTTCCCGCAG GCCTTCACCC ACCTGGCGCT GATCAACGCG GTCATGTACG TGATCCGGGC CGAGTCCGGG GAGTCCTTCA CCCGCTCCTA G
|
Protein sequence | MAHTPTRRGG GSPFPPIAEY GFLSDCETNC LVAPSGNVEW MCVPRPDAPS VFGAVLDRSA GGFRFGPDRT FIPAGRRYLP GTNVLETTWQ TPTGWLIVTD CLVVGRWHRT HRRSKTHRRT PGDWDADHVL LRLARCEHGD VDLSLVCEPN FDYGRSPASW RYEDEDYSTG IITHDFPAGD PAAGVALRLR TDLRLGFDGR RALARTTLRE GDTAFVAMTW RAEDPLLPGN YPQACAAVDS TTEFWRQWLS RGRFPDHPWR RHLQRSALAL KGLTYAPTGA LLAAATTSLP ETPQGERNWD YRYSWIRDST FALWGLYTLG LDYEANDFFS FIADVAEHAD DIQVMYRVGG EPKIDEEILG HLSGYDGAVP VRVGNEAAKQ RQHDVWGAVL DSVYLHTRSR DYLSERLWPV LVRLVEAAAA HWRETDRGMW EVRGEPRHFT SSKMFCWVAL DRGRRLAQMR GDLRTAGRWD DIADEIHADV LANGVDHRGV FTQYYGSTSL DASVLLMPLL GFLPSTDDRV KATVLAIADE LTVDGLVLRY RTDETDDGVE GEEGAFLICS FWLVSALVEI GELTRARQLC ERLLSLASPL DLYAEEIDPA DGRHLGNFPQ AFTHLALINA VMYVIRAESG ESFTRS