Gene Information |
Locus tag | Franean1_3356 |
Symbol | |
ID | 5671727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3973682 |
End bp | 3975316 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242244 |
Product | cholesterol oxidase |
Protein accession | YP_001507664 |
Protein GI | 158315156 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
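The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length_bp = end_bp - start_bp + 1   # inclusive span on the replicon
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3             # includes the stop codon
    protein_length_aa = codons - 1           # stop codon encodes no residue
    return gene_length_bp, protein_length_aa


# Values taken from the Start bp and End bp rows of this record.
print(cds_lengths(3973682, 3975316))  # (1635, 544)
```

Under these assumptions the result agrees with the Gene Length (1635 bp) and Protein Length (544 aa) rows.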
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGACA ACGCTTCCAC ACCTCTGGAA TCCGGCCCGG GGCTGACCCG CCGGAGGATC CTCGGCACAG TGGCGCTTGG CGCGGCTGCG GCTGGTGGGC TGGCGGCCGG CCCTGGCAGC GGACGGGCAC GGGCCGCCAC AGCCCCGGCG GACGCGGCCG TCACCGAGCA GCGGGAGCGG GCCGTCGTCA TCGGCAGCGG ATTCGGCGGC GGCGTGACCG CGCTGCGGCT GGCACAGGCC GGAGTGCCCA CTCTTGTCCT GGAGCGTGGC CTGCGGTGGT CGACGGGGCC GAACGCGACG ACCTTCTGCC GCTGGGCGAA CATCGACAAC CGGTCCGCCT GGCTGACCGA CCACAGCACG ACCCCGGGCG TGGACAAGAC CTGGACGCCC TACACCGGAG TGATCGAGAG CCTGCCGGGC GACGGCATAA CGGTCAACTG CGGTGCCGGC GTCGGTGGCG GCTCGCTGGT GTACCACGGG ATGACACTGC AGCCGACCAG CCAGAACTTC GCGGCATCCA TCCCGGAGGC CGCGAGTCTC TACCGCGACC TGAACCGGTG GGCCTACCCC ACCGTCGCCA CCATGCTGGG CGTCTCCACG GTGCCGAACG ACGTGCTGAA CGCGGATCAG TACCGGTCGT CCCGGCTCTT TCTGGACGTG GCCGCCAACA GCGGCCTGGA GCCGTTCCGG GTTCCGCTGC CGGTCGACTG GAGCTACGTC CGGGGTGAGC TCACCGGTCA GTACGAGCCG ACCTACACCA CCAGCGACAT CGTCTTCGGC GTCAACAACG GCGGCAAGCA CTCGATCGAC GTCACCTACC TGGCCTCGGC CGAGGCCACG GGCCGGGTAC GGGTGAGCCC ACTGCACGTG GTGCGGGACG TCCAGATGGA CCCGGACGGC CGCTGGGTGC TCTCCGTCGA CCGCATCGAC ACCGGCGGCA CCGTGCAGGA GCGGAAGCGG ATCACCGCGG ACGCGGTGTT CCTCAACGCC GGTTCCGCCG GTACCACGCG GCTCCTGGTC AAGGCGCGAG CCAAGGGGCT GGTGCCCGAC CTCCCCGACG CCATCGGCAC GAAGTGGGGA AACAACGGTG ATCGGATCTA CGCCTGGGTC GGCATGAACG GTGATCCTGG CACCCGGCAG GGCGGCCCGG CGTGTGTGGG CGGCCGGGAC ACACAGGGGC CGATTCCGGC CACCGTCATC CATGCGGGTG CGCCCGCCGA CACCGGCGGC GTGAAGCTGA TGACGGTCGT CGGTTTCGGG ATCGTCGACG CCGCTGGCAC GTGGGCCTAC GACCCGAACA CGGATGACGC CAGGCTCACC TGGCCGGCGA CCGGTGACGC CGCGCTCCAG ACCCAGATCG CCGCCCGGAT GCAGGCGATC GTCGCCGCGG GTGGCGGGAT GATGATCGAC ACGAACGCCC AGGCGAACTC GACCTGGCAT GCCCTGGGCG GGGTGCCGAT GGGATCGGCG GTCGACCTGT ACGGCCGGAT CATCGGCCAC AAGGGCCTCT ACGTACTCGA CGGGTCGCGG ATCCCGGGCT CCACCGGTGC CTGTAACCCC TCCATGACGA TCGCGGCCCT GGCCGAGCAC AGCATGTCCA CGATCATCCG TGAGGACGTC GGACGTGTCT TCTGA
|
Protein sequence | MSDNASTPLE SGPGLTRRRI LGTVALGAAA AGGLAAGPGS GRARAATAPA DAAVTEQRER AVVIGSGFGG GVTALRLAQA GVPTLVLERG LRWSTGPNAT TFCRWANIDN RSAWLTDHST TPGVDKTWTP YTGVIESLPG DGITVNCGAG VGGGSLVYHG MTLQPTSQNF AASIPEAASL YRDLNRWAYP TVATMLGVST VPNDVLNADQ YRSSRLFLDV AANSGLEPFR VPLPVDWSYV RGELTGQYEP TYTTSDIVFG VNNGGKHSID VTYLASAEAT GRVRVSPLHV VRDVQMDPDG RWVLSVDRID TGGTVQERKR ITADAVFLNA GSAGTTRLLV KARAKGLVPD LPDAIGTKWG NNGDRIYAWV GMNGDPGTRQ GGPACVGGRD TQGPIPATVI HAGAPADTGG VKLMTVVGFG IVDAAGTWAY DPNTDDARLT WPATGDAALQ TQIAARMQAI VAAGGGMMID TNAQANSTWH ALGGVPMGSA VDLYGRIIGH KGLYVLDGSR IPGSTGACNP SMTIAALAEH SMSTIIREDV GRVF
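The relationship between the two sequences above can be illustrated by translating the gene sequence with NCBI translation table 11, the table named in the Gene Information section. This is a minimal sketch, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_content` are hypothetical, and the code assumes the sequence contains only A, C, G, and T. Table 11 shares its codon-to-amino-acid assignments with the standard code and differs only in which start codons are permitted; this gene begins with ATG, so alternative start codons are not handled here.

```python
# Codon table in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":        # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTCTGACA ACGCTTCCAC ACCTCTGGAA"))  # MSDNASTPLE
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.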