Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3901 |
Symbol | |
ID | 5672262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4665120 |
End bp | 4666745 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641242780 |
Product | cholesterol oxidase |
Protein accession | YP_001508197 |
Protein GI | 158315689 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0511249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAA ACACGCCCGA CGCCGTCGAA TCCACCCCCG GCCTCAACCG CAGGCACTTA ATCGGCACGG CGGCGATCGG CGCCGCCGCG GTCGGCGGTT TCACCGCGGG ACCGGGTGCC GTCCGGGCCC GGGCCGCCAC CCCCGTGCCG GCCACTGTGC AGCAGGAGCG GGCCGTTGTC ATCGGGAGCG GCTTCGGCGG CGGCGTGACC GCGCTGCGGC TGGCCCAGGT AGGCGTGTCG ACGCTGGTCC TGGAGCGCGG GCTGCGGTGG CCCACCGGGC CGAACGCAAC GACGTTCTGC CGCTTCGCCA ACATCGACAA CCGCTCCGCC TGGCTGACCG ACCACGCCAC GGTCGGCGGC GTGGTGAAGA CGTGGGAGCC CTACACCGGG GTGATCGAGA GCATTCCCGG CAACGGAATC ACGGTGAACT GCGGGGCGGC CGTCGGCGGC GGCTCCCTCA TGTACCACGG CATGACACTG AAGCCCTCGA AGGCGAACTT CGCCGCGTCG ATCCCGGTGG CCGCGAACCT CTACGACGAG CTGAACCTGT GGGCGTATCC GCTGGTGGCC AGCATGCTGG GAGTCTCCAC GATTCCCAAC GACATCCTGA ACAGCGACCC GTACAAGTCC TCCCGGCTCT TCCGGGACGT CGCTCCGGGC GCGGGCCTGG AGCCGTTCCA GGTTCCCCTG CCGATCGACT GGCAGTACGT GCGCGGTGAG CTGAACGGCC AGTACCAGCC GACCTACACC ACCAGCGACA TCGCCTTCGG GGTCAACAAC GGCGGTAAGC GCTCCATCGA CGTCACCTAC CTCAAGGCGG CGGAGGCGAC CGGCCGGGTC CGCGTCGCGA CGCTGCACGT CGTCCGCGAC ATCGCGCTCG ACGCGAACAA GAAGTGGGTA CTGACGGTCG ACCGCATCAA CACCGGCGGC ATCGTCCAGG AGACGAAGAC CATCGTCGCG GACGCGGTAT TCCTCAACGC GGGTTCCGCC GGCACGACCC GCCTGCTGGT GAAGTCCAAG GCGAAGGGGC TTATCCCCAA CCTGCCGGAC GCCGTCGGAA CCCAGTGGGG AAACAACGGC GACCGCATCT ACCTGTGGAA CGGCATGAAC GGCGACATCG GGACCCAGCA GGGTGGTCCC GCCTGCGTCG GCGGTCGCGA CACCACCAGC TCGATCCCGC TCACCATCAT CCACGCGGGC TCCCCCATTC CGAGTACCGC CGGCAAGCTG ATGACGGTCG TCGGCTTCGG GATCGTGAAC CCCGCCGGCA CCTGGGCGTA CGACTCCGCG AAGGACGACG CCGTCCTGAC CTGGCCGTCC AGCGGTGACG CCGCGCTGCA GGCGCTGATC GCCGCGCGCA TGCAGAAGAT CGCCCAGGTG GGCGGCGGCA TCATGATCGA CACGAACGCC CAGGCGAACT CGACCTGGCA CGCCCTGGGC GGCGTGCCCA TGGGGTCCGC GGTCGACCTC TACGGCCGGG TCATCGGCCA GAGTGGCCTC TACGTGCTCG ACGGCGCGCG GATCCCGGGC TCCACCGGCG CCTGCAACCC GTCCATGACG ATCGCGGCCC TCGCCGAGCA CAGCATGGCC AAGATCGTGC TCCAGGACGT CGGACGCGTC TTCTAG
|
Protein sequence | MPENTPDAVE STPGLNRRHL IGTAAIGAAA VGGFTAGPGA VRARAATPVP ATVQQERAVV IGSGFGGGVT ALRLAQVGVS TLVLERGLRW PTGPNATTFC RFANIDNRSA WLTDHATVGG VVKTWEPYTG VIESIPGNGI TVNCGAAVGG GSLMYHGMTL KPSKANFAAS IPVAANLYDE LNLWAYPLVA SMLGVSTIPN DILNSDPYKS SRLFRDVAPG AGLEPFQVPL PIDWQYVRGE LNGQYQPTYT TSDIAFGVNN GGKRSIDVTY LKAAEATGRV RVATLHVVRD IALDANKKWV LTVDRINTGG IVQETKTIVA DAVFLNAGSA GTTRLLVKSK AKGLIPNLPD AVGTQWGNNG DRIYLWNGMN GDIGTQQGGP ACVGGRDTTS SIPLTIIHAG SPIPSTAGKL MTVVGFGIVN PAGTWAYDSA KDDAVLTWPS SGDAALQALI AARMQKIAQV GGGIMIDTNA QANSTWHALG GVPMGSAVDL YGRVIGQSGL YVLDGARIPG STGACNPSMT IAALAEHSMA KIVLQDVGRV F
|
| |