Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4426 |
Symbol | |
ID | 5672778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5287771 |
End bp | 5289510 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243295 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_001508711 |
Protein GI | 158316203 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGTT CCTCTTCACA CCGTTCGCTG CTCGACTTCT ACGCCGACCC CGACGCGGGC AGCTGGGATG AGGTCGGGTA CGTCCCCTAT CCGCTCGGGC CCCCGGCCGT CGACACACCG GTCGTGGCCC GCCTGCGGAC GCGGGCGTTG TCTGACCTGC GTGAACGTTA CGACGCGGTC GTTGTGGGAA GCGGAGCCGG TGGCGGTGTC GCCGCATTCG TGCTCGCGTC CGCGGGAGCA TCCGTGCTCG TCGTCGAACG CGGCCAGTGG TACGGCGCGG GCGACCTCGC TACCGACCAT ATCCGCAACC ACCGGTTCTT CCTCGGGGGC GACCTCGACA CACCCCCCGG GCACCCCCGG GCAGTCCTCG GCGACGACGG TGAGATCGCG GTCGAGTCCG AGGACGGCCG CTACCACCAC AACGCAATCA CCGTCGGAGG TGGCACCCGG TTCTTCGGCG CCCAGGCCTG GCGGTTCCGG CCCGAGGACT TCCGGATGGC CTCCCTGTAC GGCGTCCCAG AAGGATCCGC GCTCGCGGAC TGGCCGATCA CCTACGGTGA CCTCGAACCG TTCTACGACC AGGTGGAATG GGAACTCGGC GTGGCCGGCC GCCCGCACCC CGATGACGCG CACCGCAAAC GCGACTACCC GATGCCCCCC TTCCCGCCGA GCGCCCTGGC TGCCGCGCTC GCGGCCGGCG CCGACAGGCT CGGATGGCCG ACGGGACCAA CCCCGTTGCT GCTGAACACC CGACCCCGCG CTGGCCGGGC CGCATGTATC CGCTGCGGGA TGTGTGTCGG CTTCACCTGC CCCGTCGACG CCCGCAACGG CACCCACAGC ACCGTCCTGC CCCGCGCCGT CGACCTCGGC GCCGACCTGA CTGTTGGCGC GCAGGTCACG CGAGTATCCG CCGCCGGCGA CGTCGAGATC GCCGCCGGCG GCGCGTCACG CGTCATCCGC GCCGGAACGG TCGTGCTCGC CGGCGGCGCG GTCGAGACCG CCAGGCTGCT CCAGCTCAGC GGCCTGGGCA ACGACTGGGT CGGCGACTGC CTCCAAGGAC ACCTCTACGC AGGCGCGCTC GGCGTGTTCG ACGAGCCGGT CCACGACGGA CTCGGACCAG GCCCGTCCAT CGCGACCAGA CGCTTCGCCC ACGGAAACGA CAGCGTCGTC GGTGGAGGCC TGCTCGGGGA TGACTTCGTC AGGCTCCCCG TCATGCACTA CTTCATGACC CGCCTCATGG GACTTTCCCC CGACGTCAGC CAGGCGGACG TGCGTCGCGC CATGGCGGAG AGTTACCGCC ACACCGGCAT CGTCTCCGGG CCGGTCCAGG AAGTGACGAT TCGCGGCGCA CGCGTCCGGT TGGCGTCAGG CGTGACCGAC CACCTCGGCC TCCCTGTGGC CCGACTCGAA GGTTTCCATC ATCCCGAGGA CCTCCGAACC GTCGCGTTCC TCACCGACCG AGCCGAGGAA TGGCTACACG CATCCGGCGC TCGCCAAACC TGGCAGATAG GCCCGATGAA GGAAACGACA CTATCCGCGG GCCAGCACCA GGCAGGCACC GCGCGCATGT CCGACTCACC CCGCCACGGC GCAACCGACC CGTTCGGCAA AGTCTGGGGA ACAGATCGCG TATACGTCGC CGATGCAAGC CTCCACGTCA CCAATGGCGG CGCAAACCCC GTCCTCACCA TCATGGCCCT CGCCTGGCGA ACCGCAGCAC ACATCGCCGC CAGCGGATAA
|
Protein sequence | MNRSSSHRSL LDFYADPDAG SWDEVGYVPY PLGPPAVDTP VVARLRTRAL SDLRERYDAV VVGSGAGGGV AAFVLASAGA SVLVVERGQW YGAGDLATDH IRNHRFFLGG DLDTPPGHPR AVLGDDGEIA VESEDGRYHH NAITVGGGTR FFGAQAWRFR PEDFRMASLY GVPEGSALAD WPITYGDLEP FYDQVEWELG VAGRPHPDDA HRKRDYPMPP FPPSALAAAL AAGADRLGWP TGPTPLLLNT RPRAGRAACI RCGMCVGFTC PVDARNGTHS TVLPRAVDLG ADLTVGAQVT RVSAAGDVEI AAGGASRVIR AGTVVLAGGA VETARLLQLS GLGNDWVGDC LQGHLYAGAL GVFDEPVHDG LGPGPSIATR RFAHGNDSVV GGGLLGDDFV RLPVMHYFMT RLMGLSPDVS QADVRRAMAE SYRHTGIVSG PVQEVTIRGA RVRLASGVTD HLGLPVARLE GFHHPEDLRT VAFLTDRAEE WLHASGARQT WQIGPMKETT LSAGQHQAGT ARMSDSPRHG ATDPFGKVWG TDRVYVADAS LHVTNGGANP VLTIMALAWR TAAHIAASG
|
| |