Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6327 |
Symbol | |
ID | 5674646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7684621 |
End bp | 7685643 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641245180 |
Product | aldo/keto reductase |
Protein accession | YP_001510575 |
Protein GI | 158318067 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.563142 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAGCC GACGTGGGGT GGGGCGGTCG GGGCTGCGGT TCAGCTCGCT GAGCCTCGGC GGTATGACCT TCGGCAGCAA CGAGGCGTTC GGCGCCATCG GGTCGGATCC GACCGAGGCG GCCCGCGTCG TCGCGCTCGC TCTGGACGCT GGCATCGACA CGTTCGACAC CGCCAACGTC TACGGCGAGA GCGAGCAGCT GCTCGGACGG CTGCTCCGGC GGCGGCGTGA CGATGTCGTG ATCTGCACGA AGGCGCGGTT CCCGACCTCC GGACACGTCG GTGACGGACT GCCGCCCTCG AGCAGCTACG GCCTGTCGCG CGCGGCGGTC GTCCGGTCCG TCGAGGCGTC GCTGCGGCGC CTCGACACCG ATCGCATCGA CGTCCTCTGG CTGCACATGC AGGACAGGTC CGTCCCAATC GAGGAGACTC TGTCCGCCAC CGACCTGCTG GTCCGCCAGG GCAAGGTGCG TTATCTCGGG TTGTCGAACT TCATGGCGTA CCGGGTGGCC GAGGCTGTCC TCACCGCGCG GGCGCGGGCA CTGGAACCAC CGGTGGCCCT GCAGGTGCCC TGGTCGGCGG TGAGCCGGGA CGTCGAGCGG GAGCTCGTCC CGGCGGCCCG TCACCTCGGG CTCGGGGTGG CCGTCTACAG CCCGCTGGCC CGCGGTTTCC TCAGCGGCAA GTACGACCGA GGCGCAGAGC CGGTGGTGGG CAGCCGGCTG GGGGAGTGGC GCGACGAGTT CGCCCGCTAC GACTCCGACC GCAACTGGGC CGTCCTGGCC GCGCTGCGGG AGACAGCGCG CGCGCATCAG GTGCCGGTGA GCGCGGTCTC CCTCGCCTGG CTGCTGGCCC GGCGTCCGGT GGTCAGCGTC GTGATCGGGG CACGCACCGA GGAGCAGCTA CGGGAGAACC TGATGGCGGC GGACGTGCGG CTCACCGCCG AGGAGATCGC GCGACTGGAC AAGGTGTCGG AACCTGACTG GGGCTATCCG GAGTCCTTCA TCGCGCGCTT CGAGCCGTGG TGA
|
Protein sequence | MISRRGVGRS GLRFSSLSLG GMTFGSNEAF GAIGSDPTEA ARVVALALDA GIDTFDTANV YGESEQLLGR LLRRRRDDVV ICTKARFPTS GHVGDGLPPS SSYGLSRAAV VRSVEASLRR LDTDRIDVLW LHMQDRSVPI EETLSATDLL VRQGKVRYLG LSNFMAYRVA EAVLTARARA LEPPVALQVP WSAVSRDVER ELVPAARHLG LGVAVYSPLA RGFLSGKYDR GAEPVVGSRL GEWRDEFARY DSDRNWAVLA ALRETARAHQ VPVSAVSLAW LLARRPVVSV VIGARTEEQL RENLMAADVR LTAEEIARLD KVSEPDWGYP ESFIARFEPW
|
| |