Gene Information |
Locus tag | Franean1_4612 |
Symbol | |
ID | 5672957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5496797 |
End bp | 5497819 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243473 |
Product | aldo/keto reductase |
Protein accession | YP_001508889 |
Protein GI | 158316381 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
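The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5496797, 5497819)
print(length, protein_length_aa(length))
```

For this record the two results agree with the Gene Length (1023 bp) and Protein Length (340 aa) fields.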
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.593922 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTACC GCGTACTTGG CCGCACCGGC GTGCGGGTCT CCCCGCTCTG CCTGGGTGCG ATGATGTTCG GGACCTGGGG CAACCAGGAC CATGACGACT CAATAAAGAT CATTCACAGG GCGTTGGACG CCGGCGTCAA CTTCGTCGAC ACCGCGGATG TCTACTCCGC CGGCGAGTCC GAGGAGATCG TCGGTAAGGC GCTGGCCGGC CGCCGCGACG ACGTCGTGCT CGCAACCAAG CTGTACATGC CGATGGGACC GGGGCCCAAC CGGCGCGGCC TGTCACGGCG CTGGATCGTC ACCGAGGTGG AGAACAGCCT GCGGCGGCTC GGCACCGACT GGATCGACCT CTATCAGGTG CATCGTCCCG ACCCGTCGAC CGACATCGAC GAGACGCTCG GCGCGCTCAC CGACCTCGTC CGGGCCGGCA AGATCCGCTA CTTCGGCAGC TCGACGTTCC CGGCGCACGA GGTGGTCGAG GCACAGTGGG TGGCCGAGCG CCGCAACCGC GAGCGGTTCG TCACCGAGCA GCCGCCGTAC TCGCTGCTCG TCCGTGGGAT CGAGGCCGAC CTGCTGCCCG TGGCGCAGAA GTACGGACTC GGGGTGCTGC CGTGGAGCCC GCTGGCCGGC GGCTTCCTGT CCGGGCGCCA CACCCGCGAC GGCGGCGAGG TCACGAGCAC CCGGATGTCC CGGGTGCCGA ACCGGTTCGA TCTGTCCGTG CCCGCGAACC AGCGCAAGGT CGAGGCCGCG ATCGCGTTCG CCGACCTCGC CGCCGAGGTG GGCGTCACGC TGATCGAGCT GGCGCTGGCG TTCGTGCTGC GCCACCCGGC GGTGACGTCG GCGATCATCG GCCCGCGCAC GATGGAGCAC CTGGAAAGCC AGCTCACCGG CGGCGCCGTC ACCCTGGACG AGGCGACGCT GGACCGGATC GACGAGATCG TCCCGCCTGG CGTCAACGTC AACCCGGAGG ACGCCGGGTA TGTCCCGCCG TCCCTGGCCC AGCCCGCGCT CCGCCGGCGC TGA
|
Protein sequence | MDYRVLGRTG VRVSPLCLGA MMFGTWGNQD HDDSIKIIHR ALDAGVNFVD TADVYSAGES EEIVGKALAG RRDDVVLATK LYMPMGPGPN RRGLSRRWIV TEVENSLRRL GTDWIDLYQV HRPDPSTDID ETLGALTDLV RAGKIRYFGS STFPAHEVVE AQWVAERRNR ERFVTEQPPY SLLVRGIEAD LLPVAQKYGL GVLPWSPLAG GFLSGRHTRD GGEVTSTRMS RVPNRFDLSV PANQRKVEAA IAFADLAAEV GVTLIELALA FVLRHPAVTS AIIGPRTMEH LESQLTGGAV TLDEATLDRI DEIVPPGVNV NPEDAGYVPP SLAQPALRRR
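The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a blocked string might be cleaned up and sanity-checked before further use is shown here. The helper names are hypothetical, and the stop-codon set (TAA, TAG, TGA) is the one used by translation table 11. No start-codon check is made, since table 11 permits alternative start codons.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the display whitespace from a blocked sequence string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in stop_codons


# The first two display blocks of the gene sequence, used as a short demo.
demo = clean_sequence("ATGGACTACC GCGTACTTGG")
print(len(demo), gc_fraction(demo))
```

Applied to the full gene sequence, `gc_fraction` can be compared against the GC content field, and `ends_with_stop` against the CDS annotation.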