Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7138 |
Symbol | |
ID | 5675441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8718698 |
End bp | 8719738 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641245977 |
Product | D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding |
Protein accession | YP_001511365 |
Protein GI | 158318857 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1052] Lactate dehydrogenase and related dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.731191 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGCA GCGAACCCGC ACGAACAGCG GCCCGCCGCA CCCGGTCGCC GGCGCCCTCG TCCTCCTCAG CCGTCCCGGC AACGGGAATC ACGATCTATG GATGCGGGCA GGACGAGGCC GTTCTTTTCC GAGAGATGGC TCCCCGTTTC GGCGTACTGC CAACCATCAC AGACGTGGCG GTATCTGAAG CCAACATCGA ACTGGCACTC GGAAACCGAT GCATCAGCGT CGACCACAGG ACTCGGGTCA CGAGCTCCAC TCTTCTTGCA CTCAGCCAGG TCGGCGTGAC GTACATCTCC ACAAGAAGCA TCGGGTGCAA CCATATCGAC GTGAAATACG CTGCGGGCGT CGGCATCTCC GTCGGAAACG TCGCCTATTC GCCCGACAGC GTGGCCGACT ACACGCTGAT GCTGATGCTG ATGGTGGTGC GGAACGCAAA GTCCATCATC CGCCGCGCGG ATATTCATGA CTACAGACTG AATGATGTGC GCGGGAGGGA ACTACGCGAT CTGACCATCG GGGTGGTTGG AACAGGACGC ATCGGCGCGG CGGTCATGGA CAGGCTGCGG GGTTTCGGCT GCCGAACACT GGCCTATGAC GATCGCCCCG AGGCCGCCGC CGAATACGTT CCGCTCGACG AATTGCTGGA ACTGAGCGAC ATTGTGACGC TCCATACCCC CCTCAATGCG GATACACACC ACCTCCTAAA TCGTCGACGT ATCGAGAGGA TGAAGCGCGG CGCGTTCCTT GTCAATACCG GACGCGGCCC ACTCCTTGAT ACCGAGGCCC TTGTTCAGGC ATTGGAAAGC GGCAGACTGG GCGGTGCGGC GCTGGATGTC CTCGAAGGAG AGGAAGGAAT ATTCTACGCC GATCACAGGA ACAAGCCCAT CGAATGCGCA CCGCTGCTAC GGCTGCAAGA ACTGCCGAAT GTTCTGGTCA GTCCCCACAC CGCCTACTAC ACAGACCACG CCCTGCGTGA CACCGTTGAA AACTCCATCA CCAACTGCCT CGAATTCGAA AGCAGGATTC AGCATGGATA G
|
Protein sequence | MPRSEPARTA ARRTRSPAPS SSSAVPATGI TIYGCGQDEA VLFREMAPRF GVLPTITDVA VSEANIELAL GNRCISVDHR TRVTSSTLLA LSQVGVTYIS TRSIGCNHID VKYAAGVGIS VGNVAYSPDS VADYTLMLML MVVRNAKSII RRADIHDYRL NDVRGRELRD LTIGVVGTGR IGAAVMDRLR GFGCRTLAYD DRPEAAAEYV PLDELLELSD IVTLHTPLNA DTHHLLNRRR IERMKRGAFL VNTGRGPLLD TEALVQALES GRLGGAALDV LEGEEGIFYA DHRNKPIECA PLLRLQELPN VLVSPHTAYY TDHALRDTVE NSITNCLEFE SRIQHG
|
| |