Gene Information |
Locus tag | Franean1_5661 |
Symbol | |
ID | 5673988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6873295 |
End bp | 6874284 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244515 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001509918 |
Protein GI | 158317410 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | [TIGR01179] UDP-glucose-4-epimerase |
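
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the stop codon is counted in the gene length but not in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
start_bp, end_bp = 6873295, 6874284
length_bp = gene_length(start_bp, end_bp)

print(length_bp)                  # 990
print(protein_length(length_bp))  # 329
```

Both results agree with the Gene Length (990 bp) and Protein Length (329 aa) rows.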
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.894525 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCATCC TCGTCACCGG CGCGTCCGGG TTCGTCGGCG GCGTCACCGC CGACCTGCTC TCGGCCGCCG GCCACCAGGT GACCGCGCTC GTCCGGGACG CGACGGCCCG GACGAGGCTG TCGAGGGTGA TCGAGGTGGT CCAGGCCGAC CTGCTCGAAC CACGCCAGCT CGCCGCGGCG GGCGTCAGCC GCGGGTTCGA CGGGGTGTGC CACCTCGCCG CTCTGACCAG GGTGCGGGAG TCCCGCGAGA CGCCGCTGCG GTACTTCGCG GCGAACGTCA CGGGCACGAC CAACCTGCTG GCGGCCCTCG ACGCGGGCAC CCGGGCCACC GGGGTGGCCC CGCGGTTCGT CTTCGGCTCG AGCTGCGCCG TCTACGGGGA CACGGGTACC TCCCCTATCC CGGAGACACG CGCACCCGCG CCGACGAATC CCTACGGCGC CTCGAAACTC GCCGCGGAGC AGGCGGTGGC CTACCAGGCC GCCACCGGGC GGCTGGGCGC CGTCGTGCTG CGCTCGTTCA ACGTCGCGGG GGCGGTCGGC TCGCACGCCG ACCGCGACAG CAGCCGGATC ATCCCGGCCG CGCTCGGCGT CGCAACGGGC CGGCGCGACG CCTTCCGGGT GAACGGTGAC GGCGCGTCGA TCCGCGAGTA CGTCCACGTC GTCGACATGG CGCGGGCGTA CCTGACCGCG CTGCGGGCGA CCGTGCCGGG CCGCTGCACC GTCTACAACG TCGGCAGCGG CCTCGGCGTG AGCGTCACCG ACGTGCTGCG GACGGTGGAG AGCGTGACGG GCCGGGACGT GCCGCGGGTG ACCCTGCCCC CGGTGCCCGA ACCCAGAGCG CTCATCGCCG ACAGCCGCCG CATCCGGGCC GACCTGGGCT GGACCTCTCC GTCCTCGACC ATCGAGAAGA TCGTCACGGA TGCCTGGCGC TCGACGGCGG TGCCCGAGCC GGTCGCGGCG CGGCGCGGCG ACGTCCCGAT CGTCTCGTGA
|
Protein sequence | MRILVTGASG FVGGVTADLL SAAGHQVTAL VRDATARTRL SRVIEVVQAD LLEPRQLAAA GVSRGFDGVC HLAALTRVRE SRETPLRYFA ANVTGTTNLL AALDAGTRAT GVAPRFVFGS SCAVYGDTGT SPIPETRAPA PTNPYGASKL AAEQAVAYQA ATGRLGAVVL RSFNVAGAVG SHADRDSSRI IPAALGVATG RRDAFRVNGD GASIREYVHV VDMARAYLTA LRATVPGRCT VYNVGSGLGV SVTDVLRTVE SVTGRDVPRV TLPPVPEPRA LIADSRRIRA DLGWTSPSST IEKIVTDAWR STAVPEPVAA RRGDVPIVS
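
The gene begins with GTG, yet the protein begins with M. This is expected under translation table 11 (bacterial, archaeal and plant plastid code), where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence and also shows how a GC fraction can be computed. It is a hedged illustration, not the database's own pipeline: the helper names are hypothetical, and `START_CODONS` lists only a subset of the alternative start codons of table 11.

```python
# Codon table built from the usual NCBI ordering (T, C, A, G at each position).
# The amino-acid assignments of table 11 are the same as the standard code;
# the two differ in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for index in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiator codon is read as methionine
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
fragment = "GTGCGCATCC TCGTCACCGG CGCGTCCGGG"
print(translate_cds(fragment))  # MRILVTGASG
```

The output matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full gene sequence gives the value that the GC content row reports in rounded form.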