Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0454 |
Symbol | |
ID | 5668876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 536845 |
End bp | 537933 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641239386 |
Product | oxidoreductase domain-containing protein |
Protein accession | YP_001504824 |
Protein GI | 158312316 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTCC GCTGGGCGAT CGCGGGCACC GGCGCCATCG CACACCAGTT CGCCGCCGCG CTGGGCCGCT TGCCCGACGC CGAGCTCGTG GCGGTCGGCT CCCGCCGCCA GCAGACGGCT GACGCGTTCG GCGAACGGTT CGGCATCCCG CGCCGGCGCA GGTACAGCTC ATACGAGCGG CTCGCCGCCG ACGACGGTGT CGATGTCGTC TACGTGGCCT CCCCGCACTC CCACCATCAC CGTCACACCC TGCTGTTCCT GAGCGCGGGG CGGGGTGTGC TGTGCGAGAA GCCGTTCGCG CTTGACGCCG AGCAGGCCGC CGAGATGGTG GTTGCCGCCC AAACCCATGG GCAGTTCCTG ATGGAAGCCA TGTGGAGCCG CTTCCTGCCT GCCTACGTCA AGATCCGGGA GTTGGTCGCC GCAGGCGCCA TCGGCACCGT CCTGGCCGTC GAGGGTGACT TCGGATTCCC CCGCCCGGTG GATCCCGCCA ACCGGGTGTT CGATCTCGCC CAGGGCGGCG GCGCGCTGCT CGACCTGGGC GTCTATCCGG TGTCGCTGGC CAGCATGCTG CTGGGCGAAC CCGACCGGGT GGTGGCACTC GGCCAGCTCG GCGAGACCAG CGTCGACGAG CAGGTCGCCG TCCTGATGGG CTACCACACG GGCGCGGTGG CTGTCGCGAA GGCATCCCTG CGCGCGAGTC TCGCCTGCAC CGGCCGGGTC TCCGGCACGG AGGGCAGCAT CGAACTCGCC ACGTTCATGC ACTGCCCGGA CAACCTGACC GTCCGACGGA AGTCCGGCAC CCAACGGCTA TACCTGCCCG CTGACAGTGA CGTCACTGCC ACCGACGCCA CTGCCACCGC CGGTGATATC GACGGTGCCG ACCGCCGGAA CCATCGGGAC CGGGCGGCCG GCGGCGGGCT ACATCACCAA ATCCGTCACG TGCACTTCCG GTTACAGGCC GGCCACCTCG ACAGCGACAT CATGTCCCAG GCCGAATCCG TGTCGGTGAT GCGGACACTC GATGCAGCCC GGGCCCAGAT CGGTCTGCGC TACCTGGCCA CCCTGATGGA TGGTAGCCGG CAGAGTTGA
|
Protein sequence | MSFRWAIAGT GAIAHQFAAA LGRLPDAELV AVGSRRQQTA DAFGERFGIP RRRRYSSYER LAADDGVDVV YVASPHSHHH RHTLLFLSAG RGVLCEKPFA LDAEQAAEMV VAAQTHGQFL MEAMWSRFLP AYVKIRELVA AGAIGTVLAV EGDFGFPRPV DPANRVFDLA QGGGALLDLG VYPVSLASML LGEPDRVVAL GQLGETSVDE QVAVLMGYHT GAVAVAKASL RASLACTGRV SGTEGSIELA TFMHCPDNLT VRRKSGTQRL YLPADSDVTA TDATATAGDI DGADRRNHRD RAAGGGLHHQ IRHVHFRLQA GHLDSDIMSQ AESVSVMRTL DAARAQIGLR YLATLMDGSR QS
|
| |