Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1048 |
Symbol | |
ID | 5669462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1228328 |
End bp | 1229323 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641239977 |
Product | thioredoxin domain-containing protein |
Protein accession | YP_001505410 |
Protein GI | 158312902 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3118] Thioredoxin domain-containing protein |
TIGRFAM ID | [TIGR01068] thioredoxin |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.157412 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.991017 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGGA GGTCACCTGT CCCACCCGGC TCGCCGCGCA GGGGCGGGCC GGGCAGTATG CCCGACACCC TCGGCGGCCT ACGGCTCGCC GGTGCGGTTC CCCTAGACCC GAAGCCGGCC CAGCCGCCGG CGCCCCCGGC CGGCCGGGCC GGCCCCGGTG GCTCTCCCGG CGGCGCGGCG CCCGCCGGTG CCGCCGGCCC GGTCGTGATC GACGTGACCG AGGCGACGTT CGCCGAGGAC GTCGTGAACC GTTCCATGCA GGTTCCCGTG GTCATCGACT TCTGGGCCGA GTGGTGCGGG CCGTGCAAGC AGCTCAGCCC CATCCTGGAG CGCCTCGCCG CCGCCGACGG CGGTCGCTGG GTGCTCGCCA AGGTGGACGT CGACGCCAAC CCCGGTCTCG CGCAGGCCGC GGGCGTGCAG GGCATCCCCG CGGTCAAGGC GGTGGTCGGC GGCCGGATCA TCGGAGAGTT CACCGGCGCG GTCCCCGAGC GGGAGGTGCG CGGCTGGCTG GACCAGCTCC TGAGCGTCGT CGGGGAGGCG ATGGGCGGGC TGCCGGGGGC GGGAGCCGAG GGCGGTCCCG CGCTGCCGCC GAACATCGCC GCCGCGGAGG ACGCGATGGC CACCGGCGAC CTGGACGCGG CGGCCGCCGC CTACCAGGCC CAGCTCGCCG AGGCCCCCGG GGACGCGGAC GCCACCCTCG GCCTGGCCCG GGTGGAGCTG CTGCGGCGGG TGCGCGGCTA CGACCCGGCC TGGCTGCGTC AGCGGCTCTC GGAGAACCCC GACGACATCG AGGCGGCGCT CGCGGTGGCC GACCTGACCA TCGCCCAGGG CGACCCGGCC ACCGGCCTGG GCCGCCTCGT CGACCTCGTC CGCCGCACCT CGGGCGACGA CCGGGAGAAG CTGCGGGCGC ATCTGGTCGG GCTGTTCCAG GCGCTGGGCG ACGGCGAGCC GGCGGTCGCC CCGGCCCGGC GGGCCCTCGC CGCCGCCCTG TTCTGA
|
Protein sequence | MQRRSPVPPG SPRRGGPGSM PDTLGGLRLA GAVPLDPKPA QPPAPPAGRA GPGGSPGGAA PAGAAGPVVI DVTEATFAED VVNRSMQVPV VIDFWAEWCG PCKQLSPILE RLAAADGGRW VLAKVDVDAN PGLAQAAGVQ GIPAVKAVVG GRIIGEFTGA VPEREVRGWL DQLLSVVGEA MGGLPGAGAE GGPALPPNIA AAEDAMATGD LDAAAAAYQA QLAEAPGDAD ATLGLARVEL LRRVRGYDPA WLRQRLSENP DDIEAALAVA DLTIAQGDPA TGLGRLVDLV RRTSGDDREK LRAHLVGLFQ ALGDGEPAVA PARRALAAAL F
|
| |