Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0533 |
Symbol | |
ID | 3905444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 619171 |
End bp | 620358 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637877862 |
Product | hypothetical protein |
Protein accession | YP_479646 |
Protein GI | 86739246 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.271829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.131592 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTCG ATGGTGAGAT TCCGGTGGGT CGGGGTGGTG GCGAGATCCG GTCTGTGCTG GACCGGGCCG CAGCCGGCGG GCGTATCTCC GCGGAGGAGG CGCTGCTCCT CTATACGAGG GCGCCGCTGC ACGCGCTCGG CGGCGCGGCG GACACGGTCC GTCGGCGCCG TTTCCCCGAT GGCATCGCGA CGTACATCAT CGACCGGAAC ATCAACTACA CGAATGTCTG CGTGACCGCC TGCCGGTTTT GCGCCTTCTA CCGCGCTCCG AAGCATGCCG AGGGCTGGGT CCGCGACGTC GAGGACATCG TCGCCAAGTG CGGGGAGGCG GTCGAGCTCG GCGCCACGCA GATCATGCTG CAGGGCGGGC ACCATCCCGA CTTCGGCATC GAGTGGTATG AGCGTACCTT TGCCGCCATC AAGAAGGCAT ATCCCCAGTT GGCGCTGCAC TCGCTGGGCG CCAGCGAGGT TGTGCACATC GCCCGGACGT CCGATCTGAC TTTTCCCGAG GTCATTACCC GGCTGCGGGA CGCGGGCCTG GACAGCTTCG CGGGCGCGGG AGCGGAGATT CTCACCGAAC GGCCCCGGCA GGCGATCGCT CCGCTGAAGG AGCCCGGTCA CGTCTGGCTG TCCGTGATGG AGACCGCCCA CAACCTCGGC CTGGAATCCA CCGCCACCTT CATGATGGGC ACGGGGGAGA CGAACGCCGA GCGCATCGAA CACCTGACGA TGATCCGGGA CGTCCAGGAC CGGACCGGCG GGTTCCGTTC GTTCATCCCC TGGACCTACC AGCCGGAGAA CAATCATCTC GGCGGGCGCA CCCAGGCGAC GACCCTGGAG TACCTCCGCC TCGTCGCGGT CGCGCGGCTG TTCTTCGACA ACATCACGCA CCTGCAGGGC TCCTGGCTGA CCACCGGCAA GGAGATCGGC CAGCTCACCC TGCACATGGG CGCCGACGAC CTCGGCTCGG TGATGCTGGA GGAGAACGTC GTCTCCTCCG CCGGGGCGCG CCACCGCACC AACCGGTCGG AGCTGATCTC CCTGATCCGT GCTGCCGGCC GCATCCCCGC TCAGCGCGAC ACCCGCTACC AGCACCTCGT CGTGCACCGC GACCCGGCGC AGGACCCGGT TGACGACCGG GTGGCCTCGC ACTTCTCCTC CACCGCGCTA CCGCTCATCT CCGCGTAG
|
Protein sequence | MDVDGEIPVG RGGGEIRSVL DRAAAGGRIS AEEALLLYTR APLHALGGAA DTVRRRRFPD GIATYIIDRN INYTNVCVTA CRFCAFYRAP KHAEGWVRDV EDIVAKCGEA VELGATQIML QGGHHPDFGI EWYERTFAAI KKAYPQLALH SLGASEVVHI ARTSDLTFPE VITRLRDAGL DSFAGAGAEI LTERPRQAIA PLKEPGHVWL SVMETAHNLG LESTATFMMG TGETNAERIE HLTMIRDVQD RTGGFRSFIP WTYQPENNHL GGRTQATTLE YLRLVAVARL FFDNITHLQG SWLTTGKEIG QLTLHMGADD LGSVMLEENV VSSAGARHRT NRSELISLIR AAGRIPAQRD TRYQHLVVHR DPAQDPVDDR VASHFSSTAL PLISA
|
| |