Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0519 |
Symbol | |
ID | 3905177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 605148 |
End bp | 606296 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637877848 |
Product | hypothetical protein |
Protein accession | YP_479632 |
Protein GI | 86739232 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.520088 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGCTG GGCTCAAGCG GGAGATCGAG GCCAAAGTTT CCGCCGGCGA GAGACTGAGC CGTGCCGACG GGGAGGCGCT GTACGCCAGC GACGACCTCG TCTGGTTGGG CGGTCTCGCC CACGAGGTTC GCACCAGAAA GAACGGCGAC AAGACCTTCT TCAACGTCAA CCGGCATCTG AACCTGACGA ACGTCTGCTC GGCGTCCTGC GCGTACTGCT CGTTCCAGCG CAAGCCCGGT GAGTCCGATG CCTACACCAT GCGCATCGAG GAGGCTGTCC GGCTGGCGAA GGAGATGGAG CCGGCCGGGA TCACCGAGCT GCATATCGTC AACGGTCTGC ATCCGACGCT GCCCTGGCGT TACTATCCGC GCTCGCTGCG GGAGCTCGGC AAGGCGCTGC CCGGCGTCGC CTTGAAGGCG TTCACCGCTA CCGAGATCCA CTGGTTTGAG AAGATCAGTG GCCTGTCCGC CGATGAGATC CTCGACGAGC TCATCGACGC GGGACTCGAG TCGCTCACCG GCGGCGGCGC GGAGATCTTC GACTGGGAGG TCCGCCAGAA GATCGTCGGT CACGAGACGC ACTGGGAGGA CTGGTCGCGC ATCCACCGGC TCGCCCACGC CAAGGGCCTG CGCACTCCGT GCACGATGCT GTACGGGCAC GTCGAGGACC CCCGGCACCG GGTGGACCAC GTGCTGCGGC TGCGTGAGCT GCAGGATTCC ACGGGCGGGT TCACGGTCTT CATCCCGCTG CGCTTCCAGC ACGACGCCGC CGGCGACCCG CGCAACCGGT TGATGAACCA GCCGATGGCG ACGGGGGCCG AGGCGTTGAA GACGTTCGCC GTCTCCCGGC TGCTGTTCGA CAACGTGGAC CACATCAAGT GCTTCTGGGT GATGCATGGT CTCACCACGG CGCAGCTCGC GTTGAACTTC GGCGCCGACG ACCTCGACGG TTCCGTTGTC GAGTACAAGA TCACGCACGA TGCGGACCGG TTCGGGACGC CGCACACCAT GACCCGCGAG GATCTGCTCG CAATCATCCG CGACGCCGGC TTCCGCCCGG TCGAGCGGGA CACCCGCTAC CGGGAGATCC GTGTCTACGA CGGTCCCGAC CCGGCTCGGC GTGACGTCCC GACCTCGATC GACGCCTGA
|
Protein sequence | MDAGLKREIE AKVSAGERLS RADGEALYAS DDLVWLGGLA HEVRTRKNGD KTFFNVNRHL NLTNVCSASC AYCSFQRKPG ESDAYTMRIE EAVRLAKEME PAGITELHIV NGLHPTLPWR YYPRSLRELG KALPGVALKA FTATEIHWFE KISGLSADEI LDELIDAGLE SLTGGGAEIF DWEVRQKIVG HETHWEDWSR IHRLAHAKGL RTPCTMLYGH VEDPRHRVDH VLRLRELQDS TGGFTVFIPL RFQHDAAGDP RNRLMNQPMA TGAEALKTFA VSRLLFDNVD HIKCFWVMHG LTTAQLALNF GADDLDGSVV EYKITHDADR FGTPHTMTRE DLLAIIRDAG FRPVERDTRY REIRVYDGPD PARRDVPTSI DA
|
| |