Gene Information |
Locus tag | Francci3_3717 |
Symbol | |
ID | 3903818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4449309 |
End bp | 4450565 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637881043 |
Product | serine hydroxymethyltransferase |
Protein accession | YP_482798 |
Protein GI | 86742398 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0112] Glycine/serine hydroxymethyltransferase |
TIGRFAM ID | |
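The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the record: it assumes the coordinates are 1-based and inclusive, and that the CDS length includes one stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed for Francci3_3717.
length = gene_length(4449309, 4450565)
print(length, protein_length(length))
```

Under those assumptions the listed coordinates give 1257 bp and 418 aa, which agrees with the Gene Length and Protein Length fields.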
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.982412 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.848378 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCC CGTTCTGGGG CCCGGACTTC GACCAGCTGA GCGCGACGGA TCCGCAGATC GCGGAGGTGG TCCTCGATGA GCTGGACCGG CTGCGCGGCG GCCTGCAACT CATCGCGAGC GAGAACTTCA CGTCCCCGGC GGTGCTGGCG GCGCTGGGCT CGACGTTGTC GAACAAGTAT GCCGAAGGGT ATCCGGGTCG GCGTTACTAC GGCGGTTGCC AGGTGGTCGA CCGGGCCGAG GAGATCGGCA TCGCCCGGGC GAAGCAGCTC TTCGGCGCGG AGCACGCCAA CCTGCAACCG CATTCGGGCT CGTCGGCGAA CTTCGCCGTG TACGCGGCGC TACTCACGCC AGGTGACACG GTCCTGGCGA TGTCGTTGCC GCATGGCGGT CACCTCACCC ACGGCAGCAA GGTGAGCTTC TCCGGTAAGT GGTTCAACGT GGTGGCCTAC GGCGTGCGGG AGGACACCGA GCTGATCGAC TACGACCAGG TGCGGGAGCT CGCCCGCCAG CACCGGCCCA AGATGATCAT CTGTGGGGCG ACGGCCTACC CACGTCTGAT CGACTTCGCC GCGTTCCGCT CGATCGCCGA CGAGGTCGGT TCGTGGCTGA TGGTGGACGC GGCGCACTTC ATCGGTCTGG TCGCCGGGGG CGCGATCCCG AGCCCCGTTC CCTACGCCGA CGTTGTCAGC TTCACCACCC ACAAGGTGTT GCGCGGCCCG CGAGGGGGCA TGATCCTCGC GCGTGAGGAG CTGGCTTCCC GCATCGACAA GGCCGTGTTC CCGTTCAGCC AGGGTGGCCC GCTGATGCAC GCGGTCGCGG CGAAGGCCGT CGCGTTGCGG GAGGCGGCCT CGCCCGCTTA CGCGCAGTAC GCTCGCCAGG TGGTGGCCAA CGCGCAGCGG CTCGCCGACG AGCTTGCCGC CGAGGGCATC CGGCCCGTCG CCGGTGGCAC CGACACCCAT CTCGCCCTGC TCGACCTGCG GGAACTCGGG GTCAGCGGCA AGGAAGCCGA GGCGCGTTGC GACGCGGCCG GCATCACCCT GAACAAGAAC GCCATTCCCT ATGACCCGCA GCCGCCCGCG ATCTCCTCCG GCATCCGGGT GGGAACCCCG GCGGTCACCA CCCAGGGGAT GGGCGAGGGG GAGATGAAGG AGATCGCGGG GCTGATCGCC CACGCGGTGC GTGAGCCGGA AGCCGCCGCC GACGTCGCTG CGGCGGTGTC CGCGCTCGTC GCCCGGCATC CGGCCTATCC GCGGTAG
|
Protein sequence | MSTPFWGPDF DQLSATDPQI AEVVLDELDR LRGGLQLIAS ENFTSPAVLA ALGSTLSNKY AEGYPGRRYY GGCQVVDRAE EIGIARAKQL FGAEHANLQP HSGSSANFAV YAALLTPGDT VLAMSLPHGG HLTHGSKVSF SGKWFNVVAY GVREDTELID YDQVRELARQ HRPKMIICGA TAYPRLIDFA AFRSIADEVG SWLMVDAAHF IGLVAGGAIP SPVPYADVVS FTTHKVLRGP RGGMILAREE LASRIDKAVF PFSQGGPLMH AVAAKAVALR EAASPAYAQY ARQVVANAQR LADELAAEGI RPVAGGTDTH LALLDLRELG VSGKEAEARC DAAGITLNKN AIPYDPQPPA ISSGIRVGTP AVTTQGMGEG EMKEIAGLIA HAVREPEAAA DVAAAVSALV ARHPAYPR
|
| |
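The sequences above are shown in space-separated blocks of 10. As a rough sketch of how they might be handled, the Python below strips the block spacing, computes a GC fraction, and translates codons. The helper names are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares; table 11 differs in which codons may act as starts, and that is not modelled here. The gene sequence appears to be given in coding orientation (it begins with ATG and ends with TAG), so no reverse complement is applied.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed in the record.
prefix = clean("ATGAGCACCC CGTTCTGGGG CCCGGACTTC")
print(translate(prefix))
```

The first three blocks translate to MSTPFWGPDF, which matches the first block of the protein sequence. Running `gc_fraction` on the full cleaned gene sequence would give a figure to compare with the GC content field.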