Gene Information |
Locus tag | Francci3_2276 |
Symbol | |
ID | 3904810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2654498 |
End bp | 2656957 |
Gene Length | 2460 bp |
Protein Length | 819 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637879607 |
Product | HAD family hydrolase |
Protein accession | YP_481373 |
Protein GI | 86740973 |
COG category | [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0560] Phosphoserine phosphatase; [COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes |
TIGRFAM ID | [TIGR01488] Haloacid Dehalogenase superfamily, subfamily IB, phosphoserine phosphatase-like; [TIGR01490] HAD-superfamily subfamily IB hydrolase, TIGR01490 |
|
|
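The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record, though both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2654498
END_BP = 2656957


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2460, matching "Gene Length"
print(protein_length(length_bp))  # 819, matching "Protein Length"
```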
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000198235 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000271662 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCCACCC GGACGAGCAG GGCCCAGGCG GCGCAGCGGT GCAGCGGATG CTCACCGGAC GAGCAGGTGG CGGTGCAGGT GACGGCAGCG GAGGTCAGGG TGGGGCTGCG TGAGCGGCTC GCGGGCAGGC GGGTCCTGGT GACCGGGGTG ACCGGGTTCG TCGGGGAGGC GCTGCTGGAA CGGTTGCTGT CGGACTTCCC CGACACGGCG GTGGTGGCGC TGGTGCGGCC GCGGGGCAGT CACAGCGGGG CGGCGCGGCT GGCGCGGATG ATGCGGAAGC CGGCGTTTCG CGGGTTGCGG GAGGGGTTGG GTGCGGCGGG GGTGGCGGCG CTGCTCGCCG AGCGGGTGGA GGTGATCGAG GGGGATCTGG CGTCGATGCC GGATCTGCCC GCCGACCTCG ACGTGGTCAT CCACTGTGCC GGGGAGGTGT CGTTCGACCC GCCGGTCGAC GAGGCGTTCA CCACGAACGT CGGTGGGGTG GCGGAGCTGC TGCGGGCGCT TGCGGCCGGG GGGGCCAGGC CTCATCTGGT GCATGTGTCG ACGGCGTATG TGGCGGGGTT GCGTTCGGGG CACATCGCCG AGGGGCCGCT GGCCCATGAC GTGGACTGGC GGGTCGAGTG GGATGCGGCG TCGCGGGTGC GCCAGCAGAC CGAGGACGCC TCGCGGGCGC CGGAGTGTTC GGCGCGGTTT CGGGCGCAGG CGTCGCGGCG GTTCGCCGCC GCCGGGGCGC AGACGGTGTC GGCGGAGGCG GAGCGGCTGC GCCGGGCGTG GGTGGCGCGG CGGATGGTGA CGGCCGGGGG TGAGCGGGCG CAGGTGCTGG GGTGGACGGA TGCGTACACG TTCACCAAGG CGTTGGGTGA GCGGTATCTG GAGGATCATC ACGGGGATCT GCCGTTGACG GTCGTCCGTC CCTCGATCAT CGAGAGTGCG CTGCGCCGGC CGTTCCCGGG GTGGATCGAG GGGTTCAAGA TGGCCGAGCC GCTGATCCTG GCCTATGGCC GCGGTGAGCT GCCGGACTTC CCCGCCTCCC CGGACGCGGT GGTGGACATC ATCCCGGTGG ATCTGGTTGT CAACGCGATC CTGGCGGCGG CGGCGGTGGT GCCGCCGGCG GACACCCCGG CCTATTACAC GGTGTGTTCG GGTTTTCGTA ATCCGTTGCT GTTTCGGGAT CTGTACGCCT ATGTGCGGGA CTATTTCCAG GCCGATCCGT TGCCGCGGCG CGGTCGGGGG ACGTTCGCGG TGCCGGAGTG GCCGTTCGCG GGGGCGGCGG CGGTGGAGGC GAAGCTGCGT CGCTCGGAGC GGCTGGTGGG GTTGGCGGGC CGGGCGTTGG AGCATGCCCC GCCCTCGGAT CGGGTGCGCC GGTTTGCCGG GGAGCTGGAG CGGGCGGAGA GCCGGGTGGG GTTTCTGCGC CGCTACTCGG ATGTGTACCG GGCGTACACG AAGGCGGAGC TGGTCTATGT CGACGACGCC ACGGGTGCGT TGCACGCGGC GATGGATCCG GCCGATCAGG CGGAGTTCGG GTTTGATCCG GCGTGTTTCG ACTGGCGGCA CTATCTGCAG GACGTGCACT GCCCGGCGGT GACGGCGGTG CTGCGCCGGC CGCGTGATCC CGCGCCGCCG CGGCGGCTGT CGGGGCATCT GGCCGCCGGT GACGGGGTGC TGGCGGTGTT CGACCTGGAT GGGGCGGTGG CGTCGGCGAC GGTGATCGAG TCGTATCTGT GGATGCGGTT GGCGGACGCC TCGGCGCCGC GGCGGGTGCG GGAGCTTGCG TCGCTGGCGG TGGCGTTGCC GCGGTATGTG 
CGTGCGGAGC GTCGTGACCG GGGGCATCTG ATGCGGTCGG TGTACGCGCG CTACGCGGGG GTGGATCCGG CGGAGTTGGA ACGGCTGGTG GTGGAGGTCG CCGGGGACAT TCTGCTGCGG CGGGTGAAAC CGGCGGGGAT CCGTCGGGTG CGTGAGCATC GGGCGGCGGG GCATCGCACG GTGCTGCTGA CCGGGGCGGT GGAGGTGTTG ACGCGGCCGT TCGCGCCGTT GTTCGACGAT GTGGTCGCCG CGCGGCTGGA GGTGGGTGCG GATGGTCTGC TGACCGGCCG GTTGGAGTCC TCGCCGCTGG TCGGGGATGC GCGGGCGGCG TTCATTGATC ATCATGCGCG GGTGTTGGGG GCGGATCTGG GGGTGTCGTG GGCGTATGCG GACAGCCAGT CGGATCTGCC GTTGCTGCGG GCGGTCGGTA ACCCGGTGGC GGTGAACCCG GATCTCGCGT TGCATCAGGT GGCGCGGGGG GCGGGGTGGC CGATCGAGGA GTGGGCGTCG GCGGCCGGGG AGCCGCGTCT GGTCGTCGGG GACCGTCGGG AGCGTGGCCT GCGGGCCGCG GCGGCGCGGG CGGGGGCGGC GGCCTTGAGC TCAAGGCCAG GCCCGGGCAC GGGGCCGGCG GGGGTCGTGG TGTCGGGGGG TGGGCGGTGA
|
Protein sequence | MPTRTSRAQA AQRCSGCSPD EQVAVQVTAA EVRVGLRERL AGRRVLVTGV TGFVGEALLE RLLSDFPDTA VVALVRPRGS HSGAARLARM MRKPAFRGLR EGLGAAGVAA LLAERVEVIE GDLASMPDLP ADLDVVIHCA GEVSFDPPVD EAFTTNVGGV AELLRALAAG GARPHLVHVS TAYVAGLRSG HIAEGPLAHD VDWRVEWDAA SRVRQQTEDA SRAPECSARF RAQASRRFAA AGAQTVSAEA ERLRRAWVAR RMVTAGGERA QVLGWTDAYT FTKALGERYL EDHHGDLPLT VVRPSIIESA LRRPFPGWIE GFKMAEPLIL AYGRGELPDF PASPDAVVDI IPVDLVVNAI LAAAAVVPPA DTPAYYTVCS GFRNPLLFRD LYAYVRDYFQ ADPLPRRGRG TFAVPEWPFA GAAAVEAKLR RSERLVGLAG RALEHAPPSD RVRRFAGELE RAESRVGFLR RYSDVYRAYT KAELVYVDDA TGALHAAMDP ADQAEFGFDP ACFDWRHYLQ DVHCPAVTAV LRRPRDPAPP RRLSGHLAAG DGVLAVFDLD GAVASATVIE SYLWMRLADA SAPRRVRELA SLAVALPRYV RAERRDRGHL MRSVYARYAG VDPAELERLV VEVAGDILLR RVKPAGIRRV REHRAAGHRT VLLTGAVEVL TRPFAPLFDD VVAARLEVGA DGLLTGRLES SPLVGDARAA FIDHHARVLG ADLGVSWAYA DSQSDLPLLR AVGNPVAVNP DLALHQVARG AGWPIEEWAS AAGEPRLVVG DRRERGLRAA AARAGAAALS SRPGPGTGPA GVVVSGGGR
|
| |
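The relationship between the gene sequence, the translation table and the protein sequence can be illustrated with a short sketch. It assumes that translation table 11 uses the standard codon assignments plus a set of alternative start codons (including GTG, which begins this gene) that are read as methionine at the first position. The helper names `translate_cds` and `gc_fraction` are hypothetical, and only the first 30 bases of the gene sequence are used here as sample input.

```python
# Standard codon assignments, built in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO_ACIDS
    )
}
# Start codons assumed for translation table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    if not codons:
        return ""
    first = codons[0]
    residues = ["M" if first in START_CODONS else CODON_TABLE[first]]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
prefix = "GTGCCCACCC GGACGAGCAG GGCCCAGGCG"
print(translate_cds(prefix))  # MPTRTSRAQA, the first ten residues of the protein
print(gc_fraction(prefix))
```

Passing the full gene sequence to the same functions should, under these assumptions, give the 819-residue protein and a GC fraction close to the 73% listed in the record.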