Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4170 |
Symbol | |
ID | 5672525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4955996 |
End bp | 4958359 |
Gene Length | 2364 bp |
Protein Length | 787 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243043 |
Product | HAD family hydrolase |
Protein accession | YP_001508460 |
Protein GI | 158315952 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0560] Phosphoserine phosphatase [COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes |
TIGRFAM ID | [TIGR01488] Haloacid Dehalogenase superfamily, subfamily IB, phosphoserine phosphatase-like [TIGR01490] HAD-superfamily subfamily IB hydrolase, TIGR01490 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.243579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.356217 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGCTGC GCGAGCGGCT CGCGGGCAGG CGTGTGTTCG TGACCGGGGT GACCGGGTTC GTCGGTGAGG CGCTGTTGGA GCGGCTGCTC TCGGAGTTCC CGGACACCCA GGTCGTGGCG CTCGTGCGGC CCCGGGGCAG CCATTCGGGG ATGGCCCGGT TGGAGCGGAT GACCCGTAAG CCGGCGTTCG CCGGGCTGCG GGAACGGTTG GGCCGTGAGG GGCTGGCGGC GCGGCTGGCG GCGCAGGTGC GGGTCATCGA GGGTGACCTG GCCAGTATGG GTGAGTTGCC CGCCGACCTG GACGTGGTGA TCCACTGCGC CGGGGAGGTG TCGTTCGACC CGGCGATCGA CGAGGGGTTC GCGACGAACC TGGGCGGGGT GCAGGAGCTG CTGCGCGCGG TGCGCGCTGC CGGGGCCCGC CCGCATCTGG TGCACGTGTC GACCGCCTAT GTCGCCGGGT TGCGTTCGGG GCACATCGCC GAGGGCCGGC TGGCCCACGA GGTGGACTGG GCGGCGGAGC AGGCGTCGGC CGCGCGGGCC CGCCACGCCG CGGAGGACGC GTCGCGCTCG CCGGAGAACT CGGCGCGGTT CCGCGAGCAG GCGTCGGCCG AGCATGTGCG GGCCGGCGCG CAGACGGTGT CGGCGCAGGC GGAGGCGGCG CGGCGGGCGT GGGTGGCGGC GCGGTTGGTC GCCGCGGGCT CCGAACGCGC GCAGGTGCTG GGCTGGACGG ACGCGTACAC GTTCACCAAG GCGTTGGGTG AGCGGTATCT GGAGGACTCC CACGGCGATC TGCCGTTGAC GATCGTCCGC CCGTCGATCA TCGAGAGCGC GCTGGCGAAG CCGTTCCCGG GGTGGATCGA GGGGTTCAAG ATGGCCGAGC CGCTGATCCT GGCGTTCGGC CGCGGGGAGC TGCCGGACTT CCCTGCGTCC CCGGATGCCG TCGTCGACAT CATCCCGGTG GATCTGGTCG TCAACGCGCT GCTGGCGGCG GCGGCGAGCC CCCCGCCGCC GGAGCGCCCG GCCTATTACA CGGTGTGTTC GGGGTTCCGG AACCCGCTGC TGTTCCGTGA CCTCTACGAC TATGTGCGGG GCTATTTCCT GGCCGATCCG CTGCCGCGGC GCGGCCGCGG GCATATCGGG GTTCCTGAGT GGCCGTTCGC CGGGGCGGTG GCGGTGGAGG CGAAGCTGCG CCGCGGGGAG AAGGCGGTGG AGTGGGCGAA CCGGGTGTTG GCGCACGCGC CGCGTTCGGA GCGGGTTCGC CGGCTGGCCG TCGACTTGGA ACGTACCGAG GGGCGGGTGG CGTTCCTACG CCGCTACTCC GATGTGTACC GGGCGTACAC GAAGGCCGAG CTGGTCTACG TCGACGACGC CACCGCGGCG CTGCACGCGG CGATGGACCC GGCCGACCAG GTCGACTTCG GGTTCGACCC GGCGTGTTTC GACTGGCGGC ACTACCTGCA GGACGTGCAC TGCCCGGCGG TGACGCAGGT GCTGCGCCGG CCGCGCGACG CCGCCCCGGC GCGGCGGATG GCCCGCAACC TCACCGCCGG CGAGGGGGTG CTGGCCGTGT TCGACCTGGA CGGGACGCTG GTGTCCTCGA CGGTGGTCGA GTCGTATCTG TGGCTGCGGC TGGCCGACGG CGACGTCGGG GAACGGGCCC GGGAGCTGGT GTCGCTGGCG CGGGCACTGC CGGGGTATCT GCGTGCCGAG CGCCGCGACC GGGGGCATCT GATTCGCTCG GTGTACGGCC GGTACGCCGG CGCCGACCCG GTGGAGCTGG CCCGGGTCGT CGACGAGGTC GCGGCGGATG TGGTGTTGCG GCGGGTGAAA CCGGCGGCGG TGCGCCGTGT GCGGGAGCAT CGCGCGGCCG GGCATCGCAC CGTGCTGTTG ACCGGGGCGG TCGATGTGCT GACCCGTCCG CTGGCGCCGT TGTTCGACGA GATCGTCGCG ACCGGCCTGG AAGTCGGCGC GGATGGCCGG TACACGGGAC GGTTGTTGTC GTCGCCGCTG GTCGGGGATG CTCGGGCGGC GTTCGTGGAT CATTACGCGC GGCGCCGGGG GGCGGACCTG TCCGCGTCGT GGGCGTACGC GGACAGCCTG TCGGATCTGC CGATGCTGCG CACGGTGGGC AATCCGGTGG CGGTGAACCC GGATGTGGCG TTGCACAAGG TGGCGCGGGG GGCGGGCTGG CCGATCGAGG AGTGGCCGTC GACGCCGGGT GAGCCGCGGC TGATGGTCGC CAGCCGCGCT GAGCGGGGCC TGTTCGCCGC GGCGGCCCGC GCCCGGGTTG CCGCGGGCGG CCTGTCAGGG TCGGGGTCGG GTGGGAGTGG TGTTGTGCGG GTGGAAGTGG GTGAGGGCCG GTGA
|
Protein sequence | MGLRERLAGR RVFVTGVTGF VGEALLERLL SEFPDTQVVA LVRPRGSHSG MARLERMTRK PAFAGLRERL GREGLAARLA AQVRVIEGDL ASMGELPADL DVVIHCAGEV SFDPAIDEGF ATNLGGVQEL LRAVRAAGAR PHLVHVSTAY VAGLRSGHIA EGRLAHEVDW AAEQASAARA RHAAEDASRS PENSARFREQ ASAEHVRAGA QTVSAQAEAA RRAWVAARLV AAGSERAQVL GWTDAYTFTK ALGERYLEDS HGDLPLTIVR PSIIESALAK PFPGWIEGFK MAEPLILAFG RGELPDFPAS PDAVVDIIPV DLVVNALLAA AASPPPPERP AYYTVCSGFR NPLLFRDLYD YVRGYFLADP LPRRGRGHIG VPEWPFAGAV AVEAKLRRGE KAVEWANRVL AHAPRSERVR RLAVDLERTE GRVAFLRRYS DVYRAYTKAE LVYVDDATAA LHAAMDPADQ VDFGFDPACF DWRHYLQDVH CPAVTQVLRR PRDAAPARRM ARNLTAGEGV LAVFDLDGTL VSSTVVESYL WLRLADGDVG ERARELVSLA RALPGYLRAE RRDRGHLIRS VYGRYAGADP VELARVVDEV AADVVLRRVK PAAVRRVREH RAAGHRTVLL TGAVDVLTRP LAPLFDEIVA TGLEVGADGR YTGRLLSSPL VGDARAAFVD HYARRRGADL SASWAYADSL SDLPMLRTVG NPVAVNPDVA LHKVARGAGW PIEEWPSTPG EPRLMVASRA ERGLFAAAAR ARVAAGGLSG SGSGGSGVVR VEVGEGR
|
| |