Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4825 |
Symbol | |
ID | 5673166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5762339 |
End bp | 5764099 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243681 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_001509097 |
Protein GI | 158316589 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.308896 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0497189 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGCG CGGACCCGGT GCTGGCCGGG ATCGTCGACA ACCGGCTGCA CGCGATCGCC GAGCAGATGG CGCAGGCCAT GCTGCGTTCC TGCCGGTCGA TGGTGTTCCA GAGCCGGGAC TTCGTCACCG GGATCTTCAC CGCCCGAGGC GAGTGGGTCG CGACCAAGGA CTACATCCCC GTCCTGGCCG GTTCGCTGCC CAGCGCCCTC GCCGGCGTGA CCGAGCGCTT CGCCGCCGAG TTCGCGGCCG GGGACCTGTA CGAGGGCGAC GTGTTCGTCC TCAACGATCC GTACCACGGC AACAACCACC CGCCGGATAT CACGATCATG AAGCCGGTGT TCCACCGGGG CGTGCACCGA TTCTGGACCG TGACGAAGGG GCATCACGCC GACACCGGCG GCGCCGTCGT CGGCTACAAC CCGTACGCCC GGGACTGCTG GGAGGACGCA CTGCGCATCC CGCCCGTCCG GCTGCACCAC CGCGGGCGGC GCCAGCGGGA CGTCTGGGAT CTCATCCTGC TGAACCTACG GCTGCCGGAC GTCGTCGAGG CCGACCTGGA CTGCCAGATC GGCGCCGCCA CGCTCGGCGA GCGCGAGCTG GTCGCGCTGC TGGACGCCTA CGGCGTCGAA CCGGTCGAGA ACGCCGTCGC CTACCAGCTC GACGCGTCCC GCCGGCACAT GCGAGCCGAG ATCGGACGGC TGCCGGACGG CGTCTACCGG TCGGTGCGCC ACCTCGACGA CGGGGGAGCA CACCACCGCG AGCCGATCGC GGTGCGCCTG GAGCTGACCA TCCGCGGCGA GTCGGCGCTG TTCGACTTCA CCGCCTCCGA CCCGCAGGTC GTCGGCTACG CGAACTCCAC CCCGGCGAAC ACCGCCGCCG CCGTCCTCAT CGGCCTGCTC GGGGTCACCG ACCCCACCCA ACGCCGCACC AGCGGCGCGC TGGCGGCCCT CGACATCCGC ACCACGCCGG GCACGATCGT GCACGCCGTC GAGCCGGCCG CGACGTCGCT GTGCACGCTG ACCGCCTGCG AGACCATCGT CGAGACGGTC TGGGAGGCCG TGGGGCAGGC CGACCCGACG GTGACGAACG CCGCCTGGTG CCGTGGCTAC ACGTTCGCGG CCATGGGGGT CGACGCGCGC ACCGGCCGGC CGTTCGCCGT GACCGGCGCG ATGAAGGGCG GCTCGGGCGC CACCGCCGGT TTCGACGGCT GGGACGCCGT GGGGCCGCCG GTCGCGATGG GCGGCGGACG GGAGATCGAC GTCGAGCTGC ACGAGCTGAC CGCGCCGACC ACCGTGCTCA GCCAGGAGTA CGAGACCGAC TCCGCCGGCG CCGGCCGGTG GCGCGGCGGC CTCGGCGCGG TGTCGCGATG GCGTATTGAC CAGGACGACC TGTCGGTGCT GTGCATCGGA TCAGGTGCCC TCGACCTGAC AGCGCCGTTC GGCGCGGCGG GCGGGGCGCC CGCCCCCGCC AACCGCGGCA TCGTCCACCA CGCCGACGGG CGGATCAGCC GCCCCCCGAC GAACTCCCTG CTACGGCTTA ACCGCGGGGA CGTGGTCGAG ATCCACACCT CCGGCGGAGG CGGTTTCGGC GACCCCCTCG ACCGTCCCGT CGACGCGGTC GCCGACGACG TCCGCGACGG CCTCGTCTCG CCGACGGCCG CCCGCGATGT CTACGGCGTC ATCCTCGACC CGATCACCGA TGAGGTGGAC GCCGCCGCGA CCGCCGCCCG CCGCGCGGCA GCCCGCTCCC GCTCCCGCTG A
|
Protein sequence | MTGADPVLAG IVDNRLHAIA EQMAQAMLRS CRSMVFQSRD FVTGIFTARG EWVATKDYIP VLAGSLPSAL AGVTERFAAE FAAGDLYEGD VFVLNDPYHG NNHPPDITIM KPVFHRGVHR FWTVTKGHHA DTGGAVVGYN PYARDCWEDA LRIPPVRLHH RGRRQRDVWD LILLNLRLPD VVEADLDCQI GAATLGEREL VALLDAYGVE PVENAVAYQL DASRRHMRAE IGRLPDGVYR SVRHLDDGGA HHREPIAVRL ELTIRGESAL FDFTASDPQV VGYANSTPAN TAAAVLIGLL GVTDPTQRRT SGALAALDIR TTPGTIVHAV EPAATSLCTL TACETIVETV WEAVGQADPT VTNAAWCRGY TFAAMGVDAR TGRPFAVTGA MKGGSGATAG FDGWDAVGPP VAMGGGREID VELHELTAPT TVLSQEYETD SAGAGRWRGG LGAVSRWRID QDDLSVLCIG SGALDLTAPF GAAGGAPAPA NRGIVHHADG RISRPPTNSL LRLNRGDVVE IHTSGGGGFG DPLDRPVDAV ADDVRDGLVS PTAARDVYGV ILDPITDEVD AAATAARRAA ARSRSR
|
| |