Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5974 |
Symbol | |
ID | 5674295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7278198 |
End bp | 7280684 |
Gene Length | 2487 bp |
Protein Length | 828 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244822 |
Product | phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001510224 |
Protein GI | 158317716 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) [COG0299] Folate-dependent phosphoribosylglycinamide formyltransferase PurN |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase [TIGR00639] phosphoribosylglycinamide formyltransferase, formyltetrahydrofolate-dependent |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0783963 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.20367 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGCTC GCCTAGTCGT CCTCGCCTCG GGGGCCGGCA CCACCCTGCA GGCTGTCCTC GAAGCCTGCG CGGACCCGGC CTTCGGCGCG CGGGTCGTCG CGGTCGGCAC CGACCGGCCG GACACGGGTG CGCAGCGGCG CGCGGAGGCG GTCGGGGTAC CGGTGTTCAC GGTGCGGCTC GAGGAGTGCG CCGATCGCGC CGCCTTCAAC GACGCGACCG CCACGCGGAT CGCCGAGCAC ACGCCGGACC TGCTCGTCCT CGCCGGGTAC ATGAAGATTC TCGGCAGCCA GGTGATCGGC CGGTTCCCCA CGGTGAACAC CCATCCCTCA CTGCTCCCGG CCTTCCCGGG CGCCCACGCC GTGCGCGACG CGCTCGCCGC CGGCGTCCGG GTCAGCGGGG TGACCGTGCA CTGGGTCGAC GAGGGCGTCG ACACCGGTCC GGTGATCGAC CAGGCGGCCG TCCCGGTCGA GCCCACGGAT GACGAGGACG CGCTGCGGGC ACGCATCCAG GAGGTGGAAC GGCGGCTGTT CGTAGCCGTC ATCGGTCGTG TCGTCCGGCG GGAGCTCCCG CTGGCGGGCG CACGCGCGGG TAGTACAGGA GCGGGGGAGG CGGCGATCGG CCCGGTGCTC GCTGCCGGCC CGAGCGCCGG CCGTGATCAG GTGGAACCCG CCCGCGCGGG CGGTGGGCCG GCACTCGCCG GCCCTGCCGG CGGGAGCCAT TCCGGTGGGG TCCACGCAGA GAAACATCAT GCCGACGGAG GCATCGTGGA AGGTTCAGGA GGGGTCATGG CGGGGGAGTC GAGCACGGGC GCGGGGCCTG GTGACACGAG CGGGGTCGCG GGTGTGGCAG GTGCGGCGGG CGCGGCAGGC GCCGCCCCGG CCGGCGGCCG GCCGGCCGAG GGCTCGGCGG AGCTGGTGGC GACGGGCCAC CGGAGGTTGC GCCGGGCCCT GGTGAGCGTC TACGACAAGG CCGGTCTGGA GGAGCTCGCC GCCGCGTTCG TCGAGGCGGG GGTCGAGGTG GTCTCCACCG GTTCGACGGC CGAGGTCCTG GCCCGCCACG GTGTGGCGGT CACCCCGGTC AGCACGGTCA CCGGCTTCCC GGAGGTGCTC GGCGGCCGGG TCAAGACGCT GCACCCGCGA GTGCACGCCG GTCTGTTGGC GGACCTGCGC AACGCCGAGC ACGCCGCCGT CCTGGCGGAG CTCGACATCG AGCCGTTCGA CCTGCTCGTT GTAAACCTCT ACCCGTTCCG CGAGACCGTC GCCTCGGGCG CGACCGAGGA CGAGGCGATC GAGCAGATCG ACATCGGCGG GCCGGCCATG CTGCGGGCGG CGGCGAAGAA CCACGCCTCG GTCGCCGTGG TGGTGTCGCC GCAGGACTAC GGCGACCTGG CCGCGGCCGT TCGCGGGTCC GGCTATGACC TCCCCGCCCG GCGCCGGCTG GCGGCCCGCG CCTTCGCGCA CACCGCGTCC TACGACGCGG CGGTCGCCTC CTGGTTCGCC AGCGCGCTGG CCCCCGACGA CACCGCCCGC GAGACCGGCT GGCCGGACAT CCTCGCGGCC CAGTGGCACC GGTCCAGCGT CCTGCGGTAC GGCGAGAACC CGCACCAGCG GGCCGCGCTG TACGTGGGCG AGGACGGCAG CCCCGGGCTC GCGTCCGCGC GCCAGCTCCA CGGCAAGCCG ATGTCCTACA ACAACTACAC CGACACCGAC GCGGCCTGGC GGTCGGTGTT CGACTTCGCG GACCCGGCCG TCGCGGTGAT CAAGCATGCC AACCCGTGCG GCATCGCCGT CGGTGCCACC GTCGCCGAGG CGCACCGCAA GGCCCACGCC TGCGACCCGG TGTCGGCGTT CGGCGGCGTG ATCGCGGTGA ACCGCCCGGT GAGTGTGGAG CTCGCCGAGC AGATCGCGGA GATCTTCACC GAGGTCGTCG TGGCGCCCGA CTACGAGCCC GGGGCCGTCG AGATCCTCGC CCGCAAGCCG TCGATCCGCC TGCTCGTCTG CGCGCCGCCG ACCCACTCGC GCGGGGTCGA GATGCGCCAG GTCAGCGGTG GGATGCTGCT GCAGTCACGG GACGCCCTCG ACACCCCCGG CGACCACCCG TCGGGCTGGA CGCTCGAGGC CGGCGCCCCG GTCGACGACA GCACCCTCGC CGACCTCGGC TTCGCCTGGC GCGCGGTTCG CTCGGTGAAG TCGAACGCGA TCCTGCTCGC CGCGGACAAC GCGACCGTCG GGGTCGGCAT GGGGCAGGTC AACCGGGTGG ACGCGGCGCG CCTCGCCGTG ACCCGCGCCG GCGACCGGGC GAAGGGCTCG GTGGCCGCCA GCGACGCGTT CTTCCCGTTC CCCGACGGCT TCCAGGTCCT CGCCGACGCG GGGGTGCGGG CGGTTGTCGA GCCGGGTGGC TCGGTGCGCG ACGATCTCGT GATCGCGGCC GCGCGGGAGT CCGGTGTCGC GCTGTACTTC ACCGGTGTCC GGCACTTCGC CCACTGA
|
Protein sequence | MPARLVVLAS GAGTTLQAVL EACADPAFGA RVVAVGTDRP DTGAQRRAEA VGVPVFTVRL EECADRAAFN DATATRIAEH TPDLLVLAGY MKILGSQVIG RFPTVNTHPS LLPAFPGAHA VRDALAAGVR VSGVTVHWVD EGVDTGPVID QAAVPVEPTD DEDALRARIQ EVERRLFVAV IGRVVRRELP LAGARAGSTG AGEAAIGPVL AAGPSAGRDQ VEPARAGGGP ALAGPAGGSH SGGVHAEKHH ADGGIVEGSG GVMAGESSTG AGPGDTSGVA GVAGAAGAAG AAPAGGRPAE GSAELVATGH RRLRRALVSV YDKAGLEELA AAFVEAGVEV VSTGSTAEVL ARHGVAVTPV STVTGFPEVL GGRVKTLHPR VHAGLLADLR NAEHAAVLAE LDIEPFDLLV VNLYPFRETV ASGATEDEAI EQIDIGGPAM LRAAAKNHAS VAVVVSPQDY GDLAAAVRGS GYDLPARRRL AARAFAHTAS YDAAVASWFA SALAPDDTAR ETGWPDILAA QWHRSSVLRY GENPHQRAAL YVGEDGSPGL ASARQLHGKP MSYNNYTDTD AAWRSVFDFA DPAVAVIKHA NPCGIAVGAT VAEAHRKAHA CDPVSAFGGV IAVNRPVSVE LAEQIAEIFT EVVVAPDYEP GAVEILARKP SIRLLVCAPP THSRGVEMRQ VSGGMLLQSR DALDTPGDHP SGWTLEAGAP VDDSTLADLG FAWRAVRSVK SNAILLAADN ATVGVGMGQV NRVDAARLAV TRAGDRAKGS VAASDAFFPF PDGFQVLADA GVRAVVEPGG SVRDDLVIAA ARESGVALYF TGVRHFAH
|
| |