Gene Franean1_5974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5974 
Symbol 
ID5674295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7278198 
End bp7280684 
Gene Length2487 bp 
Protein Length828 aa 
Translation table11 
GC content74% 
IMG OID641244822 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001510224 
Protein GI158317716 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
[COG0299] Folate-dependent phosphoribosylglycinamide formyltransferase PurN 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
[TIGR00639] phosphoribosylglycinamide formyltransferase, formyltetrahydrofolate-dependent 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0783963 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.20367 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGCTC GCCTAGTCGT CCTCGCCTCG GGGGCCGGCA CCACCCTGCA GGCTGTCCTC 
GAAGCCTGCG CGGACCCGGC CTTCGGCGCG CGGGTCGTCG CGGTCGGCAC CGACCGGCCG
GACACGGGTG CGCAGCGGCG CGCGGAGGCG GTCGGGGTAC CGGTGTTCAC GGTGCGGCTC
GAGGAGTGCG CCGATCGCGC CGCCTTCAAC GACGCGACCG CCACGCGGAT CGCCGAGCAC
ACGCCGGACC TGCTCGTCCT CGCCGGGTAC ATGAAGATTC TCGGCAGCCA GGTGATCGGC
CGGTTCCCCA CGGTGAACAC CCATCCCTCA CTGCTCCCGG CCTTCCCGGG CGCCCACGCC
GTGCGCGACG CGCTCGCCGC CGGCGTCCGG GTCAGCGGGG TGACCGTGCA CTGGGTCGAC
GAGGGCGTCG ACACCGGTCC GGTGATCGAC CAGGCGGCCG TCCCGGTCGA GCCCACGGAT
GACGAGGACG CGCTGCGGGC ACGCATCCAG GAGGTGGAAC GGCGGCTGTT CGTAGCCGTC
ATCGGTCGTG TCGTCCGGCG GGAGCTCCCG CTGGCGGGCG CACGCGCGGG TAGTACAGGA
GCGGGGGAGG CGGCGATCGG CCCGGTGCTC GCTGCCGGCC CGAGCGCCGG CCGTGATCAG
GTGGAACCCG CCCGCGCGGG CGGTGGGCCG GCACTCGCCG GCCCTGCCGG CGGGAGCCAT
TCCGGTGGGG TCCACGCAGA GAAACATCAT GCCGACGGAG GCATCGTGGA AGGTTCAGGA
GGGGTCATGG CGGGGGAGTC GAGCACGGGC GCGGGGCCTG GTGACACGAG CGGGGTCGCG
GGTGTGGCAG GTGCGGCGGG CGCGGCAGGC GCCGCCCCGG CCGGCGGCCG GCCGGCCGAG
GGCTCGGCGG AGCTGGTGGC GACGGGCCAC CGGAGGTTGC GCCGGGCCCT GGTGAGCGTC
TACGACAAGG CCGGTCTGGA GGAGCTCGCC GCCGCGTTCG TCGAGGCGGG GGTCGAGGTG
GTCTCCACCG GTTCGACGGC CGAGGTCCTG GCCCGCCACG GTGTGGCGGT CACCCCGGTC
AGCACGGTCA CCGGCTTCCC GGAGGTGCTC GGCGGCCGGG TCAAGACGCT GCACCCGCGA
GTGCACGCCG GTCTGTTGGC GGACCTGCGC AACGCCGAGC ACGCCGCCGT CCTGGCGGAG
CTCGACATCG AGCCGTTCGA CCTGCTCGTT GTAAACCTCT ACCCGTTCCG CGAGACCGTC
GCCTCGGGCG CGACCGAGGA CGAGGCGATC GAGCAGATCG ACATCGGCGG GCCGGCCATG
CTGCGGGCGG CGGCGAAGAA CCACGCCTCG GTCGCCGTGG TGGTGTCGCC GCAGGACTAC
GGCGACCTGG CCGCGGCCGT TCGCGGGTCC GGCTATGACC TCCCCGCCCG GCGCCGGCTG
GCGGCCCGCG CCTTCGCGCA CACCGCGTCC TACGACGCGG CGGTCGCCTC CTGGTTCGCC
AGCGCGCTGG CCCCCGACGA CACCGCCCGC GAGACCGGCT GGCCGGACAT CCTCGCGGCC
CAGTGGCACC GGTCCAGCGT CCTGCGGTAC GGCGAGAACC CGCACCAGCG GGCCGCGCTG
TACGTGGGCG AGGACGGCAG CCCCGGGCTC GCGTCCGCGC GCCAGCTCCA CGGCAAGCCG
ATGTCCTACA ACAACTACAC CGACACCGAC GCGGCCTGGC GGTCGGTGTT CGACTTCGCG
GACCCGGCCG TCGCGGTGAT CAAGCATGCC AACCCGTGCG GCATCGCCGT CGGTGCCACC
GTCGCCGAGG CGCACCGCAA GGCCCACGCC TGCGACCCGG TGTCGGCGTT CGGCGGCGTG
ATCGCGGTGA ACCGCCCGGT GAGTGTGGAG CTCGCCGAGC AGATCGCGGA GATCTTCACC
GAGGTCGTCG TGGCGCCCGA CTACGAGCCC GGGGCCGTCG AGATCCTCGC CCGCAAGCCG
TCGATCCGCC TGCTCGTCTG CGCGCCGCCG ACCCACTCGC GCGGGGTCGA GATGCGCCAG
GTCAGCGGTG GGATGCTGCT GCAGTCACGG GACGCCCTCG ACACCCCCGG CGACCACCCG
TCGGGCTGGA CGCTCGAGGC CGGCGCCCCG GTCGACGACA GCACCCTCGC CGACCTCGGC
TTCGCCTGGC GCGCGGTTCG CTCGGTGAAG TCGAACGCGA TCCTGCTCGC CGCGGACAAC
GCGACCGTCG GGGTCGGCAT GGGGCAGGTC AACCGGGTGG ACGCGGCGCG CCTCGCCGTG
ACCCGCGCCG GCGACCGGGC GAAGGGCTCG GTGGCCGCCA GCGACGCGTT CTTCCCGTTC
CCCGACGGCT TCCAGGTCCT CGCCGACGCG GGGGTGCGGG CGGTTGTCGA GCCGGGTGGC
TCGGTGCGCG ACGATCTCGT GATCGCGGCC GCGCGGGAGT CCGGTGTCGC GCTGTACTTC
ACCGGTGTCC GGCACTTCGC CCACTGA
 
Protein sequence
MPARLVVLAS GAGTTLQAVL EACADPAFGA RVVAVGTDRP DTGAQRRAEA VGVPVFTVRL 
EECADRAAFN DATATRIAEH TPDLLVLAGY MKILGSQVIG RFPTVNTHPS LLPAFPGAHA
VRDALAAGVR VSGVTVHWVD EGVDTGPVID QAAVPVEPTD DEDALRARIQ EVERRLFVAV
IGRVVRRELP LAGARAGSTG AGEAAIGPVL AAGPSAGRDQ VEPARAGGGP ALAGPAGGSH
SGGVHAEKHH ADGGIVEGSG GVMAGESSTG AGPGDTSGVA GVAGAAGAAG AAPAGGRPAE
GSAELVATGH RRLRRALVSV YDKAGLEELA AAFVEAGVEV VSTGSTAEVL ARHGVAVTPV
STVTGFPEVL GGRVKTLHPR VHAGLLADLR NAEHAAVLAE LDIEPFDLLV VNLYPFRETV
ASGATEDEAI EQIDIGGPAM LRAAAKNHAS VAVVVSPQDY GDLAAAVRGS GYDLPARRRL
AARAFAHTAS YDAAVASWFA SALAPDDTAR ETGWPDILAA QWHRSSVLRY GENPHQRAAL
YVGEDGSPGL ASARQLHGKP MSYNNYTDTD AAWRSVFDFA DPAVAVIKHA NPCGIAVGAT
VAEAHRKAHA CDPVSAFGGV IAVNRPVSVE LAEQIAEIFT EVVVAPDYEP GAVEILARKP
SIRLLVCAPP THSRGVEMRQ VSGGMLLQSR DALDTPGDHP SGWTLEAGAP VDDSTLADLG
FAWRAVRSVK SNAILLAADN ATVGVGMGQV NRVDAARLAV TRAGDRAKGS VAASDAFFPF
PDGFQVLADA GVRAVVEPGG SVRDDLVIAA ARESGVALYF TGVRHFAH