Gene Franean1_0155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0155 
Symbol 
ID5668580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp184343 
End bp186652 
Gene Length2310 bp 
Protein Length769 aa 
Translation table11 
GC content74% 
IMG OID641239084 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_001504528 
Protein GI158312020 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.513467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0402011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGAA CGCCTTCCGC CCCCACGACC AGCCCCGACG ACCAGGCCGA CGGCCCGGGC 
GAGGGCACCG TCGACCGCCA GCCCTACCGC GAGCTGGGCC TGACCGACGA CGAGTACGAA
CGGATCGTCG CCACCCTCGG CCGGGTCCCC ACCGACGCCG AGCTGGCCAT GTACTCGGTG
ATGTGGAGCG AGCACTGCTC GTACAAGTCG TCCAAGGTGC ACCTGCGCCA GTTCCGGGAC
ACTCCGGCCA CAGACCGGCT CCTGGTCGGC ATGGGTGAGA ACGCCGGCGT GGTGGACGTC
GGCGAGGGCC TGGCCGTCAC CTTCAAGGTC GAGTCGCACA ACCACCCGAG CTTCGTCGAG
CCGTACCAGG GCGCGGCGAC CGGTGTCGGC GGCATCGTCC GCGACATCCT GACCATGGGG
GCCCGGCCGA TCGGGATCCT CGACCCGCTG CGCTTCGGCG CCGCCGACGC CCCCGACACC
GCCCGGGTGC TGCCCGGCGT GGTCGCCGGG ATCGGCGGCT ATGGCAACTG CCTGGGCCTG
CCCACCATCG GCGGCGAGGT CGTCTTCGAC CCGGTGTACG GCGGGAACCC GCTGGTCAAC
GCGCTGTGCG TCGGCGTGAT GCCGGTCGGG CGGGTGCAGA CATCCGCCGC CACCGGGGTC
GGCAACGCCG TGGTGCTGCT CGGCGCGAAA ACCGGCCGGG ACGGCATCGG CGGCGTCTCA
GTGCTCGCCT CGGCCACCTT CGACGAGGGC GGCGGCCCGG CCCGGCGGCC GTCCGTGCAG
GTCGGGGACC CGTTCACTGA GAAGATCCTC ATCGAGTGCT GCCTGGAGCT GTTCGACCGC
GGCCTGGTCA CCGGCATCCA GGACCTCGGC GGCGCCGGCC TGACCTGCGC GCTCACCGAG
ACGACCGCCG CCGGCATCGC CACCGGCCAG CCCGGCGGCA TGGAGGTCGA CCTCGACCTC
GTACCGCTGC GCGAGGCGTC GATGGCCGCG CACGAGGTCC TGGCCAGCGA GTCCCAGGAG
CGGATGCTGG CCATCGTCAC CCCGGACGCG CTGCCCGAGG TGCTGGCGCT CGCCGAGCGG
TGGGGCGTGA TCGCCACCAA CATCGGCACG GTGACCGACA GCGGCCGTCT CGTCGTCCGC
TGGCACGGCG AGGTCGTCGT GGACGTCCCG CCGGGCTCGC TCGCCGACGA CGGGCCGGTC
TACGAGCGCC CGCTGCGCCG CCCGGCCGAC CTCGACCTGC TGCGGGCGGA CGCGCCGTCC
GCCCTGGAAC GGCCGCGTAC CGGCGACGCG CTGCGTGCGA CCCTGCTGCG GATGATCGCC
TCTCCGAACC TGTGCTCGCG GGCCTGGGTG ACCGAGCAGT ACGACCGGTA CGTGCAGGCC
AACACGGTGC TGGCCCAGCC CGAGGACGCC GGCGTCCTGC GGCTGTCGGC CTCCGGGCTC
GGCATCGCGC TGGCCACCGA CGGCAACGGC CGCTACGCCC GGCTGGACCC GTTCGCCGGG
GCGCAGCTCG CCCTGGCCGA GGCGTGCCGC AACGTCACCG CGGCCGGCGC CGAGCCGATC
GCGGTGACCA ACTGCCTGAA CTTCGGCTCC CCCGAGGACC CGGAGGTCAT GTGGCAGTTC
GCCCAGGCCT GCGCCGGGCT CGCCGACGCC TGCCGGCGGC TCGGCCTGCC GGTCACCGGC
GGGAACGTGT CCTTCTACAA CCAGACCGGC TCGGCGCCGA TCCATCCGAC GCCGGTCGTC
GGCGTGCTCG GCCTGTTCGA CGACGTCACT CGCCGCACCC CCATCGGCTT CACGGACGAG
GGCGACGCGC TGCTCCTGCT GGGCGACACC CGGGACGAGT TCGGCGGCTC CGAGTGGGCC
TGGGCGACCC ACGGCCATCT CGGCGGGACG CCGCCGGCCG TCGACCTGGA ACGGGAGAAG
CTGCTCGGCG AGATCCTCGT CGGGGGCTCC CGCGAAGGGC TGCTCACCGC GGCCCACGAC
CTCTCCGAGG GCGGGCTCGC CCAGGCGCTG GTCGAGTCCT GCCTGCGGGG CGGGCACGGG
GCACGGATCG AGCTGCCCGC CGGGGCGGAC GCGTTCGTCG AGCTGTTCAG CGAATCGGCC
GGACGGGCGG TCGTCGCCGT CCCCGCCGCC GAGCAGGACC GTTTCGCGCG GCTGTGCGCG
GACCGGGGCT TGCCATGCCG GCAGATCGGC GTCGTGACCG ACGGCGAAGG CGGAAGCCTG
AACGTCGCCG GCGAGTTCGC CATACCGCTG GACGAGCTGC GGGCGGCCCA CGAGGGCACG
CTGCCCCGCC TGTTCGGCCG CGGAGCCTGA
 
Protein sequence
MTGTPSAPTT SPDDQADGPG EGTVDRQPYR ELGLTDDEYE RIVATLGRVP TDAELAMYSV 
MWSEHCSYKS SKVHLRQFRD TPATDRLLVG MGENAGVVDV GEGLAVTFKV ESHNHPSFVE
PYQGAATGVG GIVRDILTMG ARPIGILDPL RFGAADAPDT ARVLPGVVAG IGGYGNCLGL
PTIGGEVVFD PVYGGNPLVN ALCVGVMPVG RVQTSAATGV GNAVVLLGAK TGRDGIGGVS
VLASATFDEG GGPARRPSVQ VGDPFTEKIL IECCLELFDR GLVTGIQDLG GAGLTCALTE
TTAAGIATGQ PGGMEVDLDL VPLREASMAA HEVLASESQE RMLAIVTPDA LPEVLALAER
WGVIATNIGT VTDSGRLVVR WHGEVVVDVP PGSLADDGPV YERPLRRPAD LDLLRADAPS
ALERPRTGDA LRATLLRMIA SPNLCSRAWV TEQYDRYVQA NTVLAQPEDA GVLRLSASGL
GIALATDGNG RYARLDPFAG AQLALAEACR NVTAAGAEPI AVTNCLNFGS PEDPEVMWQF
AQACAGLADA CRRLGLPVTG GNVSFYNQTG SAPIHPTPVV GVLGLFDDVT RRTPIGFTDE
GDALLLLGDT RDEFGGSEWA WATHGHLGGT PPAVDLEREK LLGEILVGGS REGLLTAAHD
LSEGGLAQAL VESCLRGGHG ARIELPAGAD AFVELFSESA GRAVVAVPAA EQDRFARLCA
DRGLPCRQIG VVTDGEGGSL NVAGEFAIPL DELRAAHEGT LPRLFGRGA