Gene Franean1_0197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0197 
Symbol 
ID5668622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp242234 
End bp243262 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content77% 
IMG OID641239126 
Productnicotinate-nucleotide pyrophosphorylase 
Protein accessionYP_001504570 
Protein GI158312062 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0157] Nicotinate-nucleotide pyrophosphorylase 
TIGRFAM ID[TIGR00078] nicotinate-nucleotide pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.702249 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGGCG GGCGGCCGGG GCCGCCGCTC GGCCCCGACC AGCTTTCGGC CGCCGTCCTC 
GCGGGGCTGA AGTCCGCGGG GCTGGAGGCG GCGGCGGTCC TCGACGTGAT CGGTCGGGCC
CTCGCCGAGG ATCTTCCCGT GGCCACGTCC CCGCAGGTCA GCGGGGCGCC CCGGCCGGTG
GATGAGGCAG CCCCGCCAGG TGGTGAGGCG GCCCCGCAGG TGGGCGAGCC GGCCTGGGCG
GTCGATGCGA CGTCCGCGGC GACCGTGGAC GCGGCGCTGA CCAGTACCGG TTCCGTGGTC
TCCCGGGCGG ACGGCGTGGT GGCGGGCGTG CCGGTCGCCG CGGCCGTGTT CGAGGTGCTG
CTCGGCGCCG CGGTCACCGT GACGCCGACG CTCGCCGACG GGGACCGGGT CGTTCCGGGC
ACGGAGGTGC TCCGGGTGCG TGGTCCCGTT CGTGGGCTTC TCACCGCCGA GCGGACCGCG
CTCAACCTGC TCTGCCACCT CTCCGGGGTG GCCAGTGTGA CCCGGCTGTG GGCCGACGCG
GTCGCCGGCA CCGGCGCGGC CGTCCGCGAC ACCCGCAAGA CCCTGCCCGG GCTGCGCGCG
CTGGAGAAGT ACGCGGTGCG CTGCGGCGGC GGGCGCAACC ACCGGATGTC GCTGGCCGAC
GCCGCTCTGG TCAAGGACAA CCACGTGATC GCGGCCGGCG GCGTGGCGGC GGCGTTCACC
GCGGTGCGCG CCCGGTACCC GGACCTGCCC GTCGAGGTCG AGTGCGACAC CGTCGAGCAG
GTCGTCGAGG CGGTCGGCGC CGGAGCCGAC CTGATCCTCT GTGACAACAT GTCCCTGGAC
GAGCTGCGCG CGTCGGTCGC CGTCGCCCGG CCGGCCGGGG TCCTGTTGGA GGCGAGCGGC
GGGCTCACCC TCGACGTGGC CGCCGCGGTG GCCGCCACCG GGGTCGACTT CCTCGCCGTC
GGCGGGCTCA CCCACTCGGC GCCCGCGCTG GACCTCGGCT TCGACCTCGC GGTGCCGGCT
CCCCGCTGA
 
Protein sequence
MSGGRPGPPL GPDQLSAAVL AGLKSAGLEA AAVLDVIGRA LAEDLPVATS PQVSGAPRPV 
DEAAPPGGEA APQVGEPAWA VDATSAATVD AALTSTGSVV SRADGVVAGV PVAAAVFEVL
LGAAVTVTPT LADGDRVVPG TEVLRVRGPV RGLLTAERTA LNLLCHLSGV ASVTRLWADA
VAGTGAAVRD TRKTLPGLRA LEKYAVRCGG GRNHRMSLAD AALVKDNHVI AAGGVAAAFT
AVRARYPDLP VEVECDTVEQ VVEAVGAGAD LILCDNMSLD ELRASVAVAR PAGVLLEASG
GLTLDVAAAV AATGVDFLAV GGLTHSAPAL DLGFDLAVPA PR