Gene Franean1_5997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5997 
Symbol 
ID5674318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7314567 
End bp7315697 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content73% 
IMG OID641244845 
Productinosine 5-monophosphate dehydrogenase 
Protein accessionYP_001510247 
Protein GI158317739 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0516] IMP dehydrogenase/GMP reductase 
TIGRFAM ID[TIGR01304] IMP dehydrogenase family protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.833679 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0905948 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGACG TCGAAATCGG TATCGGGAAG AACGCGCGAG TGGGCTACGG CCTTGACGCC 
GTGGGCATCG TTCCCTCGCG GCGCACCCGG GACCCAGCCG ACGTGTCACT CGCCTGGGAG
ATCGACGCCT ATCACTTCGA TCTGCCGATC GTCGCGGCGC CGGCCGACGC GGTGACCTCG
CCGGAGTCCG CGATCGCCGT CGGGCGCCAG GGTGGGCTCG GCGTGCTGCA CCTCGAGGGG
CTGTGGACCA GGCACGAGGA CCCCGAGCCG CTGCTCGAGG AGGTCGCGGA GCTGGGCGCG
CGTTCCGGCG CGGTGGCGGC GACCCGGCGC CTGCGTGAGT TCTACGCGGC GCCGGTGCAG
CCCGAGCTCA TCGGGGCGCG GCTGGCGCGG ATGCGGGAGG CCGGCGTGGT CACCGCCGCC
GCGCTGCGCC CGCAGAAGGT CCGGGCACTG TGCCCGCACG TGCTGGCCGC GGGCGTGGAC
CTCCTCGTCA TCCATGGCAC GGCGGTGTCG GCCGAGCACC AGTCCAGGCG CACTGAGCCG
CTGAATCTCA AGCAGTTCAT AGGACAGCTC GACATCCCGG TGATCGTCGG TGGCTGCGCA
TCCTTCTCCA CGGCACTGCA TCTCATGCGG ACGGGCGCGG CCGGCGTGAT CGTCGGAGTC
GGTGCCGGCC TCGGCGACGA CACGGCCGAG ACCCTCGGAA TTGGTGTTCC GCTGGCGACT
GCGATCGCCG ACGCGGCCGG CGCCCGGATG CGGTACCTCG ACGAGTCCGG CGGCCGGTAC
GTCCACGTCA TCGCGCATGG CGACCTGCGC ACCGGCGGGG ACGCGGCGAA GGCCGTGGCC
TGCGGAGCCG ACGCCGTGAT GGTGGATTCG CCGCTCGCGC AGGCGGTGGA CGCCCCTGGG
CGGGGCTCGG TCTGGTCGAT GGAGATCCTG CACTCGGACC TGCCGCGCGG GCGGTGGGCG
CCGGTCGAGA CGTCACGTAC CGTCGCCGAG ATCCTCACCG GGGGCGAGGT CGCGGCCGAG
GACGGTGTGG CGAACATCGC CGGGGCCCTG CGGGCGGCGA TGGCCACAAC GGGCTACGCG
ACGTTGAAGG AGTTCCAGAA GGCGGAGATC ATGATCGCCG CCGGGCGCTA G
 
Protein sequence
MADVEIGIGK NARVGYGLDA VGIVPSRRTR DPADVSLAWE IDAYHFDLPI VAAPADAVTS 
PESAIAVGRQ GGLGVLHLEG LWTRHEDPEP LLEEVAELGA RSGAVAATRR LREFYAAPVQ
PELIGARLAR MREAGVVTAA ALRPQKVRAL CPHVLAAGVD LLVIHGTAVS AEHQSRRTEP
LNLKQFIGQL DIPVIVGGCA SFSTALHLMR TGAAGVIVGV GAGLGDDTAE TLGIGVPLAT
AIADAAGARM RYLDESGGRY VHVIAHGDLR TGGDAAKAVA CGADAVMVDS PLAQAVDAPG
RGSVWSMEIL HSDLPRGRWA PVETSRTVAE ILTGGEVAAE DGVANIAGAL RAAMATTGYA
TLKEFQKAEI MIAAGR