Gene Francci3_0637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0637 
Symbol 
ID3903315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp720954 
End bp722084 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content72% 
IMG OID637877970 
Productinosine 5-monophosphate dehydrogenase 
Protein accessionYP_479750 
Protein GI86739350 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0516] IMP dehydrogenase/GMP reductase 
TIGRFAM ID[TIGR01304] IMP dehydrogenase family protein 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGAGG TCGAGATCGG CATCGGCAAG AGCGCGCGGA TCGCGTATGG TCTCGACGCC 
GTCGGCATCA TTCCGTCCCG TCGGACCCGT GACCCGGCGG ACGTCTCGCT CGCCTGGGAG
ATCGACGCCT ACCGGTTCGA CCTGCCGCTC GTGGCCGCTC CGGCCGACGC GGTGACCTCA
CCCGCGTCGG TGATCGCGCT CGGCCGGCTC GGTGGTCTCG GTGTTCTGCA CATCGAGGGG
CTGTGGACCC GGTACGAGGA GCCGGAGAAC CACATCGCCG AGCTCAGCAA GATCGGGGCC
GCCCAGGGCC CGGACGCGGC GACCGAGCGG CTGCGCGCGT TGTACTCCGC GCCGGTCCAG
CCGGGGCTGA TCGCGCAGCG TCTCACCGAG CTTCGGGACG CGGGGGTGGT CGTGGCCGCG
GCGCTGCGTC CGCAGAAGGT CAAGGCCCTG TGCCCGCACG TGCTGGCCGC CGGGATCGAT
CTGCTCGTCA TCCACGGTAC GGCGGTCTCG GCGGAGCATC AGTCCCGCCG CAGCGAGCCG
CTCAACCTCA AACGGTTCAT CGGCCAGCTG GACATCCCGG TGCTGGTTGG CGGGTGCGCG
TCGTTCTCCA CCGCACTGCA CCTCATGCGC ACCGGGGCGG CCGGTGTCAT CGTGGGCGTC
GGGTCCGGCT TCGGTGACCG CACCCGGGAC GAGCTCGGGG TCGGCGTGCC GCTTGCCACC
GCGATCGCGG ATGCGGCCGG TGCGCGCATG CGTTATCTCG ACGAGTCGGG CGGCCGCTAC
GTCCACGTCG TCGTGCATGG TGATCTTCGG ACTGGCGGCG ACGTCGCGAA GGCGGTGGCC
TGCGGCGCGG ACGCGGTCAT GGTGGACGCG GCGCTCGCGG CCGCACGGGA GGCCCCGGGC
CAGGGCGGGG CCTGGCCGAT GGACGTGCTG CACTCCGACC TGCCGCGGGG ACGCTGGTCG
CCGGTGACCC CGACCGGGAC GCTCGCGCAG ATCGTGACCG GTCCGGGCAC GGCGACCAGA
ACCGGTGTCC TCAACCTGGC CGGCGGTCTG CGCACGGCAA TGGCGACGAC GGGATACGCA
ACTTTGAAGG AGTTCCAGAA GGCGGAGATC ATGGTGACCG CCGGTCCGTG A
 
Protein sequence
MAEVEIGIGK SARIAYGLDA VGIIPSRRTR DPADVSLAWE IDAYRFDLPL VAAPADAVTS 
PASVIALGRL GGLGVLHIEG LWTRYEEPEN HIAELSKIGA AQGPDAATER LRALYSAPVQ
PGLIAQRLTE LRDAGVVVAA ALRPQKVKAL CPHVLAAGID LLVIHGTAVS AEHQSRRSEP
LNLKRFIGQL DIPVLVGGCA SFSTALHLMR TGAAGVIVGV GSGFGDRTRD ELGVGVPLAT
AIADAAGARM RYLDESGGRY VHVVVHGDLR TGGDVAKAVA CGADAVMVDA ALAAAREAPG
QGGAWPMDVL HSDLPRGRWS PVTPTGTLAQ IVTGPGTATR TGVLNLAGGL RTAMATTGYA
TLKEFQKAEI MVTAGP