Gene Francci3_0636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0636 
Symbol 
ID3903314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp719117 
End bp720730 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content70% 
IMG OID637877969 
Productinosine-5'-monophosphate dehydrogenase 
Protein accessionYP_479749 
Protein GI86739349 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0516] IMP dehydrogenase/GMP reductase
[COG0517] FOG: CBS domain 
TIGRFAM ID[TIGR01302] inosine-5'-monophosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.644185 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGTG CCATGAACGG TCCCCTCGCC ACCGATTTCA CGTTGCCGGC CGGGGTGGGC 
ACCCACGTCG GCGCGCGTCC CGACCTGCTG GGGGCCGAGG CCGCCGCGGT ACCGCCGAAG
CTGGCGATGC TCGGGCTGAC CTTCGACGAC GTGCTGCTGC TTCCGGCCGC CTCCGATCTG
GTACCGGCGG AGGCGGACAC GACGACCCGC CTGTCGCGGA GCATCGATCT CGCCGTGCCG
TTGGTGTCGT CGGCCATGGA CACGGTGACC GAGGCCCGCA TGGCCATCGC GATGGCGCGC
CAGGGCGGGG TCGGGGTGCT GCACCGCAAC CTCTCGATCG ACGAACAGGC GCAGCAGGTC
GACATGGTGA AGCGTTCCGA GTCCGGGATG ATCACCGCGC CGGTGACCTG CGGACCCGGC
GCCACCCTGG AGGACGCGAA CGTCCTCATG GCGCGGTACC GGATCTCGGG TGTCCCGGTG
ACGGAATCCG ACGGCAGGCT GGTCGGCATC GTGACGAACC GGGACATCCG CTTCGAGCGG
GACTACTCGC GTCGAGTGCA GGACGTCATG ACTCCGATGC CGCTGATCAC CGCACCGGTG
GGCGTCTCGC CGGAGGACGC GCTCGCCCTG CTCCGCCGCC ACAAGGTCGA GAAGCTGCCG
ATCGTGGACG AGCGTGACCG GTTGCGCGGT CTGATCACCG TCAAGGACTT CACCAAGCGG
GAGCAGTACC CGCATGCCAC CAAGGACACC GACGGCCGGC TGATGGTCGG CGCCGCCGTC
GGCGTGGGGG AGGATGCCTA CAAGCGCGCC CAGGTCCTCG TCGCCGCCGG GGTCGACTTC
CTCGTCGTGG ACACCGCGCA CGGGCATCAT CGAGCCGTGC CCGACGTGGT GCGCCGGATC
AAGACCGACA TGCCGACCGG GGTGGACGGT CGGCCGCTCG ACGTGATCGG TGGCAACGTG
GCGACCGGAG CCGGCGCGGC GGCGCTGATC GCGGCCGGGG CGGACGCCAT CAAGGTCGGG
GTCGGGCCCG GCTCGATCTG TACCACCCGG GTGGTGAGCG GCGTCGGCGT CCCCCAGGTC
ACGGCCATCT ACGAGGCATC GCGCATCGCC CGGGAGCACG GTGTGCCGGT GATCGGCGAC
GGCGGCCTGC AGTACTCGGG TGACATCGCG AAGGCCATCG CCGTCGGCGC CGACACGGTC
ATGCTGGGCA GCCTGCTGGC CGGTGTCGAC GAGAGCCCGG GAGAGTTGAT CTTCATCAAC
GGCAAGCAGT ACAAGGCTTA CCGGGGTATG GGCTCACTCG GCGCCATGCG TAGCCGCGGC
GGCGCGCGGT CGTACTCGAA GGATCGGTAC TTCCAGGACG ATGTGCTCTC CGATGACAAG
CTGGTTCCCG AGGGCGTCGA GGGCCAGGTG CCCTACCGGG GTCCGCTGGC GGCCGTGGCC
CATCAGCTGG TCGGCGGTCT GCGGGCGGCG ATGGGCTACA CCGGCTCGCC CACCATCCGG
CGGATGCAGG ACGAGGCGCA GCTGATCCGT ATTACCTCGG CGGGGCTGAT CGAGAGCCAC
CCCCACGACA TCCAGATGAC CGTCGAGGCG CCAAACTACA ACTCCGCGCG CTGA
 
Protein sequence
MEGAMNGPLA TDFTLPAGVG THVGARPDLL GAEAAAVPPK LAMLGLTFDD VLLLPAASDL 
VPAEADTTTR LSRSIDLAVP LVSSAMDTVT EARMAIAMAR QGGVGVLHRN LSIDEQAQQV
DMVKRSESGM ITAPVTCGPG ATLEDANVLM ARYRISGVPV TESDGRLVGI VTNRDIRFER
DYSRRVQDVM TPMPLITAPV GVSPEDALAL LRRHKVEKLP IVDERDRLRG LITVKDFTKR
EQYPHATKDT DGRLMVGAAV GVGEDAYKRA QVLVAAGVDF LVVDTAHGHH RAVPDVVRRI
KTDMPTGVDG RPLDVIGGNV ATGAGAAALI AAGADAIKVG VGPGSICTTR VVSGVGVPQV
TAIYEASRIA REHGVPVIGD GGLQYSGDIA KAIAVGADTV MLGSLLAGVD ESPGELIFIN
GKQYKAYRGM GSLGAMRSRG GARSYSKDRY FQDDVLSDDK LVPEGVEGQV PYRGPLAAVA
HQLVGGLRAA MGYTGSPTIR RMQDEAQLIR ITSAGLIESH PHDIQMTVEA PNYNSAR