Gene Franean1_5998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5998 
Symbol 
ID5674319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7315857 
End bp7317650 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content72% 
IMG OID641244846 
Productinosine-5'-monophosphate dehydrogenase 
Protein accessionYP_001510248 
Protein GI158317740 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0516] IMP dehydrogenase/GMP reductase
[COG0517] FOG: CBS domain 
TIGRFAM ID[TIGR01302] inosine-5'-monophosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.352263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0532211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCGGGCC TCTACGATGG AGTATCACCG TCGCCCGACC GGCGGGCCGC GCGGAGCCCA 
GCATGCTTCA CCCGCCTGCC CGGGCAGACC GGCTCGCGCC GGTTCCCGGG CGCGGGACTC
CCCAGACATC GCGGGGACGC TCCCGCCGGC GCCGGGCCGA TGCGAACGGA AGGTGCCATG
GACGGTTCCC CCGCCAACCC GACGACCTCG ATCACGCGCG CGACCGGGGA ACCCGACCTC
GACCAGTCGC CCGTCCGGCA CGTCGCTGAC AGCCTGGGCG CCGAGGGGCC GGCGCTCCCG
CCGAAGCTCG CGATGCTCGG CCTCACCTTC GACGACGTCC TGCTGCTGCC GGCGGCGTCC
GAGGTGGCGC CGTCGGGCGT CGACACGACC ACGCGGCTGT CCCGCAACAT CTCGCTGGCG
GTCCCGCTGG TGTCCTCCGC GATGGACACC GTGACCGAGG CCCGGATGGC GATCGCGATG
GCCCGCCAGG GCGGCGTCGG CGTGCTGCAC CGCAACCTGT CCGTCGACGA CCAGGCACAG
CAGGTCGACA TGGTGAAGCG GTCCGAGTCG GGGATGATCA CCTCCCCGAT CACCTGCGGG
CCGGACGCCA CCATCGAAGA GGCGAACGTG CTCATGGCGC GTTACCGCAT CTCCGGCGTC
CCGGTGACCG AGCCGGACGG CCGCCTCGTC GGGATCGTGA CCAACCGCGA CATCCGCTTC
GAGCGGGACT ACACCCGCCG CGTCCACGAA GTGATGACCC GCATGCCGCT GATCACGGCA
CCGGTCGGGG TCTCGGCGGA CGACGCGCTC GCTCTGCTGC GGCACAACAA GGTCGAGAAG
CTGCCCATCG TCGACGGGCA CGACCGGCTG TGCGGACTGA TCACGGTCAA GGACTTCACC
AAGCGCGAGC AGTACCCGCG GGCGACCAAG GACGCGGACG GCCGCCTCGT GGTCGGCGCC
GCGGTCGGCG TCGGCGAGGA CGCGCTGAAG CGCGCCCAGG TCCTCGTCGC CGCCGGGGTG
GACTTCCTCG TCGTCGACAC CGCGCACGGC CACCACCACG CCGTCCCCGA CATGATCGCC
CGGATCAAGG CCGAGATGCC CGCCGGGGTG GACGGCCGTC CGCTCGACGT CATCGGTGGC
AACATCGCCA CCGCGGCCGG GGCCGCCGCG CTCATCGCGG CCGGCGCCGA CGCCGTCAAG
GTCGGGGTCG GGCCGGGCTC GATCTGCACG ACCCGGGTGG TCACCGGCGT GGGTGTTCCG
CAGGTCACCG CGATCTACGA GGCGGCGCGG GCGGCGCGGG CCGCCGGCGT GCCGGTCATC
GGCGACGGCG GGCTGCAGTA CTCCGGTGAC ATCGCCAAGG CGATTGCCGT CGGCGCGGAC
ACCGTGATGC TCGGCAGCCT GCTCGCCGGT GTCGACGAGA GCCCCGGTGA GCTCATCTTC
ATCAACGGCA AGCAGTACAA GTCCTACCGG GGGATGGGCT CGCTCGGCGC CATGCGCAGC
CGCGGCGACA CCCGCTCCTA CTCCAAGGAC CGCTACTTCC AGGACGACGT CCTCTCCGAC
GACAAGCTCG TCCCCCAGGG TGTCGAGGGC CAGGTGCCCT ACCGCGGCTC GCTGGCCGGG
ATGGCCCACC AGCTGATCGG CGGCCTGCAG GCCGCGATGG GGTACACGGG TGCCGCGACC
ATCCGTGATC TCCAGGAGAA CAGCCAGCTG GTGCGGATCA CGTCGGCTGG CCTGACCGAG
AGCCACGCGC ACGACGTCCA GATGACGGTC GAGGCGCCGA ACTACACCCG GTGA
 
Protein sequence
MPGLYDGVSP SPDRRAARSP ACFTRLPGQT GSRRFPGAGL PRHRGDAPAG AGPMRTEGAM 
DGSPANPTTS ITRATGEPDL DQSPVRHVAD SLGAEGPALP PKLAMLGLTF DDVLLLPAAS
EVAPSGVDTT TRLSRNISLA VPLVSSAMDT VTEARMAIAM ARQGGVGVLH RNLSVDDQAQ
QVDMVKRSES GMITSPITCG PDATIEEANV LMARYRISGV PVTEPDGRLV GIVTNRDIRF
ERDYTRRVHE VMTRMPLITA PVGVSADDAL ALLRHNKVEK LPIVDGHDRL CGLITVKDFT
KREQYPRATK DADGRLVVGA AVGVGEDALK RAQVLVAAGV DFLVVDTAHG HHHAVPDMIA
RIKAEMPAGV DGRPLDVIGG NIATAAGAAA LIAAGADAVK VGVGPGSICT TRVVTGVGVP
QVTAIYEAAR AARAAGVPVI GDGGLQYSGD IAKAIAVGAD TVMLGSLLAG VDESPGELIF
INGKQYKSYR GMGSLGAMRS RGDTRSYSKD RYFQDDVLSD DKLVPQGVEG QVPYRGSLAG
MAHQLIGGLQ AAMGYTGAAT IRDLQENSQL VRITSAGLTE SHAHDVQMTV EAPNYTR