Gene Francci3_2203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2203 
Symbol 
ID3906342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2575017 
End bp2577236 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content70% 
IMG OID637879535 
Productisocitrate dehydrogenase, NADP-dependent 
Protein accessionYP_481301 
Protein GI86740901 
COG category[C] Energy production and conversion 
COG ID[COG2838] Monomeric isocitrate dehydrogenase 
TIGRFAM ID[TIGR00178] isocitrate dehydrogenase, NADP-dependent, monomeric type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.199816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGACT CGACCATCAT CTATACACAC ACTGACGAGG CTCCGGCCCT GGCGACGTAC 
TCGTTCCTGC CCGTGGTCCA GGCGTACGCC GCGACGGCCG GGGTTAGTGT CGACAGCCGC
GACATCTCCC TGGCGGGGCG GATCATCGCC AGTTTCCCCG AGGATCTCGA GGAGGCCCAG
CGCATCGAGG ACGCGCTCGC CGAGCTCGGT GAGCTGGCCA GGACGCCCGA GGCGAACATC
ATCAAGCTGC CGAACATCTC GGCTTCGATT CCGCAGCTGA AGGCGGCGAT CGTCGAGCTG
CAGAAGCGGG GCTATGCGCT GCCGGACTAC CCGGACGACC CGAAGACCGA CGAGGAGCGG
GAGATCCGCG CCCGTTATGA CAAGGTGAAG GGCAGCGCCG TCAACCCGGT GCTGCGCGAG
GGCAACTCCG ACCGCCGCGC CCCGGCCTCG GTGAAGAACT ACGCCCGGAC CCACCCGCAC
CGGATGGGCG CGTGGACACC CGACTCGAAG ACGAACGTCG CCACGATGGG CGCCGGCGAC
TTCCGCTCCA CCGAGAAGTC CACGATCATC GCCGCGGCCG GCACGCTGCG CATCGAGCTT
GCGGGTGACG ACGGCAGCAC CACCGTCCTG CTCCCGTCCC TGCCGGTGCT GGCGGGCGAG
GTAGTGGACG CGGCCGTCCT GCGGGTGGCC GCGCTGCGCG AGTTCCTCAC CGCGCAGATC
GCCCGGGCCA AGGCCGAGGG CGTGCTGTTT TCGGTGCACC TGAAGGCCAC GATGATGAAG
GTCTCCGACC CGATCATCTT CGGCCACGTG GTACGGGCCT TCTTCCCGAA GACGTTCGCC
ACCCACGGCG CGACGCTCGC CGCGGCCGGC CTGTCCCCGA ACGACGGGCT CGGCGGTATC
CTCAAGGGCC TGGAATCGCT GCCCGAGGGC GCGCAGATCA AGGCCTCCTT CGCCGCGGAG
CTCGCCGAGG GCCCCGCCCT GGCAATGGTC GACTCCGACC GCGGCATCAC CAACCTGCAC
GTGCCCAGCG ACGTCATCGT CGACGCCTCC ATGCCGGCCA TGATCCGTAC CTCCGGCCAC
ATGTGGGGCC CGGACGGCCG GGAGGCGGAC ACCCTGGCGG TCCTGCCCGA CTCGAGCTAC
GCCGGCATCT ACCAGGTCGC CATCGACGAC TGCCGGGCGC ACGGCGCCTA CGACCCCGCG
ACGATGGGCT CGGTGGCCAA CGTCGGCCTG ATGGCGCAGA AGGCCGAGGA GTACGGCAGC
CACGACAAGA CCTTCGAGAT CCCCACCACG GGCACGGTGC GGATCGTCGA CCAGGCCGGC
ACCGTGGTGC TGGAGCAGGC GGTCGCCGCC GGCGACATCT TTCGCGCCTG CCAGACCAAG
GACGCGCCGA TCCGGGACTG GGTGAAGCTC GCCGTCGCGC GCGCCCGCGC CACCGGTGAC
CCGGCGGTGT TCTGGCTCGA CGAGAACCGC GCCCACGACG CGAGGCTGAT CGAGAAGGTC
CGGGCGTATC TGCCCGGGCA CGACACCTCC GGGCTGCGGA TCGAGATCAA GGCTCCGGTC
GACGCGATCG CGTTCTCACT GGAGCGGATC CGCCGCGGCG AGAACACGAT CTCAGTCACC
GGCAACGTGC TGCGTGACTA TCTGACCGAC CTGTTCCCGA TCCTGGAGCT CGGCACGAGT
GCCAAGATGC TCTCGATCGT CCCGCTCATG AACGGTGGCG GCCTGTTCGA GACCGGCGCC
GGCGGCTCCG CGCCCAAGCA CGTCCAGCAG CTGCTCAGGG AGAACTACCT GCGCTGGGAC
AGCCTGGGCG AGTTCCTGGC GCTCGCGGCC AGCTTCGAGC AGTTCGCGCG GACGACGGGC
AACGCGCGTG CCCAGGTGCT CGCCGACACC CTCGACCGCG CGACCGCCAC CTTCCTCAAC
GAGGACAAGT CGCCCAGCCG CCGGCTGGGT GGCCTCGACA ACCGCGGCAG CCATTTCTAC
CTGGCGCTCT ACTGGGCGCA GGAGCTGGCC GCGCAGATCG GCGACCCCCG TCTCGCGGAG
GCGTTCGCCG GTCTCGCCAA GACGCTCGCC GACCAGGAGA AGGCGATCGT CGACGAACTG
ATCGCGGTCC AGGGCTCGCC TGCCGACATC GGGGGGTACT ACCAGCCCGA CCCCGTCAAG
GCCGCGGCCG TCATGCGTCC GTCGAAGATC TTCAACGAGG CCATCGCCAG CCTCGGCTGA
 
Protein sequence
MTDSTIIYTH TDEAPALATY SFLPVVQAYA ATAGVSVDSR DISLAGRIIA SFPEDLEEAQ 
RIEDALAELG ELARTPEANI IKLPNISASI PQLKAAIVEL QKRGYALPDY PDDPKTDEER
EIRARYDKVK GSAVNPVLRE GNSDRRAPAS VKNYARTHPH RMGAWTPDSK TNVATMGAGD
FRSTEKSTII AAAGTLRIEL AGDDGSTTVL LPSLPVLAGE VVDAAVLRVA ALREFLTAQI
ARAKAEGVLF SVHLKATMMK VSDPIIFGHV VRAFFPKTFA THGATLAAAG LSPNDGLGGI
LKGLESLPEG AQIKASFAAE LAEGPALAMV DSDRGITNLH VPSDVIVDAS MPAMIRTSGH
MWGPDGREAD TLAVLPDSSY AGIYQVAIDD CRAHGAYDPA TMGSVANVGL MAQKAEEYGS
HDKTFEIPTT GTVRIVDQAG TVVLEQAVAA GDIFRACQTK DAPIRDWVKL AVARARATGD
PAVFWLDENR AHDARLIEKV RAYLPGHDTS GLRIEIKAPV DAIAFSLERI RRGENTISVT
GNVLRDYLTD LFPILELGTS AKMLSIVPLM NGGGLFETGA GGSAPKHVQQ LLRENYLRWD
SLGEFLALAA SFEQFARTTG NARAQVLADT LDRATATFLN EDKSPSRRLG GLDNRGSHFY
LALYWAQELA AQIGDPRLAE AFAGLAKTLA DQEKAIVDEL IAVQGSPADI GGYYQPDPVK
AAAVMRPSKI FNEAIASLG