Gene Franean1_4274 details


Gene Information

Locus tag: Franean1_4274
Symbol:
ID: 5672629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5108256
End bp: 5110475
Gene Length: 2220 bp
Protein Length: 739 aa
Translation table: 11
GC content: 70%
IMG OID: 641243147
Product: isocitrate dehydrogenase, NADP-dependent
Protein accession: YP_001508564
Protein GI: 158316056
COG category: [C] Energy production and conversion
COG ID: [COG2838] Monomeric isocitrate dehydrogenase
TIGRFAM ID: [TIGR00178] isocitrate dehydrogenase, NADP-dependent, monomeric type
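
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with one untranslated stop codon; neither assumption is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 5108256
end_bp = 5110475
reported_gene_length = 2220    # bp
reported_protein_length = 739  # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)  # prints: 2220 0 739
```

Under these assumptions the computed values agree with the reported gene length (2220 bp) and protein length (739 aa).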


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.876075
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.344018
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTGATT CGGTGATCAT CTACACCCAG ACGGACGAGG CGCCGGCGCT GGCGACGTAC 
TCGTTCCTGC CGGTGGTGCA GGCGTACGCG TCGACGGCCG GGGTCGGCGT TGTCAGCCGC
GACATTTCCC TGGCCGGGCG GATCATCGCC AGCTTCCCGG AGCACCTCGA GCCGTCCCAG
CGCATCGACG ACGCGCTCGC CGAGCTCGGC GACCTGGCCC GCACGCCTGA GGCGAACATC
ATCAAGCTGC CGAACATCTC GGCGTCCATC CCGCAGCTGC GGGCCGCCGT CGTCGAGCTG
CAGAAGCAGG GCTACGCGCT GCCGGACTAC CCCGACGACC CGAAGACCGA CGAGGAGCGG
GAGATCCGCG CCCGTTACGA CAAGGTCAAG GGCAGCGCCG TCAACCCCGT GCTGCGCGAG
GGCAACTCCG ACCGCCGCGC GCCGGCGTCG GTGAAGAGCT ACGCCAAGGC CCACCCGCAC
CGCATGGGCG CCTGGAGCGT CGAGTCGAAG ACGAACGTCG CCACCATGAG CGCCGACGAC
TTCCGTTCGA CCGAGAAGTC CGCGGTGATC GCCGCGGACG GCTCCCTGCG TATCGAGCTG
GTCGGCGACG ACGGCACCAC GTCCGTCCTG CGCGAGTCGG TACCGGTGCT CGCCAGCGAA
GTCGTCGACG TCTCGGTCAT GCGGGTCGGC GCGCTGCGCG AGTTCCTCAC CGAGCAGGTG
GCCCGGGCGA AGGCCGAGGG CGTGCTGTTC TCGGTGCACC TGAAGGCCAC GATGATGAAG
GTCTCCGACC CGATCATCTT CGGCCACGTC GTGCGTGCGT TCTTCCCCGA GACGTTCGCC
CGCCACGGCG CGGCGCTCGC GGCGGCCGGC CTGACCCCGA ACGAGGGCCT CGGCGGCATC
TTCAAGGGGC TGGAGGCGCT GCCCGAGGGC GCCGAGATCA AGGCCTCCTT CGAGGCCGAG
CTCGCCGCCG GGCCGCCGCT GGCCATGGTC GACTCCGACC GGGGCATCAC GAACCTGCAC
GTGCCCAGCG ACGTCATCGT CGACGCGTCC ATGCCGGCGA TGATCCGCAC CTCCGGCCAC
ATGTGGGGGC CGGACGGTGC CGAGGCGGAC ACCCTCGCCG TCCTGCCCGA CAGCAGCTAC
GCCGGCATCT ACCAGGCCGT CATCGACGAC TGCCGCGCGC ACGGCGCCTA CGACCCGGCG
ACGATGGGCT CGGTGCCCAA CGTCGGCCTG ATGGCGCAGA AGGCCGAGGA GTACGGCAGC
CACGACAAGA CCTTCGAGAT CCCGACGACC GGCACCGTGC GGCTCGTCGA CGGAGCGGGC
ACGGCGGTCC TCGAGCTGAC CGTGAGCGCG GGTGACATCT TCCGCGCCTG CCAGACCAAG
GACGCCCCGA TCCGCGACTG GGTGAAGCTC GCCGTCACCC GGGCCTGCGC CACCGGCAAC
CCGGCCGTGT TCTGGCTTGA CGAGACCCGC GCGCACGACG CCCAGCTCAT CACCAAGGTC
CGGACCTACC TGGCCGACCA CGACACCACC GGGCTGCAGA TCGAGATCAA GGCCCCGGTC
GACGCGATCA CCTTCTCGCT GGAGCGGATC CGCCGCGGCG AGGACACCAT CTCGGTGACC
GGCAACGTGC TGCGTGACTA CCTGACCGAC CTGTTCCCGA TCCTGGAGCT CGGCACGAGC
GCAAAGATGC TCTCGGTCGT CCCGCTGATG AACGGCGGCG GCCTGTTCGA GACCGGTGCC
GGCGGGTCGG CGCCCAAGCA CGTCCAGCAG CTCGTCAAGG AGAACTACCT GCGCTGGGAC
AGCCTGGGTG AGTTCCTGGC GCTGGCGGTG AGCTTCGAGC ACCTCGCGCA GCGCACCGGC
AACGCCCGCG CCCAGGTGCT GGCCGACACC CTCGACCGGG CGACCGCGAC CTTCCTCAAC
GAGGACAAGT CGCCCACCCG CCGGGTCGGC GGCATCGACA ACCGGGGCAG CCACTTCTAC
CTGGCCCTGT ACTGGGCCCA GGAACTCGCG GCGCAGACCG ACGACGCCGC GCTCGCGCAG
GCGTTCACCA GCCTGGCCGA GAGCCTCGCG GCCCAGGAGA AGACCATCGT CGACGAGCTG
CTCGCCGTGC AGGGCTCGCC GGCCGACATC GGCGGCTACT ACCAGCCCGA CCCGGCCAAG
GCCGCGGCCG TGATGCGCCC GTCGAAGACC TTCAACGAGG CCATCGCGAC GCTTGCCTGA
 
Protein sequence
MTDSVIIYTQ TDEAPALATY SFLPVVQAYA STAGVGVVSR DISLAGRIIA SFPEHLEPSQ 
RIDDALAELG DLARTPEANI IKLPNISASI PQLRAAVVEL QKQGYALPDY PDDPKTDEER
EIRARYDKVK GSAVNPVLRE GNSDRRAPAS VKSYAKAHPH RMGAWSVESK TNVATMSADD
FRSTEKSAVI AADGSLRIEL VGDDGTTSVL RESVPVLASE VVDVSVMRVG ALREFLTEQV
ARAKAEGVLF SVHLKATMMK VSDPIIFGHV VRAFFPETFA RHGAALAAAG LTPNEGLGGI
FKGLEALPEG AEIKASFEAE LAAGPPLAMV DSDRGITNLH VPSDVIVDAS MPAMIRTSGH
MWGPDGAEAD TLAVLPDSSY AGIYQAVIDD CRAHGAYDPA TMGSVPNVGL MAQKAEEYGS
HDKTFEIPTT GTVRLVDGAG TAVLELTVSA GDIFRACQTK DAPIRDWVKL AVTRACATGN
PAVFWLDETR AHDAQLITKV RTYLADHDTT GLQIEIKAPV DAITFSLERI RRGEDTISVT
GNVLRDYLTD LFPILELGTS AKMLSVVPLM NGGGLFETGA GGSAPKHVQQ LVKENYLRWD
SLGEFLALAV SFEHLAQRTG NARAQVLADT LDRATATFLN EDKSPTRRVG GIDNRGSHFY
LALYWAQELA AQTDDAALAQ AFTSLAESLA AQEKTIVDEL LAVQGSPADI GGYYQPDPAK
AAAVMRPSKT FNEAIATLA
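
The gene sequence is printed in blocks of ten bases separated by spaces, so it needs light cleaning before it can be checked against the record's GC content or its start and stop codons. The sketch below shows one way to do that. The function names (`clean_sequence`, `gc_content`, `looks_like_complete_cds`) are hypothetical helpers, not part of any database tool, and the start-codon set is an assumed subset of the initiators allowed by translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Assumption: a commonly used subset of table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons of translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq: str) -> bool:
    """Check length is a multiple of 3 and the ends are start/stop codons."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# The first two blocks of the gene sequence, as printed in the record.
first_blocks = clean_sequence("GTGACTGATT CGGTGATCAT")
print(first_blocks[:3] in START_CODONS)  # True: the gene starts with GTG
```

Passing the full gene sequence through `clean_sequence` and then `gc_content` gives a value to compare with the reported 70%. The GTG start codon is consistent with the protein sequence beginning with M, since an initiator codon is translated as methionine.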