Gene Franean1_7269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7269 
Symbol 
ID5675570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8873275 
End bp8875032 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content75% 
IMG OID641246106 
Productdehydrogenase catalytic domain-containing protein 
Protein accessionYP_001511494 
Protein GI158318986 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.325123 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCATC GACAGTTCCG CCTCCCCGAT CTCGGCGAGG GGCTGACCGA GGCGGAGATC 
GTCCGGTGGC TGGTGGAGGT CGGCGAGACC GTGACGGTCA ACCAGCCGCT GGTCGAGGTC
GAGACGGCCA AGGCGGTCGT CGAGATCCCG TCCCCGTTCG CGGGTGTGCT CGTCGAACGG
CACGGCGAGG CCGGCACCGA GCTCGCCGTG GGCACGCCGC TGCTGACCAT CGACGAGCCG
GGCGACGAGC CCGCGACCGG CCCGACGACC GGGTCAGTCA CCGGAGCTAC CGAAGCCACC
GGCCAGGAGA CCACGCCGGG GGACGCCACT CGCAACGGCG CGGCATCCCT GCCCGTGTCC
CGGACCGGAG AGGCCGCGGA CGTCCCAGGG GGGGCCGGCA CGGCCGACCT GTCACCGATG
GCCGCCGGGC GCACCCCGAT GCTGGTGGGG TACGGCCCGC GCAGCGACTC CGGCCCGCGC
CGCCGGCGGC GCCCCCGCAA CCCGGACGGC CCCCTGCCCG GCTCGACGAC GCCCGCGGCG
ACCATCGCCG CCCCCGCGCC CGCCCCCGGC ACAGCGCCCA CTGCCAGCTC AGCGCCCAGG
GCCATGGCAC CCGTCGTGTC GTCGGTGCCG GCCTCGGCGC CAGCCACATC CCCGGCGTCG
GCCGCGGCCG GTGCGCCGGA CCGATCAGCC GTAGTGCCGA TCGGGGCAGC CCCGCGGCAT
GGGCGCGTCG CGGCGAAGCC ACCCGTCCGC AAGCTCGCGC GTGATCTGGG TGTGGACCTC
TCCACGCTCG CGGGCACCGG TCCCGCGGGC ACCATCAGCC GCGCCGACGT CGAAACCGCG
GCGCGTCAGG CCACGCCCCC CGAGCCCGCG CCCGTCCCCA CCACCGCCAC TCCGACGACC
CGCACTGGGC CGGTCCGTGT CCCTGGCGTT GTCCCTTCGT CCAACGGAAT CGACGGCCCG
CGCCAGCCGG AACTGAACGG ACACGCGGCG ACCACACGGC GGGTGCCCGG AGCCATCCCA
CCCGACGCCG GATTCAACGA CACCGACCGG ATCTGGCGGA TCCCCGTCAC GGGCGTGCGG
CGCACCATGG CACGGGCGAT GGTGGCCAGC GTGTTCTCGG CCCCGCACGC CACCGAGTTC
CTCAGCGTGG ACGTCACCGA GACGATGGCG GCCCGCGAGC GGATCGCCGC CCTGCCGGAC
TTCGCCGGCA TCCGGGTCAC GCCGCTGCTG CTCGTGGCGA AGGCGCTCCT CACCGCCGTC
CGGCGCCACC CAATGATCAA CTCGACCTGG GTGGGCGACA CGTCCGGGGA GAACGCCGAG
ATCCAGGTGC ACGAGCGGAT CAACCTCGGC ATCGCCGTGG CCGGGCCGCG TGGCCTGGTC
GTCCCGAACA TCCCGGACGC CGGATCGCGC GGCCTGGTCG ACCTCGCCCG CAGCCTGCAC
TCCCTCACCG AGGCCGCGCG CGCCGACCGG CTGCGCCCGG CCGACCTCTC CGGCGGGACC
ATCACCATCA CCAACGTCGG AGTTCTCGGG GTGGACACCG GGGCACCGGT CCTCAATCCC
GGTGAGGCCG CGATCCTCGC CCTCGGCGCG ATCCGCCCGG CTCCCTGGGT GCACGAAGGC
GAGCTGGCGG TACGGACGGT GGCCCACCTC GCGCTGTCCT TCGACCACCG CGTCGTGGAC
GGCGAGCTCG GCTCGGCGGT CCTGGCCGAC GTCGCGGCCG TCCTCGCCGA CCCCGTCATC
GCGCTCGCCT GGAGCTGA
 
Protein sequence
MTHRQFRLPD LGEGLTEAEI VRWLVEVGET VTVNQPLVEV ETAKAVVEIP SPFAGVLVER 
HGEAGTELAV GTPLLTIDEP GDEPATGPTT GSVTGATEAT GQETTPGDAT RNGAASLPVS
RTGEAADVPG GAGTADLSPM AAGRTPMLVG YGPRSDSGPR RRRRPRNPDG PLPGSTTPAA
TIAAPAPAPG TAPTASSAPR AMAPVVSSVP ASAPATSPAS AAAGAPDRSA VVPIGAAPRH
GRVAAKPPVR KLARDLGVDL STLAGTGPAG TISRADVETA ARQATPPEPA PVPTTATPTT
RTGPVRVPGV VPSSNGIDGP RQPELNGHAA TTRRVPGAIP PDAGFNDTDR IWRIPVTGVR
RTMARAMVAS VFSAPHATEF LSVDVTETMA ARERIAALPD FAGIRVTPLL LVAKALLTAV
RRHPMINSTW VGDTSGENAE IQVHERINLG IAVAGPRGLV VPNIPDAGSR GLVDLARSLH
SLTEAARADR LRPADLSGGT ITITNVGVLG VDTGAPVLNP GEAAILALGA IRPAPWVHEG
ELAVRTVAHL ALSFDHRVVD GELGSAVLAD VAAVLADPVI ALAWS