Gene Franean1_6838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6838 
Symbol 
ID5675151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8335206 
End bp8338925 
Gene Length3720 bp 
Protein Length1239 aa 
Translation table11 
GC content74% 
IMG OID641245687 
Productindolepyruvate ferredoxin oxidoreductase 
Protein accessionYP_001511078 
Protein GI158318570 
COG category[C] Energy production and conversion 
COG ID[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGGG CGACCGGCGA GCGGGCGGCG TCCGGCGACG TGATCGAGGC GCCCCCGCTG 
GAGGGCCCGC CCGTCGCTGA TCCGCCCGCG GTCGAGGCCC CGGTGCCCGC CCCGGCCGCG
GCGGCGCTGC TCTCCGGAGT CGAGACCCTG GCGCGCCTGC TCGTCGTGCG CGAGCGGCTG
GATGCCAAGG ACGGCCTGAC CACCGGGACG ATGGTGTCGG GCTACCCGGG CTCGCCCCTG
GGCACGTTCG ACCTGACCCT GGACAAGCTC GGCGACAAGC TGGCCGAGCA CCGCATCCTG
CACCGGCCGG GGCTGAACGA GGAGCTCGGC GCGGCCGTCG TGTGGGGCAG CCAGATGGGC
GCCGTCAACG GCTACGCCGG CGTCGACGGC GTCGCCGGTG CCTGGTACGG CAAGACCCCG
GGGCTCGACC GGTGCGGGGA TGTCCTCAGG CACGCCAACG CGATGGGCGC CGGCCCGAAC
GGCGGCGTCG TCATGTTCTG CGGCGACGAC CCGACCGCCA AGTCGTCCAC CCTGCCCTGC
GACAGCCAGT ACACGTTCGA GGACGCCTGC ATCCCGGTGC TGTTCCCCGG TGACCAGCAG
GAGGTTCTCG ACCTGGGCGT GCACGCCTAC CGGATGTCCC GGTACTGCGG ATCCTGGGCC
GGGATGAAGA TCGTCACCGC GGTGGCGGAC GGCATCGGCA GCGTCGACCT CGACCTCGAC
CGGCACACGC CGGAGGACCC CGAGGACATC CTGGTCGAGG GCCTGCCGTG GCGGCACCGG
CCGCAGGCGA AGATCGGCCC GCACGCGGTC GCCGGCCAGG AGGCGCTCGT CGTCGACCAC
CGGCTGCGCG CCGCGCAGGC CTACGTCCGG CACAACGGCC TCGACCGGGT GGTCGGCGCC
CGGGCGGGCG CGCCGCTGGG CATCGTGTGC GCCGGCAAGA CCTACTTCGA CGTGGTCCAG
GCGTTCTCCG ATCTCGGTGT CCGCCCCGAC GGCCTTGCCG CCGCCGGCGT CCGCGTGCTG
AAGCTCGCGA TGACGTACCC GGTGGTCGAG AGCACCGTCG TCGAGTTCGC CCGGTCGGTC
GAGGAGATCC TCGTCGTCGA GGAGAAGCGG CCGTTCATCG AGACGCAGCT GCGGTCGATC
CTGCACCAGG CGGGGATCAT GGTCCCCGTC GCGGGCAAGA AGGACCTCGA CGGGCGCGCG
CTGCTGTCGA CGGTCGGCGA GCTGGACCCG GCCGCGGTCG GCAAGGCACT GACCCGGGTC
CGCCCGGCGC TCGCCGCCGG TCGCCGCGAC GAGCCGCGGC GGATGTCGCT GCCGCTGCTC
GCGCTGCCGT CCCGGCCGCC GGGCTTCTGT AGCGGCTGCC CGCACAACCG CTCGACGATC
TTCCCGGACG GTGCCCTCGT CGGCGGTGGC GTCGGGTGTC ACGGGATCAT GTACTTCGAG
ACCCGGCACC AGGGCATGAC CAGCCTGCCG CCCACCCCGA TGGGCGCCGA GGGCGTGCCC
TGGATCGGGC TGGCGCCGTT CGTCGACGAG CCGCACCTCA TCCAGAACCT CGGTGACGGC
ACGCTGAGCC ACTCCGGCAT CCTGGCGATC CGGGCGAGCG TCGCCGCCGG CGTCGCCGTC
ACGTTCAAGA TCCTGTACAA CACCGCGGTC GCGATGACGG GCGGCCAGGA CGTCGTCGGC
CTGATGGACG TCCCGGCGAT GACCCGGGCG CTGGAGGCCG AGGGCGTGCG CCGCATCGTG
GTGTGCGCCG AGGACCCGAA GCGCTACGGC CGTCGCGCCC GCTGGGCACC GGGCGTCAAG
GTGCTCGGGC GGGACCACCT CCCGGAGGTG CAGGAGGAAC TGCGGGGCGT GGCCGGCGTC
AGCGTGATCA TCTACGACCA GCGCTGCGCG GCGGAGTCGC GGCGGCTGCG CAAGCGCGGG
CTGCTGCCCG AGCCGCCCCG GCGGGTCGTC ATCAACGAGG CCGTCTGCGA GGGGTGCGGC
GACTGCGGCA CCAAGAGCAA CTGCCTGTCG GTGCTCCCGG TGGAGACCGA GCTCGGCGTG
AAGCGGCGCA TCGACGACCT TTCCTGCAAC CGCGACTACA CCTGCCTGGA CGGCGACTGC
CCGTCCTTCG TCACCGTGCA GCCCCGGTCG CGGACCTGGC CGTGGACGAG GTCCCCGGCG
AAGCGCCGGG CCCGCAGGGC GAGCGCCGAG CAGGCCGCGG GTGCGTCCGC CGGGACGGGC
GAGTCGGTGC GGCCGACCCT GCCCGCCGGT ACCCTGCCGG CGCCCGCCAA CCCGGGTGTC
GACGGCCAGT ACGGCATCTA CGTCACCGGC ATCGGCGGGA CGGGCATCAT CACCGCCAGC
CGGATCCTCG CGGCCGCGGC CGAGTCGGCC GGGCTCGTCG TCGGCGGCGT GGACCAGACC
GGCCTGTCGC AGAAGGCCGG GGCCGTGGTC TCCCACCTGC ACCTGGCGGC GACCAGGGCC
GAGATCGGCT CGGCGACGGT CGGCCCCGGC GGGGCCGACC TCTACCTCTC CGGCGACATC
CTGCAGGCCG CCGGCGGGCC CCAGCTGGAA CGGGTCCGCC CCGGGCACAC CGTGGCGGTC
GTCGAGACCG AGCTGATCCC GACCACGTCG ATGCTCCAGG GCGGCGCGAC CGCGCCCGCC
GACGAGGATC TGCGCAGGGC GATCACCGAC CGGGTCGGCC CCGAGCGGGT CGCCTTTATC
GCCGGGCGGC AGATCGCCGA GCAGGTGTTC GCCGACCAGC TCCTCGGCAA CGTCGTCCTG
CTCGGCGCCG CCTTCCAGCT CGGCGGGCTG CCGTTCACGC TGGACGACGT CGAGAACGCG
ATGCGCCGGC AGGGTAAGGC CGCGGCGAAG AACCGGGAGG CCTTCGAGTG GGGCCGCTGG
GCCGCGCACG ACCCGGCGGC CGTCGAGGCC AGCCTCGCCG GGCCCGCGGC CACCGGGCCC
GCGGCCACCG GGTCGGCGGG CGCGGAGGCC TCCGGCGCCG GTCCGGAGCC GGGCCGGCGG
CCCGGCCTCA CCGACCCGTC ATCGGCGGCG CTGACCCGCG CGGTGGCGCT CGTCGCCGAG
CGCCCGCTGC CCCCGGCGCT GCGTGACCTG CTGGTCCGCC GGGCCGCCCA GGTGATCGAC
TACTCGGGCG ACTCGCTGGC GCGGCGCTTC CTCGGCCTGG TCGAGCAGGC GTCGGCGCGC
GATGACGAGG CCAGGGGGTG GGAGCTCACC CGGGCGGTCG CCGACTCCTG GCACAAGATC
CTGACCTACA AGGACGAGTA CGAGGTCGCC CGGCTGCACC TGAAGACGGA CTACGACGAC
GTCGCGCGTG ACCTCGGCAT CGACGGGCCG TACAAGGTGA CCTACCACCT CCACCCGCCG
GCGCTGCGCC GGCTGGGCAT GTCGAAGAAG ATGCCGATGG GCCGGCCCTA CGCCGTCGCG
TTCCATGGCC TGCGCGCGAT GAAGCGGCTG CGCGGGACGC CCTTCGACAT CTTCGGTTAC
GACCCCGACC GCCGCACCGA GCGGGCCGTG ATCGCGGAGT ACGAGGCGCT GATCACCGAG
CTGGTCCGAC CGGTGCCGGC CGGGGCCGCC GTCGCCTACG AGACGCTGGT CCGGGCCGCC
GAGTCCGTGC AGTCGGTCAA GGGCTATGCG GAGATCAAGG AGGCCGCCGT GGAACGGTGG
CGCGCCGAGA TCGCCCAGCT GCGGCGCGAC CTGGTACCGG CCACCGCCGG GCTCGACTGA
 
Protein sequence
MIGATGERAA SGDVIEAPPL EGPPVADPPA VEAPVPAPAA AALLSGVETL ARLLVVRERL 
DAKDGLTTGT MVSGYPGSPL GTFDLTLDKL GDKLAEHRIL HRPGLNEELG AAVVWGSQMG
AVNGYAGVDG VAGAWYGKTP GLDRCGDVLR HANAMGAGPN GGVVMFCGDD PTAKSSTLPC
DSQYTFEDAC IPVLFPGDQQ EVLDLGVHAY RMSRYCGSWA GMKIVTAVAD GIGSVDLDLD
RHTPEDPEDI LVEGLPWRHR PQAKIGPHAV AGQEALVVDH RLRAAQAYVR HNGLDRVVGA
RAGAPLGIVC AGKTYFDVVQ AFSDLGVRPD GLAAAGVRVL KLAMTYPVVE STVVEFARSV
EEILVVEEKR PFIETQLRSI LHQAGIMVPV AGKKDLDGRA LLSTVGELDP AAVGKALTRV
RPALAAGRRD EPRRMSLPLL ALPSRPPGFC SGCPHNRSTI FPDGALVGGG VGCHGIMYFE
TRHQGMTSLP PTPMGAEGVP WIGLAPFVDE PHLIQNLGDG TLSHSGILAI RASVAAGVAV
TFKILYNTAV AMTGGQDVVG LMDVPAMTRA LEAEGVRRIV VCAEDPKRYG RRARWAPGVK
VLGRDHLPEV QEELRGVAGV SVIIYDQRCA AESRRLRKRG LLPEPPRRVV INEAVCEGCG
DCGTKSNCLS VLPVETELGV KRRIDDLSCN RDYTCLDGDC PSFVTVQPRS RTWPWTRSPA
KRRARRASAE QAAGASAGTG ESVRPTLPAG TLPAPANPGV DGQYGIYVTG IGGTGIITAS
RILAAAAESA GLVVGGVDQT GLSQKAGAVV SHLHLAATRA EIGSATVGPG GADLYLSGDI
LQAAGGPQLE RVRPGHTVAV VETELIPTTS MLQGGATAPA DEDLRRAITD RVGPERVAFI
AGRQIAEQVF ADQLLGNVVL LGAAFQLGGL PFTLDDVENA MRRQGKAAAK NREAFEWGRW
AAHDPAAVEA SLAGPAATGP AATGSAGAEA SGAGPEPGRR PGLTDPSSAA LTRAVALVAE
RPLPPALRDL LVRRAAQVID YSGDSLARRF LGLVEQASAR DDEARGWELT RAVADSWHKI
LTYKDEYEVA RLHLKTDYDD VARDLGIDGP YKVTYHLHPP ALRRLGMSKK MPMGRPYAVA
FHGLRAMKRL RGTPFDIFGY DPDRRTERAV IAEYEALITE LVRPVPAGAA VAYETLVRAA
ESVQSVKGYA EIKEAAVERW RAEIAQLRRD LVPATAGLD