Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6838 |
Symbol | |
ID | 5675151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8335206 |
End bp | 8338925 |
Gene Length | 3720 bp |
Protein Length | 1239 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245687 |
Product | indolepyruvate ferredoxin oxidoreductase |
Protein accession | YP_001511078 |
Protein GI | 158318570 |
COG category | [C] Energy production and conversion |
COG ID | [COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGGGG CGACCGGCGA GCGGGCGGCG TCCGGCGACG TGATCGAGGC GCCCCCGCTG GAGGGCCCGC CCGTCGCTGA TCCGCCCGCG GTCGAGGCCC CGGTGCCCGC CCCGGCCGCG GCGGCGCTGC TCTCCGGAGT CGAGACCCTG GCGCGCCTGC TCGTCGTGCG CGAGCGGCTG GATGCCAAGG ACGGCCTGAC CACCGGGACG ATGGTGTCGG GCTACCCGGG CTCGCCCCTG GGCACGTTCG ACCTGACCCT GGACAAGCTC GGCGACAAGC TGGCCGAGCA CCGCATCCTG CACCGGCCGG GGCTGAACGA GGAGCTCGGC GCGGCCGTCG TGTGGGGCAG CCAGATGGGC GCCGTCAACG GCTACGCCGG CGTCGACGGC GTCGCCGGTG CCTGGTACGG CAAGACCCCG GGGCTCGACC GGTGCGGGGA TGTCCTCAGG CACGCCAACG CGATGGGCGC CGGCCCGAAC GGCGGCGTCG TCATGTTCTG CGGCGACGAC CCGACCGCCA AGTCGTCCAC CCTGCCCTGC GACAGCCAGT ACACGTTCGA GGACGCCTGC ATCCCGGTGC TGTTCCCCGG TGACCAGCAG GAGGTTCTCG ACCTGGGCGT GCACGCCTAC CGGATGTCCC GGTACTGCGG ATCCTGGGCC GGGATGAAGA TCGTCACCGC GGTGGCGGAC GGCATCGGCA GCGTCGACCT CGACCTCGAC CGGCACACGC CGGAGGACCC CGAGGACATC CTGGTCGAGG GCCTGCCGTG GCGGCACCGG CCGCAGGCGA AGATCGGCCC GCACGCGGTC GCCGGCCAGG AGGCGCTCGT CGTCGACCAC CGGCTGCGCG CCGCGCAGGC CTACGTCCGG CACAACGGCC TCGACCGGGT GGTCGGCGCC CGGGCGGGCG CGCCGCTGGG CATCGTGTGC GCCGGCAAGA CCTACTTCGA CGTGGTCCAG GCGTTCTCCG ATCTCGGTGT CCGCCCCGAC GGCCTTGCCG CCGCCGGCGT CCGCGTGCTG AAGCTCGCGA TGACGTACCC GGTGGTCGAG AGCACCGTCG TCGAGTTCGC CCGGTCGGTC GAGGAGATCC TCGTCGTCGA GGAGAAGCGG CCGTTCATCG AGACGCAGCT GCGGTCGATC CTGCACCAGG CGGGGATCAT GGTCCCCGTC GCGGGCAAGA AGGACCTCGA CGGGCGCGCG CTGCTGTCGA CGGTCGGCGA GCTGGACCCG GCCGCGGTCG GCAAGGCACT GACCCGGGTC CGCCCGGCGC TCGCCGCCGG TCGCCGCGAC GAGCCGCGGC GGATGTCGCT GCCGCTGCTC GCGCTGCCGT CCCGGCCGCC GGGCTTCTGT AGCGGCTGCC CGCACAACCG CTCGACGATC TTCCCGGACG GTGCCCTCGT CGGCGGTGGC GTCGGGTGTC ACGGGATCAT GTACTTCGAG ACCCGGCACC AGGGCATGAC CAGCCTGCCG CCCACCCCGA TGGGCGCCGA GGGCGTGCCC TGGATCGGGC TGGCGCCGTT CGTCGACGAG CCGCACCTCA TCCAGAACCT CGGTGACGGC ACGCTGAGCC ACTCCGGCAT CCTGGCGATC CGGGCGAGCG TCGCCGCCGG CGTCGCCGTC ACGTTCAAGA TCCTGTACAA CACCGCGGTC GCGATGACGG GCGGCCAGGA CGTCGTCGGC CTGATGGACG TCCCGGCGAT GACCCGGGCG CTGGAGGCCG AGGGCGTGCG CCGCATCGTG GTGTGCGCCG AGGACCCGAA GCGCTACGGC CGTCGCGCCC GCTGGGCACC GGGCGTCAAG GTGCTCGGGC GGGACCACCT CCCGGAGGTG CAGGAGGAAC TGCGGGGCGT GGCCGGCGTC AGCGTGATCA TCTACGACCA GCGCTGCGCG GCGGAGTCGC GGCGGCTGCG CAAGCGCGGG CTGCTGCCCG AGCCGCCCCG GCGGGTCGTC ATCAACGAGG CCGTCTGCGA GGGGTGCGGC GACTGCGGCA CCAAGAGCAA CTGCCTGTCG GTGCTCCCGG TGGAGACCGA GCTCGGCGTG AAGCGGCGCA TCGACGACCT TTCCTGCAAC CGCGACTACA CCTGCCTGGA CGGCGACTGC CCGTCCTTCG TCACCGTGCA GCCCCGGTCG CGGACCTGGC CGTGGACGAG GTCCCCGGCG AAGCGCCGGG CCCGCAGGGC GAGCGCCGAG CAGGCCGCGG GTGCGTCCGC CGGGACGGGC GAGTCGGTGC GGCCGACCCT GCCCGCCGGT ACCCTGCCGG CGCCCGCCAA CCCGGGTGTC GACGGCCAGT ACGGCATCTA CGTCACCGGC ATCGGCGGGA CGGGCATCAT CACCGCCAGC CGGATCCTCG CGGCCGCGGC CGAGTCGGCC GGGCTCGTCG TCGGCGGCGT GGACCAGACC GGCCTGTCGC AGAAGGCCGG GGCCGTGGTC TCCCACCTGC ACCTGGCGGC GACCAGGGCC GAGATCGGCT CGGCGACGGT CGGCCCCGGC GGGGCCGACC TCTACCTCTC CGGCGACATC CTGCAGGCCG CCGGCGGGCC CCAGCTGGAA CGGGTCCGCC CCGGGCACAC CGTGGCGGTC GTCGAGACCG AGCTGATCCC GACCACGTCG ATGCTCCAGG GCGGCGCGAC CGCGCCCGCC GACGAGGATC TGCGCAGGGC GATCACCGAC CGGGTCGGCC CCGAGCGGGT CGCCTTTATC GCCGGGCGGC AGATCGCCGA GCAGGTGTTC GCCGACCAGC TCCTCGGCAA CGTCGTCCTG CTCGGCGCCG CCTTCCAGCT CGGCGGGCTG CCGTTCACGC TGGACGACGT CGAGAACGCG ATGCGCCGGC AGGGTAAGGC CGCGGCGAAG AACCGGGAGG CCTTCGAGTG GGGCCGCTGG GCCGCGCACG ACCCGGCGGC CGTCGAGGCC AGCCTCGCCG GGCCCGCGGC CACCGGGCCC GCGGCCACCG GGTCGGCGGG CGCGGAGGCC TCCGGCGCCG GTCCGGAGCC GGGCCGGCGG CCCGGCCTCA CCGACCCGTC ATCGGCGGCG CTGACCCGCG CGGTGGCGCT CGTCGCCGAG CGCCCGCTGC CCCCGGCGCT GCGTGACCTG CTGGTCCGCC GGGCCGCCCA GGTGATCGAC TACTCGGGCG ACTCGCTGGC GCGGCGCTTC CTCGGCCTGG TCGAGCAGGC GTCGGCGCGC GATGACGAGG CCAGGGGGTG GGAGCTCACC CGGGCGGTCG CCGACTCCTG GCACAAGATC CTGACCTACA AGGACGAGTA CGAGGTCGCC CGGCTGCACC TGAAGACGGA CTACGACGAC GTCGCGCGTG ACCTCGGCAT CGACGGGCCG TACAAGGTGA CCTACCACCT CCACCCGCCG GCGCTGCGCC GGCTGGGCAT GTCGAAGAAG ATGCCGATGG GCCGGCCCTA CGCCGTCGCG TTCCATGGCC TGCGCGCGAT GAAGCGGCTG CGCGGGACGC CCTTCGACAT CTTCGGTTAC GACCCCGACC GCCGCACCGA GCGGGCCGTG ATCGCGGAGT ACGAGGCGCT GATCACCGAG CTGGTCCGAC CGGTGCCGGC CGGGGCCGCC GTCGCCTACG AGACGCTGGT CCGGGCCGCC GAGTCCGTGC AGTCGGTCAA GGGCTATGCG GAGATCAAGG AGGCCGCCGT GGAACGGTGG CGCGCCGAGA TCGCCCAGCT GCGGCGCGAC CTGGTACCGG CCACCGCCGG GCTCGACTGA
|
Protein sequence | MIGATGERAA SGDVIEAPPL EGPPVADPPA VEAPVPAPAA AALLSGVETL ARLLVVRERL DAKDGLTTGT MVSGYPGSPL GTFDLTLDKL GDKLAEHRIL HRPGLNEELG AAVVWGSQMG AVNGYAGVDG VAGAWYGKTP GLDRCGDVLR HANAMGAGPN GGVVMFCGDD PTAKSSTLPC DSQYTFEDAC IPVLFPGDQQ EVLDLGVHAY RMSRYCGSWA GMKIVTAVAD GIGSVDLDLD RHTPEDPEDI LVEGLPWRHR PQAKIGPHAV AGQEALVVDH RLRAAQAYVR HNGLDRVVGA RAGAPLGIVC AGKTYFDVVQ AFSDLGVRPD GLAAAGVRVL KLAMTYPVVE STVVEFARSV EEILVVEEKR PFIETQLRSI LHQAGIMVPV AGKKDLDGRA LLSTVGELDP AAVGKALTRV RPALAAGRRD EPRRMSLPLL ALPSRPPGFC SGCPHNRSTI FPDGALVGGG VGCHGIMYFE TRHQGMTSLP PTPMGAEGVP WIGLAPFVDE PHLIQNLGDG TLSHSGILAI RASVAAGVAV TFKILYNTAV AMTGGQDVVG LMDVPAMTRA LEAEGVRRIV VCAEDPKRYG RRARWAPGVK VLGRDHLPEV QEELRGVAGV SVIIYDQRCA AESRRLRKRG LLPEPPRRVV INEAVCEGCG DCGTKSNCLS VLPVETELGV KRRIDDLSCN RDYTCLDGDC PSFVTVQPRS RTWPWTRSPA KRRARRASAE QAAGASAGTG ESVRPTLPAG TLPAPANPGV DGQYGIYVTG IGGTGIITAS RILAAAAESA GLVVGGVDQT GLSQKAGAVV SHLHLAATRA EIGSATVGPG GADLYLSGDI LQAAGGPQLE RVRPGHTVAV VETELIPTTS MLQGGATAPA DEDLRRAITD RVGPERVAFI AGRQIAEQVF ADQLLGNVVL LGAAFQLGGL PFTLDDVENA MRRQGKAAAK NREAFEWGRW AAHDPAAVEA SLAGPAATGP AATGSAGAEA SGAGPEPGRR PGLTDPSSAA LTRAVALVAE RPLPPALRDL LVRRAAQVID YSGDSLARRF LGLVEQASAR DDEARGWELT RAVADSWHKI LTYKDEYEVA RLHLKTDYDD VARDLGIDGP YKVTYHLHPP ALRRLGMSKK MPMGRPYAVA FHGLRAMKRL RGTPFDIFGY DPDRRTERAV IAEYEALITE LVRPVPAGAA VAYETLVRAA ESVQSVKGYA EIKEAAVERW RAEIAQLRRD LVPATAGLD
|
| |