Gene Franean1_5559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5559 
Symbol 
ID5673889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6737020 
End bp6739272 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content70% 
IMG OID641244415 
Productterpene synthase metal-binding domain-containing protein 
Protein accessionYP_001509819 
Protein GI158317311 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCTT TCACGCTCCC GGAGTTCTAC CTGCCCTATC CACCCCGGCT GAATCCGAAC 
CTGGAGCACG CCCGCGTGCA CAGCCGGGCG TGGGCCGGGG AGATGGAGAT GATCGACGTA
CCGCAGGACG GTGTGGCGAT CTGGAGCGGG CAGGACTTCG ACTCCCACGA CTACGCGCTG
CTGTGCGCGT ACACCCATCC GGACGCGGAC GAGGCGCGGC TGGATCTCAT CACGGACTGG
TACGTCTGGG TTTTCTACTT CGACGACCAC TTCCTCGAGG TCTACAAGCG CGGCCGGGAT
GTCGCCGGAG CCCGCCGGTA TCTCGACCGG CTGCGCCTGT TCATGCCGGT CGAGGGCGCC
GTCACCGCGG AGCCGGCCAA CCCGGTCGAG CGTGGCCTCG CCGACCTGTG GTCCCGCACC
GTCCCGGACC GCACGCCGGC CTGGCGGCGG CGGTTCGCGA CGAGCACCCG TCATCTGTTG
GACGAGTCCC TGTGGGAGCT GGCCAACATC GACGAGAACC GGCTCGCCAA CCCGGTCGAG
TACATCGAGA TGCGGCGCAA GGTCGGCGGC GCGCCCTGGT CGGCGAACCT GGTCGAGCAC
GCGGCGGACG CCGAGGTGCC CGACGCGATC GCCGCGACCC GGCCGGCGCA GGTGCTGCGG
GACACCTTCT CCGACGCCAT CCACCTGCGT AACGACCTGT TCTCCTACCA GCGGGAGGTG
CAGGAGGAGG GCGAGCTCAG CAACGGCGTC CTGGTGCTGG AGCGGTTCCT GGACTGCCCG
ACCCAGCAGG CGGCCGACGC CGTGAACGAC CTGCTGACCT CGCGGCTGCA CCAGTTCGAG
CACACCGCGC TCACCGAGCT GCCGCCGGTG CTCGACGAGC ACGGCGTCAC CCCGACCGCC
CGCCGGGACG TCCTGGCGTA CGTCAAGGGG CTGCAGGACT GGCAGGCCGG CGGCCATGAG
TGGCACATGC GGTCCAGCCG CTACATGAAC GCGGAGTCCG GGGCGACCGG GCCCGTTCCC
GGCAGCCTTC CCGGGGATGC CACCGGGCTC GGCACGTCGG CGGTGCGTAT CGCGGCCTCG
CTGCTGGCCA CCGCGCCCGC CCGGATGCGC GCTTTCACCC ACGTCCCGCA CCAGGTCGTG
GGGCCGGTGA AGCTCCCTGC CTTCTACATG CCGTTCACGA CCGGTGAGAG CCGGCATCTG
GCGGCAGCCC GTCACAACAT CGTCGAATGG TCGGCGGCGG TCGGGTTTCT CGACCCGGTA
CCCGGCATCT GGGACGAGCA CAAGCTCCGG GCGGCCGACT TCGCGCTGTG CTCCGCGGCC
ATCCATCCGA ACGCGACAGC CGCGGAGCTG GACCTGACGA CGGGATGGCT CACCTGGGGG
ACCTACGCCG ACGACCTCTA CCCGGTCCTG TACGGACGGA CCCGCGACCT GGCCGGCGCG
CGGGCCTGCA CCGAGCGGCT GAAGGAACTC ATGCCGGTGG AGCCCGGCCC GCTGCCCGTC
CCGGTCGGCG GGCTGGAGCG CGGCCTCGCC GACCTGTGGC CGCGCACCAC CCGGGACATG
ACGCCCGACT CCCGGCGCAC GTTCCGCCGG ACGGTCTGCA TCATGCTCGA CAGCTGGCAG
TGGGAGCTGG CGAACCAGGC GCAGAACCGT ATTCCCGACC CGGTGGACTA CATCGAGATG
CGCCGCCGGA CCTTCGGCTC GGACCTGACG ATGAGCCTGT CCCGGCTCGG GCACGGGCGG
TCCGTTCCAC CGGAGATCTA CGGCACCCGG CCGATCAGGG CGCTGGAGAA CTCCGCCGCG
GATTACTCCT GCCTGCTCAA CGATATTTTC TCCTACCAGA AGGAAATCCA GTTCGAGGGC
GAGATCCACA ACTGTGTTCT CGTCTTCCAG AACTTTCTCG GCTGCGGCGC CGAACGCGCG
ATCGGCGTGG TCAACGATCT GATGACCGCG CGGCTGCGCG AGTTCGAGCA CGTCGTCGAC
GTCGAGCTGC CCGCCCTCTT CGACACCTAC GAGCTGACGG AGGAGGCGCG GGACGTCCTG
CGGGGGTACG TGGGCGAGCT GAAGAGCTGG CTGGCCGGGG TTCTCCGCTG GCACCAGGGG
ACGCGGCGTT ACGACGAGGC GGAGCTGCGC CACCATCCGG CGGTCGGGGT GCGGCCCTTC
GGGGGGCCGG TCGGTCTCGG CACGTCGGCG GCCGACATCC GGCGGGCGCT ATCGGGAAAA
TCTGGGCAAC CGACCGCCCT GACCGGCTCC TGA
 
Protein sequence
MQPFTLPEFY LPYPPRLNPN LEHARVHSRA WAGEMEMIDV PQDGVAIWSG QDFDSHDYAL 
LCAYTHPDAD EARLDLITDW YVWVFYFDDH FLEVYKRGRD VAGARRYLDR LRLFMPVEGA
VTAEPANPVE RGLADLWSRT VPDRTPAWRR RFATSTRHLL DESLWELANI DENRLANPVE
YIEMRRKVGG APWSANLVEH AADAEVPDAI AATRPAQVLR DTFSDAIHLR NDLFSYQREV
QEEGELSNGV LVLERFLDCP TQQAADAVND LLTSRLHQFE HTALTELPPV LDEHGVTPTA
RRDVLAYVKG LQDWQAGGHE WHMRSSRYMN AESGATGPVP GSLPGDATGL GTSAVRIAAS
LLATAPARMR AFTHVPHQVV GPVKLPAFYM PFTTGESRHL AAARHNIVEW SAAVGFLDPV
PGIWDEHKLR AADFALCSAA IHPNATAAEL DLTTGWLTWG TYADDLYPVL YGRTRDLAGA
RACTERLKEL MPVEPGPLPV PVGGLERGLA DLWPRTTRDM TPDSRRTFRR TVCIMLDSWQ
WELANQAQNR IPDPVDYIEM RRRTFGSDLT MSLSRLGHGR SVPPEIYGTR PIRALENSAA
DYSCLLNDIF SYQKEIQFEG EIHNCVLVFQ NFLGCGAERA IGVVNDLMTA RLREFEHVVD
VELPALFDTY ELTEEARDVL RGYVGELKSW LAGVLRWHQG TRRYDEAELR HHPAVGVRPF
GGPVGLGTSA ADIRRALSGK SGQPTALTGS