Gene Franean1_3306 details


Gene Information

Locus tag: Franean1_3306
Symbol:
ID: 5671678
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3917945
End bp: 3919216
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 65%
IMG OID: 641242195
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001507615
Protein GI: 158315107
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
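
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3917945
end_bp = 3919216

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1272
print(protein_length)  # 423
```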


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.478644
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.259475
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCACGAT GGCCGAAGCC GGCTGAGGGA AGCTGGACGG AACACTACCC GGAGCTGGGA 
ACCGGGCTGG TGTCGTACGA GGACTCCATC TCGCCCGATT TCTACGCCCT GGAACGGGAT
GCGATCTTCA GGCGTGCCTG GCTGAACGTG GGCCGGGTCG AGCAGTTACC GCGCAGCGGG
AGCTACTTCA CCAAGGAGAT CGAGGCCGCC AGGGCCTCGA TCGTCGTCGT GCGCGACAAC
GACGGTCAGG TCCGTGCCTT CCACAACATC TGTCGCCATC GCGGCAACAA GCTGGTGTGG
AACGACTTCC CGCGGGAGGA GACCGCCGGT AGCTGTCGCC AGTTCACCTG CAAGTACCAC
GGCTGGCGGT ACGGCCTGGA CGGCGCCGCC ACCTTCGTGC AGCAGGAGGG CGAGTTCTTC
AACCTGGACA AGAAGGACTT CGGCCTCGTC CCGGTGCACT GCGACGTCTG GTCCGGATTC
ATTTTCGTCA ATCTGGCGAA GGAGCCCGAA CAGTCGCTGC CCGAGTTCCT CGGCCCGATG
GTCACCGCGC TCGGCGGATA TCCGTTCGAC ACGATGACCG AGCGCTTTTA CTACCGTGCT
GAGGTCGGCG CGAACTGGAA GCTGTTCATG GACGCCTTCC AGGAGTTTTA CCATGCGCCC
ATTCTGCACG CCCGGCAGAC GCCGTCGAAG TTCTCGACGG CGGCGCAGCA GGCGGGGTTC
GAGGCGCCGC ACTACCGCAT CGACGGCCCG CACCGGCTGG TCAGCACGGC CGGTATCAAG
GCGTGGGAAC TGGACCCGGA GATGCGCAAG CCGATGGAGG ACATCACCCG CAGCGGCCTG
TTCGGGCCCT GGGACGAACC CGATCTCGGC GTCGAGAAGA TGCCGGCGGG TCTCAACCCG
GCCGGCTGCG AGCCGTGGGG TCTGGACTCG TTCAATCTCT GGCCGAACTT CGTGATCCTG
ATCTGGGCGG GTGGCTGGTA TCTCACCTAC CATTACTGGC CGACGTCGCA CAACACGCAC
ACCTTCGAGG GGAACCTGTA TTTCGTTCCC TCCCGGAACG CGCGGGACCG GGTTGCCCGC
GAGATGGCCG CGGTGACCTT CAAGGAGTAC GCGTTGCAGG ACGCCAACAC GCTGGAGGCG
ACGCAGTCGA TGCTCGAGTC CCGGGCTGTC GCCGAGTTCC CACTGAACGA CCAGGAGGTT
CTCTGCCGGC ACCTGCACAA GGTCGCGGCC GACTGGGTCG GGGAATACCA GCACAACGGC
GCGGGGGCGT GA
 
Protein sequence
MARWPKPAEG SWTEHYPELG TGLVSYEDSI SPDFYALERD AIFRRAWLNV GRVEQLPRSG 
SYFTKEIEAA RASIVVVRDN DGQVRAFHNI CRHRGNKLVW NDFPREETAG SCRQFTCKYH
GWRYGLDGAA TFVQQEGEFF NLDKKDFGLV PVHCDVWSGF IFVNLAKEPE QSLPEFLGPM
VTALGGYPFD TMTERFYYRA EVGANWKLFM DAFQEFYHAP ILHARQTPSK FSTAAQQAGF
EAPHYRIDGP HRLVSTAGIK AWELDPEMRK PMEDITRSGL FGPWDEPDLG VEKMPAGLNP
AGCEPWGLDS FNLWPNFVIL IWAGGWYLTY HYWPTSHNTH TFEGNLYFVP SRNARDRVAR
EMAAVTFKEY ALQDANTLEA TQSMLESRAV AEFPLNDQEV LCRHLHKVAA DWVGEYQHNG
AGA
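
The protein sequence can be related back to the gene sequence with a short translation routine. The following is a minimal sketch, not the tool used to produce this record: it assumes the codon assignments of translation table 11 match the standard genetic code, and that an initiation codon such as GTG in the first position is read as methionine. The start-codon set is taken from the NCBI definition of table 11, and the function names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order (same assignments as the
# standard code; assumed here to apply to translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons listed for table 11; each is read as M
# when it is the first codon of the coding sequence.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base up to the first stop."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
first_block = "GTGGCACGAT GGCCGAAGCC GGCTGAGGGA"
print(translate(first_block))  # MARWPKPAEG
print(round(gc_fraction(first_block), 2))
```

Passing the full gene sequence to `translate` and `gc_fraction` would give values to compare against the Protein sequence and GC content fields of the record.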