Gene Franean1_3683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3683 
Symbol 
ID5672049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4361013 
End bp4362272 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content66% 
IMG OID641242566 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001507986 
Protein GI158315478 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0804885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCACT TCCCGAAGCC CGCGGAGGGC AGCTGGACCG AGCACTACCC GGACCTCGGT 
ACCGGGTTCG TCTCGTTCGA GGACTCGATC TCGCCCGAGC ACTATGAGCT GGAGCGCAAG
GCGATCTTCG GGCGGACCTG GCTCAACGTC GGCCGGGTCG AGCAGATCCC GCGGACCGGC
AACTACTTCA CGAGGGAGAT CCACGCCGCG CGTGCGTCGC TGATCGTCGT CCGCGACGCC
GAAGGCGAGG TCCGCGCGTT CCACAACGTC TGCCGGCATC GCGGCAACAA GCTGGTGTGG
AACGACTTCC CCCAGGAGGA GACGCACGGC ACGGCACGGC AGTTCACCTG CAAGTACCAC
GCCTGGCGTT ACGGGCTCGA CGGCTCCTGC ACGTTCGTCC AGCAGGAGTC GGAGTTCTTC
GACCTCGACA GGTCGCAGCT CGGGCTGGTA CCGGTGCGCT GTGAGCTGTG GGAGGGCTTC
ATCTTCATCA ACCTCGACAG CGAGGGCACG ACATCGCTGG CCGAGTACCT GGGGCGCTTC
GCGAAGGGAC TCGAGGGCTA TCCCTTCGGC GAGATGACCG AGGTCTACAA GTACCGGGCC
GAGGTCGGGA GCAACTGGAA GCTCTACATC GACGCGTTCG CGGAGTTCTA CCACGCGCCC
GTCCTGCACG CGAAGCAGTA CGTGGGCGAC GAGTCACGCA AGCTGCTGGG CTACGGCTAC
GAGTCACTGC ACTACGACAT CGACGGGCCG CACTCGATGC AGTCGGCGTG GGGCGGGATG
TCGCCGCCCA AGGACCTCAA CATGGTGAAG CCGATCGAGC GGGTCCTGCG CAGCGGCAAC
TTCGGGCCGT GGGACCGCCC CGACATCGAG GGCCTGAACC CGCTGCCGTC AGGGGTCAAC
CCGGTCGGCC ACCCGGCGTG GGGCCTGGAC TCCTACGTCT TCTTCCCGAA CTTCATGATC
GTGGTGTGGG CGCCGGGCTG GTACCTCACC TACCACTACT GGCCGACGGC GTACAACCAG
CACATCTTCG AGGGCACGCT GTACTTCGTG CCGGCCCGGA CCGCGCAGGA TCGCATCCGG
CAGGAGCTTG CCGCGGTCAC CTTCAAGGAG TTCGCGCTGC AGGACTGCAA CACGCTCGAG
GCCACGCAGA CCATGCTCGA GTCCCGCGCC GTCTCGCGGT TCCCGCTCAA CGACCAGGAG
ATCGCCATCC GGCACCTCCA CAAGACCGCG GGTGACTACG TCGCCGGGTA CCAGGGGTAG
 
Protein sequence
MAHFPKPAEG SWTEHYPDLG TGFVSFEDSI SPEHYELERK AIFGRTWLNV GRVEQIPRTG 
NYFTREIHAA RASLIVVRDA EGEVRAFHNV CRHRGNKLVW NDFPQEETHG TARQFTCKYH
AWRYGLDGSC TFVQQESEFF DLDRSQLGLV PVRCELWEGF IFINLDSEGT TSLAEYLGRF
AKGLEGYPFG EMTEVYKYRA EVGSNWKLYI DAFAEFYHAP VLHAKQYVGD ESRKLLGYGY
ESLHYDIDGP HSMQSAWGGM SPPKDLNMVK PIERVLRSGN FGPWDRPDIE GLNPLPSGVN
PVGHPAWGLD SYVFFPNFMI VVWAPGWYLT YHYWPTAYNQ HIFEGTLYFV PARTAQDRIR
QELAAVTFKE FALQDCNTLE ATQTMLESRA VSRFPLNDQE IAIRHLHKTA GDYVAGYQG