Gene Franean1_1410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1410 
Symbol 
ID5669816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1706233 
End bp1707504 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content63% 
IMG OID641240333 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001505760 
Protein GI158313252 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.190649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTCGTT GGCCCAAGCC GCCCGAAGGC AGCTGGACGG AGCATTACCC GGAGCTCGGA 
ACCGAGCCTG TTTCCTACGA GGACTCGATC TCGCCCGAAT TATATGATCT TGAGCGGGAG
GCAATCTTCA AGCGCGGCTG GCTCAATGTC GGCCGGGTGG AGCTGCTGCC AAAGAACGGC
AGCTACTTCA CCAAGGAGAT CCACATAGCC AAGACCTCGG TCATCGTCCT GCGGGACAGG
AGCGGCCAGG TGCGGGCGTT CCACAACATC TGCAGGCACC GCGGCAACAA GCTGGTGTGG
AACGACTTCC CGAACGAGGA GACCAGCGGC ACCTGCCGTC AGTTCACCTG CAAGTACCAC
GCCTGGCGTT ACGACCTCGA CGGTTCGCTG AACTTCATCC AGCAGGAGGG CGAGTTCTTC
AACCTCGACA AGAACGACTA CGGGCTCGTT CCCGTCCACT GCGACGTGTG GCAGGGCTTC
ATCTTCATCA ACCTCGCGAA GGAGCCCGAG CAGCAGCTGT CGGACTTCCT CGGCCCCATG
ATCACCTCTA TCGAGGGTTA CCCCTTCGAC AAGATGACCG AGCGGTGGTA CTACCGTTCG
GAGATCAAGG CCAATTGGAA GCTCTACATG GATGCCTTCC AGGAATTCTA CCACGCACCG
ATCCTGCACG CGCGACAGTC GCCGGCCAAG TTCGCCAACT CCGCGCAGCA GGCCGGGTTC
GAGGCACCGC ACTACCGCAT CGACGGCCCG CACCGCCTGG TGAGCACCGC CGGGATTCGG
GCGTGGGAAC TCGAGCAGGA TCTGCGCAAG CCGATCGAGG AAATCACCCG CAGCGGTCTG
TTCGGGCCCT GGGACATCCC CGACCTCGGC ATCGAGAAAA TGCCGGCGGG TATCAACCCG
GCCGGGTGCG ACCCGTGGGG CCTCGACTCG TTCCAGCTCT TCCCGAACTT CACGATCCTG
ATCTGGGGCC AGGGCTGGTA CCTCACATAC CACTACTGGC CGACGGCCTA CAACAGCCAC
ATCTTCGAGG GCACCCTCTA CTTCATTCCC GCGAAGACGC CGCGGGAGCG GGTGGCCCAC
GAGCTGACGG CGCTGTCCTT CAAGGAGTTC GGCCTGCAGG ACGCCAACAC GCTCGAGGCG
ACACAGACGA TGATGGAGTC CCGGGTGGTC GACCGGTTTC CGCTGGGGGA CCAGGAGGTG
CTGTGCCGGC ACCTCCACAA GGAGACGGCC GACTGGATCG AGAAGTACCA GCGCGAACGA
GCGGGAGCGT GA
 
Protein sequence
MTRWPKPPEG SWTEHYPELG TEPVSYEDSI SPELYDLERE AIFKRGWLNV GRVELLPKNG 
SYFTKEIHIA KTSVIVLRDR SGQVRAFHNI CRHRGNKLVW NDFPNEETSG TCRQFTCKYH
AWRYDLDGSL NFIQQEGEFF NLDKNDYGLV PVHCDVWQGF IFINLAKEPE QQLSDFLGPM
ITSIEGYPFD KMTERWYYRS EIKANWKLYM DAFQEFYHAP ILHARQSPAK FANSAQQAGF
EAPHYRIDGP HRLVSTAGIR AWELEQDLRK PIEEITRSGL FGPWDIPDLG IEKMPAGINP
AGCDPWGLDS FQLFPNFTIL IWGQGWYLTY HYWPTAYNSH IFEGTLYFIP AKTPRERVAH
ELTALSFKEF GLQDANTLEA TQTMMESRVV DRFPLGDQEV LCRHLHKETA DWIEKYQRER
AGA