Gene Franean1_4719 details


Gene Information

Locus tag: Franean1_4719
Symbol:
ID: 5673061
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5633802
End bp: 5635145
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 66%
IMG OID: 641243576
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001508992
Protein GI: 158316484
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
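
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so, but this matches the reported gene length) and that the protein length excludes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5633802
END_BP = 5635145
GENE_LENGTH_BP = 1344
PROTEIN_LENGTH_AA = 447


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5635145 - 5633802 + 1 gives 1344 bp, and 1344 / 3 - 1 gives 447 aa, in agreement with the listed values.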


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.642788
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.301808
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCTCA CCGACGACCG CCGCCCGGTC GCCACTCGCG ATGTTCCGTT TGCGATGCGT 
GACCGGTTAC ATGTGCCTCG TGAGCGTTAT TACGACCGGG GGTTCTACGA GCTGGAGAAG
GAGTATCTGT GGCCGAGGGT CTGGCAGATG GCGTGCCGCC TGGAGGAGAT TCCGCAGCCT
GGGGATTTCG TCGAGTACGA GATCTGCGAC CAGTCGATTC TGGTGGTGCG GCAGCCTGAC
CGGTCGGTCA AGGCGTTCCA TAATGCGTGC CGTCATCGGG CTACCCAGCT GTGTTCGGGT
TCCGGGCGGC TTCCCGGAGG GCAGATCGTG TGCCCGTTCC ACGGCTGGCG GTGGAACCTC
GACGGCAGTA ACTCGTTCGT GTACGGCGCG GAGGGGTTCG CGCCGGAGAT CCTGCGTCCG
GAGGACATTC GGCTGCAGGA GTGCAGGGTC GACACGTGGG GTGCCTGCGT GTGGATCAAC
ATGGATCCGG ATGCCCGTCC GTTGCGGGAG GCGCTGGCGC CGGTAGCGGG GTTGCTGGAC
GCGGTGTGCG TGGAGAACAT GCGGGTGTGG TGGTGGAAGG AGACGATCCT CAACGCGAAC
TGGAAGGTGG CTCAGGAGGC GTTCCACGAG GGCTATCACG TGATGGGCAC GCATCCGCAG
CTCACGTTCG GTCTGGGTGA CGACTATCCG TTCGGGAATG TCGAGTACAC CGCGTTCGGG
AACGGCCACG GTCGTTTCCA GGGCCGGTTC GACCCGACCG CGGGCGGCGT CTCCCAGGGG
CGTGGTGGGG AGGCGTTCCT GGAGCGGTCG CGGATCCTGT GGGAGGGGCA GGACGCGATG
ACCCTCGAAC GTGACCTGCA CGTCTTCCGG GGGATGCGTA ACCGGGTCGC GCCCGGTGAG
GACTTCCCGA CGGCGGCGAT CAAGGCGTTG TTCGACTACG CCGAGGGTGC CGGTATCCCG
CTGCATCCGA CACCGGAGGG CCTGCGGCTG TGGGGTGGCG GGGTTTTCCT GTTCCCGAAC
TTCCTCATGC TTCCCCAGTT CGGTAACGCG CTGTCGTACC GGGTCCGCCC CTACAACGAC
GACCCCGAGT GGTGCCGTTT CGAGGTGTGG TCGTTGACCA TGTACCCGGA GGGCGAGGAG
CCCGGCCGGG CGAAGCTGAA GGGCCGCTTC GCCTCCGACG ACACCGAGAA CTGGGGTCTG
ATCCCACGCC AGGACTTCAG CAACATCGCA AGCCAGCAAC GCGGCCTGCA TTCGCGCAGC
TACCGCGAGC ACCGTCTCGC CACCGAGATG GAGCAGCTCA TCAGCAACAT GCACTCCGAA
CTCGACCGGT ACCTCGCCGG CTGA
 
Protein sequence
MALTDDRRPV ATRDVPFAMR DRLHVPRERY YDRGFYELEK EYLWPRVWQM ACRLEEIPQP 
GDFVEYEICD QSILVVRQPD RSVKAFHNAC RHRATQLCSG SGRLPGGQIV CPFHGWRWNL
DGSNSFVYGA EGFAPEILRP EDIRLQECRV DTWGACVWIN MDPDARPLRE ALAPVAGLLD
AVCVENMRVW WWKETILNAN WKVAQEAFHE GYHVMGTHPQ LTFGLGDDYP FGNVEYTAFG
NGHGRFQGRF DPTAGGVSQG RGGEAFLERS RILWEGQDAM TLERDLHVFR GMRNRVAPGE
DFPTAAIKAL FDYAEGAGIP LHPTPEGLRL WGGGVFLFPN FLMLPQFGNA LSYRVRPYND
DPEWCRFEVW SLTMYPEGEE PGRAKLKGRF ASDDTENWGL IPRQDFSNIA SQQRGLHSRS
YREHRLATEM EQLISNMHSE LDRYLAG
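
The protein sequence can be reproduced from the gene sequence using the record's translation table (11). The following is a minimal sketch, not the tool that generated this page. It builds the codon table from the compact NCBI amino acid string, whose assignments for table 11 are the same as in the standard code (table 11 differs only in which codons may act as starts). The helper names `translate` and `gc_content` are hypothetical, and the sketch does not handle alternative start codons, which is adequate here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# for translation table 11 match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First and last rows of the gene sequence shown in this record.
    print(translate("ATGGCTCTCA CCGACGACCG CCGCCCGGTC"))
    print(translate("CTCGACCGGT ACCTCGCCGG CTGA"))
```

Pasting the full gene sequence into `translate` should yield the protein sequence listed here, and `gc_content` on the same input can be compared with the reported GC content.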