Gene Franean1_4357 details


Gene Information

Locus tag: Franean1_4357
Symbol:
ID: 5672712
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5201223
End bp: 5202491
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 64%
IMG OID: 641243230
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001508647
Protein GI: 158316139
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
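
The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the terminal stop codon is not counted in the protein length. Both points are assumptions, since the record does not state its coordinate convention. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5201223, 5202491)
print(length)                  # 1269
print(protein_length(length))  # 422
```

Both values match the Gene Length and Protein Length fields of the record.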


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCACT TCACCAAGCC CGTCGAGGGC AGCTGGACAG AGCACTTCCC CACCCTCGGA 
ACCGACTTCG TCTCGTTCGA GGACTCGATC TCACCGGAGC ACTACGAGTT GGAACGCAAG
GCGATCTTCG AGCGGAGCTG GCTCAACGTC GGCCGCGTGG AGCAGATCCC GAAGCGGGGA
AACTACTTCA CCAAGGAGAT CCAAGCCGCC CGCGCCTCGA TCATCGTCGT CCGCGACAAC
GAGGACCAGA TCCGCGCCTT CCACAACGTC TGCCGCCACC GCGGCAACAA ACTGGTGTGG
AATGACTTCC CGCACGAGGA GGTCGCGGGC ACCGCGCGCC AGTTCCAGTG CAAGTACCAC
GCCTGGCGCT ACGGGCTCGA CGGCACCTGC ACCTTCGTCC AGCAGGAGGC GGAGTTCTTC
GACCTCGACA AGTCGAAGCT CGGTCTCGCG GCGGTGCGCT GCGAGGTTTG GGAGGGCTTC
ATCTTCATCA ACCTCGACAA CGAGGACACC ACCCCGGTGC GCGAGTACCT CGGGCGGTTC
GCGAAGGGCA TGGAGGGCTA CCCGTTCGAC CAGATGACCG AGGTCTACCG GTACCGGGCG
CACGTCAAGA GCAACTGGAA GCTCTACATA GACGCGTTCG CCGAGTTCTA CCACGCACCC
GTCCTGCACG CGAAGCAGTA CGTCGGCACC GAGTCCCGCA AACTCATAGG CTACGGCTAC
GAGGGGCTGC ACTACGACCT CGATGGCCGG CACTCGATGC AGTCCGCGTG GGGCGGCATG
TCGCCACCCA AGGACCTCTC CATGGTGAAG CCGATCGAGC GGATTCTGCG CAGCGGCAAC
TTCGGGCCAT GGGACCGTCC CGACATCACG GGTCTCGACC CGCTACCCGG GGGCGTCAAC
CCGTCGGGCC ACCGGGCATG GGGCAACGAC TCGTACCTGT TCTTCCCGAA CTTCATGATC
CTGATCTGGG CGCCGGGCTG GTATCTCACA TACCACTACT GGCCGACCGC GTACAACGAG
CACATCTTCG AGGGAACGCT GTACTTCGTC CCGCCGAAGA ACGCGGCGGA GCGCCTGCGG
CACGAACTGG CCGCGGTCAC CTTCAAGGAG TTCGCGCTAC AGGACTGCAA CACCCTCGAG
GCGACGCAGA CCATGCTCGA GTCCCGCGCG GTCCGGGATT TCCCGCTCAA CGACCAGGAG
ATCCTCATCC GTCATCTCCA CAAGTCCGCG AACGACGTCG TCGCCGCCTA CCAGGCCGCG
ACGAAATGA
 
Protein sequence
MAHFTKPVEG SWTEHFPTLG TDFVSFEDSI SPEHYELERK AIFERSWLNV GRVEQIPKRG 
NYFTKEIQAA RASIIVVRDN EDQIRAFHNV CRHRGNKLVW NDFPHEEVAG TARQFQCKYH
AWRYGLDGTC TFVQQEAEFF DLDKSKLGLA AVRCEVWEGF IFINLDNEDT TPVREYLGRF
AKGMEGYPFD QMTEVYRYRA HVKSNWKLYI DAFAEFYHAP VLHAKQYVGT ESRKLIGYGY
EGLHYDLDGR HSMQSAWGGM SPPKDLSMVK PIERILRSGN FGPWDRPDIT GLDPLPGGVN
PSGHRAWGND SYLFFPNFMI LIWAPGWYLT YHYWPTAYNE HIFEGTLYFV PPKNAAERLR
HELAAVTFKE FALQDCNTLE ATQTMLESRA VRDFPLNDQE ILIRHLHKSA NDVVAAYQAA
TK
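
The relationship between the two sequence blocks can be checked with a short script. The sketch below is an illustration rather than the database's own method: the function names are hypothetical, and the codon table is the standard set of amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code only in its permitted start codons). It strips the spacing used in the listing, translates codon by codon until the first stop, and also provides a GC percentage helper.

```python
# Standard codon assignments in NCBI "TCAG" order; translation table 11
# uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 up to the first stop codon.

    Raises KeyError on any base other than A, C, G, or T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence listed above.
first_row = "ATGGCGCACT TCACCAAGCC CGTCGAGGGC AGCTGGACAG AGCACTTCCC CACCCTCGGA"
print(translate(first_row))    # MAHFTKPVEGSWTEHFPTLG
print(translate("ACGAAATGA"))  # TK
```

The two outputs correspond to the first twenty and the last two residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the GC content field to be compared in the same way.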