Gene Franean1_4273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4273 
Symbol 
ID5672628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5106996 
End bp5108066 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content69% 
IMG OID641243146 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001508563 
Protein GI158316055 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.56971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.449147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCGA CGGAAGGCGG CGTGGCGGCG ACGGCCTATA CCAGCCGGGA ACGTTTCGAA 
CTGGAACGTG AAGTCGTCCG GCGTTCACCG CAGCTCGTGG GCTACAGCTC CGAACTGCCC
GACGCCGGAA CCTACTGCAC AAAGACCGTG ATGGACGTCC CCGTGCTGTT GACTCGCGGC
GATGACGGAA CGGTGCGCGC TTTCCAGAAC GTGTGCGCCC ATCGGCAGGC TCAGGTCGCC
CAGGGCTGCG GCGTGGCGCA GCGGTTCACC TGCCCCTGGC ATGCGTGGAC GTACGACGCG
CGCGGCGAGT TCGTCGGCGG TCCGGGGCGG GAGGGCTTCC CCGCGACACT CAACGGTCAG
GCGCGGCTCA ACGAGCTGCC CGCCGCCGAG AACGCGGGGT TCCTGTGGGT CGGGCTGGAC
CCGGCCGCCG GGCCGCTCGA CATCGACGCA CACCTGGGCG AGCTGGGCCC GGAGCTGGCG
TCCTGGAACA TCGGGTCGTG GGCGCCGGTG GGCGAGAAGG TCATCGACTC ACCGGTCAAC
TGGAAGCTCG CGCTCGACAC GTTCGCCGAG AGCTACCACT TCGCGTCGGT GCACCGGGAC
ACGTTCGCGC TGATCAACAA GAGCAACTGC GCGCTGTTCG ACTCCTACGG GCCGCACCAC
CGCCTGGTCT TCCCGATGAA CCACATCACC GACCTGGCGG ACAAGCCGGA GGAGGAGTGG
GAGCCGCTGA ACAACTTCGT GTTGATCTAC GCCCTGTTCC CCAACATCGT CCTGTCCGTG
ACGGTCGCGA ACGGCGAGGT GTTCCGGGTG TACCCGGGCG AGCGCCCGGG CCATTCGGTC
ACCTACCACC AGAACGCGTC CCCGATGGAC CTCACCGACG AGGCGACCCG GGAGACCGCG
GAGACGATCT TCGACTACGC GCACAACGCC GTCCGCGACG AGGACTACGC GCTGGCGGCC
CAGGTGCAGG CGAGCATGGC CTCGGGCGCG CGCGCGGACC TCGTCTTCGG GCGCAACGAG
CCCGGCCTGC ACCACCGGCA CGAGGTTCTC GAGGACGCGC TCGGCCGCTA G
 
Protein sequence
MTPTEGGVAA TAYTSRERFE LEREVVRRSP QLVGYSSELP DAGTYCTKTV MDVPVLLTRG 
DDGTVRAFQN VCAHRQAQVA QGCGVAQRFT CPWHAWTYDA RGEFVGGPGR EGFPATLNGQ
ARLNELPAAE NAGFLWVGLD PAAGPLDIDA HLGELGPELA SWNIGSWAPV GEKVIDSPVN
WKLALDTFAE SYHFASVHRD TFALINKSNC ALFDSYGPHH RLVFPMNHIT DLADKPEEEW
EPLNNFVLIY ALFPNIVLSV TVANGEVFRV YPGERPGHSV TYHQNASPMD LTDEATRETA
ETIFDYAHNA VRDEDYALAA QVQASMASGA RADLVFGRNE PGLHHRHEVL EDALGR