Gene Franean1_4290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4290 
Symbol 
ID5672645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5128400 
End bp5129671 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content65% 
IMG OID641243163 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001508580 
Protein GI158316072 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.502996 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCCACT TCAGCAAGCC CCCGGCGGGG AGCTGGACCG AGAACTATCC CGAGCTCGGC 
ACCGCGCCCG TCGACTACAA CGACTCGATC GACCCCGAGT TCTACGAGCA GGAGCGCGAG
GCGATCTTCA AGCGCTCCTG GCTGAACGTG GGCCGGGTCG AGCGCCTCCC CCGCACCGGC
AGCTACTTCA CCAAGGAGCT GCCCGCCGCC GGGACGTCCC TGATCATCGT CAAGGGCGGC
GACGGCAAGG TGCGCGCGTT CCACAACGTG TGCCGGCACC GCGGCAACAA GCTGGTGTGG
AACGACTTCC CGGGTGAGGA GACCGCCGGC AGCTGCCGGC AGTTCATCTG CAAGTACCAC
GCCTGGCGTT ACGACCTCAC CGGCGAGCTG ACCTTCGTCC AGCAGGAGGG TGAGTTCTTC
GACCTGGACA AGAAGCAGTT CGGCCTCAAG GAGGTCGCCT GTGAGGTCTG GGAAGGCTTC
ATCTTCATCA ACCTGAACCC GCAGGAGACC CTCACCGAGT ACCTCGGTGA CATGGCCAAG
GGCCTCGAGG GCTACCCGTA CTCGGAGCTG ACCGAGGTCT ACTCCTACCG GGCCGAGGTC
GGCGCCAACT GGAAGCTGTT CATCGACGCT TTCGCGGAGT TCTACCACGC GCCCGTCCTG
CACCAGAAGC AGGCCGTCAA GGGCGAGTCC GAGAAGCTCA TCGGCTACGG GTTCGAGGCG
CTGCACTACC AGTTGTTCAG CCCGCACTCG ATGGTGTCCT CCTGGGGCGG CATGGCCCCG
CCGAAGGACC CGTCGATGGT CAAGCCGATC GAGCGCGTGC TGCGCAGCGG CCTTTTCGGC
CCCTGGGACA CCCCCGAGGT CGACGGCCTC GAGGTCGAGA AGCTCCCGAC GGGGATCAAC
CCGGTGAAGC ACAAGTCCTG GGGCACCGAC TCGTTCGAGA TCTTCCCCAA CTTCACGCTG
CTGTTCTGGA AGCCGGGCTG GTACCTGACG TACCACTACT GGCCGACCGC GGTGAACAAG
CACACGTTCG AGGCGAGCCT CTACTTCGCC CCGCCGAAGA ACGCCCGCGA GCGACTGGCC
CAGGAGCTGG CGGCGGTGAC GTTCAAGGAG TACGCGCTCC AGGACGCCAA CACCCTCGAG
GCCACCCAGA CGATGATCGG TACCCGTACC GTCACCGAGT TCCCCCTGTG CGACCAGGAA
ATCCTGCTCC GGCACCTGCA CAAGGTCGTC GGCGACCGAG TCAAGGAGTT CAGCGATGCC
GCTGCCGTCT GA
 
Protein sequence
MPHFSKPPAG SWTENYPELG TAPVDYNDSI DPEFYEQERE AIFKRSWLNV GRVERLPRTG 
SYFTKELPAA GTSLIIVKGG DGKVRAFHNV CRHRGNKLVW NDFPGEETAG SCRQFICKYH
AWRYDLTGEL TFVQQEGEFF DLDKKQFGLK EVACEVWEGF IFINLNPQET LTEYLGDMAK
GLEGYPYSEL TEVYSYRAEV GANWKLFIDA FAEFYHAPVL HQKQAVKGES EKLIGYGFEA
LHYQLFSPHS MVSSWGGMAP PKDPSMVKPI ERVLRSGLFG PWDTPEVDGL EVEKLPTGIN
PVKHKSWGTD SFEIFPNFTL LFWKPGWYLT YHYWPTAVNK HTFEASLYFA PPKNARERLA
QELAAVTFKE YALQDANTLE ATQTMIGTRT VTEFPLCDQE ILLRHLHKVV GDRVKEFSDA
AAV