Gene Franean1_2969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2969 
Symbol 
ID5671353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3495909 
End bp3497198 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content63% 
IMG OID641241873 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001507293 
Protein GI158314785 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.665588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTC TCGCCAAGCC GTCGGAGGCC AGCTGGACCG AGCGCTACCC GGAGCTGGGC 
ACCGGCCTCG TGTCGTACGA GGACTCCATC TCCCCGGAGT TCTACGAGCT CGAGCGCGAG
GCGATCTTCC GGCGTGCCTG GCTCAATGTC GGCCGGGTCG AACAGTTGCC CCGCAAAGGG
AGCTTCTTCA CGAAGGAGCT TGCCGTTGCC AAGACGTCGC TCATCGTCAC GCGCGATCTC
GATGGACAGG TTCGCGCCTT CCACAACGTG TGCCGCCACC GCGGCAACAA GCTGGTGTGG
GACAAGACGC CGAGGGACGA GACCAGTGGT GTCTGTCGCC AGTTCATGTG CAAGTACCAC
GGCTGGCGGT ACGGCCTGGA CGGTCAGCTG AAGTTCGTCC TTCAGGAGGA GGAGTTCTTC
GACTTCGACA AGGCGGACTT CCCGCTTGTG TCAGCTCACT GCGAGGTCTG GGAAGGCTTC
ATCTTCATCA ACCTGTCGGA CGAACCCAGC CAGTCGCTCC TTGAATTCCT TGGGCCGATG
GTGCGCGGGC TCGAGGGCTA CCCGTTCCAT CACGTCACCG AGCGGTATGC CTTCAAGGCG
GACATCCTCA GTAACTGGAA GATCTACCTT GACGCATTCC AGGAGTACTA TCATGCGTCG
ATCTTGCACT CGCAGCAGCA GGTGCCGAGC CTGCGCAGCT TTGAGTCCGG TTTCAAGGCG
CCGCACTACC AGGTCGATGG ACCGCATCGG CTGGTGAGCA CCGGCGGCTG GAAGGGCGTG
CCGCGGCACA TGCTGCCGCT CGACCAGATG TACCCGATCG AGCACAACAT CGAGGCCGGC
ATGATGGGCC CCTGGCAGCG CCCGGACATC CCCGAGCTCG ACCCGGCCAA CCTGCCGGCG
GGGCTCAACC CTGGCGGGCT CGACCCCTGG TCGATCTCGA ACTTCCAGAT CTGGCCCAAT
TTCGTGATCC TGGTCTATGA GCGGGGCTGG TACCTCACCT ACCAGTACTG GCCGACTTCC
CACAACACGC ATGTGTGGGA GATGTCGTAC TACTTCCCGC CGTCGCGGAA TGCCAGCGAA
CGGATCCGGC ACGAGGTCAC CGCCGTCGTG TCCAAGGAGG CCGGTCTCCA GGACGCGGGC
ACCCTCGACG GCACCCAGAT GGGCCTGGAA TCCAGGGTTA TCGACAGATA TCCGCTGTCC
GATCAGGAGA TTACCGTACG TCACCTGCAC AAGGTCACCG GAGATTGGGT GCAGTCCTAT
CTTCGGGACG GAAAGGCAGT CCGGGCATGA
 
Protein sequence
MARLAKPSEA SWTERYPELG TGLVSYEDSI SPEFYELERE AIFRRAWLNV GRVEQLPRKG 
SFFTKELAVA KTSLIVTRDL DGQVRAFHNV CRHRGNKLVW DKTPRDETSG VCRQFMCKYH
GWRYGLDGQL KFVLQEEEFF DFDKADFPLV SAHCEVWEGF IFINLSDEPS QSLLEFLGPM
VRGLEGYPFH HVTERYAFKA DILSNWKIYL DAFQEYYHAS ILHSQQQVPS LRSFESGFKA
PHYQVDGPHR LVSTGGWKGV PRHMLPLDQM YPIEHNIEAG MMGPWQRPDI PELDPANLPA
GLNPGGLDPW SISNFQIWPN FVILVYERGW YLTYQYWPTS HNTHVWEMSY YFPPSRNASE
RIRHEVTAVV SKEAGLQDAG TLDGTQMGLE SRVIDRYPLS DQEITVRHLH KVTGDWVQSY
LRDGKAVRA