Gene Franean1_6914 details

Gene Information

Locus tag: Franean1_6914
Symbol:
ID: 5675227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8423025
End bp: 8424320
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 66%
IMG OID: 641245763
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001511154
Protein GI: 158318646
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
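
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a stop codon; the record does not state either convention, and the function names are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(8423025, 8424320)
print(length)                     # 1296
print(protein_length_aa(length))  # 431
```

Under these assumptions the result agrees with the Gene Length (1296 bp) and Protein Length (431 aa) fields listed in the record.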


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCATCC AGGAAAAACT TGCGACCGGT AGAGGCAAGT ACACACCGGG GTACCCGAAC 
CTCGACACCG GGCCGGTCGA CTACGAGGAC TCGATCTCCG AAGAGTTCTT CCAGGCCGAG
CGCGAGGCGA TCTTCAAGCG GACCTGGCTG AAGGTCGGCC GGATGGAGCA GCTGCCCCGC
AACGGCACGT TCTTCACCCG CGAGTTCGTC GGCCTGGGGT CGATCGTGAT CACCCGGCAC
ACCGACGGCG AGGTGTACGC GCTGCACAAC ATCTGCGCGC ACCGCGGCAA CAAGGTCGTC
TGGCAGGAGC ACCCGACCAA CGAGACCCAG GGCAGCGCCC GGCAGTTCGC CTGCAAGTAC
CACGGCTGGC GCTACGGCCT CGACGGCAAG TGCACCTACG TCACCAAGCG GAACGAGTTC
TTCGAGTCGC TGCCCGACGA CGAGCTCGCC ATGCCGCAGC TGCGCTGCGA GGTCTTCGCC
GGGTTCATCT TCGTGAACTT CAGCCAGGAC GCTCCGCCGC TGCGCCAGTT CCTCGGTGAG
AAGCTGGCCA CCGAGCTGGA GAGCTGGCCG TTCGAGAAGT TCACCAACCA CTGGTCCTTC
CGGACGAAGG TCAAGGGCAA CTGGAAGATC GGCATCGACG CGCTGCTGGA GTGGTACCAC
CCGGCGTACG TCCACGGGCG GTTCCTCAAC ACCAACGTGG CCGAGGCGGA GAAGCTCGTC
CCGCCGATGG ACTCCTACCA TTACGACCTG TTCACCCCGC ACATGCTGAC CTCGGTGCCC
GGCCCGCCGC TGCTGAAGAA GAAGCAGGGC TCGGTCGGCC CGGCCAAGCG GGACATGAAC
TGGGCCTACC GGCTGTTCCG CGCCGGCCTG TTCGGCCCGG ACGACGTCCG CGAGGACCTC
GGCCACCTCA CCCCGGACCG CAACCCCGGC AACGTCCAGT CCTGGAGCAA CGACCAGTAC
TGGCTGTTCC CGAACCTGTC GGTCCAGCTC TGGGGCCGCG GGTACTACAT CACCTACCAG
TACATCCCGG AGACGGTGGG CACCCACGCC TACGAGGTCG ACATCTACTT CCCGGAACCG
AAGACCGCCT CCGAGCGCCT CGCCCAGGAG CTCGTCGTCG ACAGCACCAT CGAGTTCGCG
ATGCAGGACA CGAACACGGT GGAGGCGACC TGGTCGCAGC TCAACAACCG CGCGCTGCAG
ACGTTCCACC TGTCCGACAT GGAGCTGATG ATCCGTCAGT TCCACAAGGT TGTCCGGGAC
GCCGTCGCGG CGCACCAGGC CGGCAGCGAG AAGTAG
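
The gene sequence is displayed in space-separated groups of 10 bases, so the whitespace has to be removed before any downstream analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Strip spaces and line breaks from a sequence shown in 10-character groups."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the string that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGCCATCC AGGAAAAACT TGCGACCGGT AGAGGCAAGT ACACACCGGG GTACCCGAAC"
dna = clean_sequence(first_line)

print(len(dna))   # 60
print(dna[:3])    # ATG
print(round(gc_fraction(dna), 2))
```

Applying the same two helpers to the full 1296 bp sequence would give a value to compare against the GC content field reported in the record.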
 
Protein sequence
MAIQEKLATG RGKYTPGYPN LDTGPVDYED SISEEFFQAE REAIFKRTWL KVGRMEQLPR 
NGTFFTREFV GLGSIVITRH TDGEVYALHN ICAHRGNKVV WQEHPTNETQ GSARQFACKY
HGWRYGLDGK CTYVTKRNEF FESLPDDELA MPQLRCEVFA GFIFVNFSQD APPLRQFLGE
KLATELESWP FEKFTNHWSF RTKVKGNWKI GIDALLEWYH PAYVHGRFLN TNVAEAEKLV
PPMDSYHYDL FTPHMLTSVP GPPLLKKKQG SVGPAKRDMN WAYRLFRAGL FGPDDVREDL
GHLTPDRNPG NVQSWSNDQY WLFPNLSVQL WGRGYYITYQ YIPETVGTHA YEVDIYFPEP
KTASERLAQE LVVDSTIEFA MQDTNTVEAT WSQLNNRALQ TFHLSDMELM IRQFHKVVRD
AVAAHQAGSE K