Gene Franean1_5685 details

Gene Information

Locus tag: Franean1_5685
Symbol:
ID: 5674011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6902441
End bp: 6903502
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 75%
IMG OID: 641244538
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001509941
Protein GI: 158317433
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
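
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 6902441
end_bp = 6903502

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```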


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0319041
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGTC GCCTGACCAA CACCGACCCG GCGCTGCGGC ACGGCTGGCA CCCGGTCGCC 
CGGTCGCCCG AGCTCGCCGA CGAGCCGATC GCCGTCCGGC TGCTCGGCGA GCCGTGGGTG
CTGGCCCGGC TCGACGACCA GGTGGCGGCC TTCGCCGACT GGTGCCCGCA CCGGCTCGCC
CCGCTGTCGG CGGGACGGGT CGAGGGCCAC GAGCTCGTCT GCGGCTACCA CGGATGGCGG
TTCGTCGCGT CCGGGGAGTG CACCGCGGTA CCGGCGCTCG GCCCGGGAAT ACCGGCGCCG
CGGCGGGCCC GCGCCATCCC GCCGTGGGGG GTGACCGAGC GGCACGGGCT GGTGTGGATC
GCGCCCGCGG AGCCGTTCGC CGACATCATC GAGCTGCCGG AGGCGGCCGA GGACGGGTTC
GACGACGCCT GGCTGCCCGC CGCGCGCACG ACGGCCTGCG CGGGCCTGCT CGCCGACAAC
TTCCTCGACA CCGCGCACTT CCCGTTCGTG CACGCCGCCA CCATCGGCGC CGGCGAGGAG
ACCGTCGTCG CGCCGTACCG GGTGGACGCC GACGGCGACG GCTTCCTGGT CCGGATGGAT
CAGGAGGTCG CCAACCCGGA GGATCCGGGG GTCGCGGCCG GGCTCCGTCC GCTCATCCAA
CGCCGCACGT CGACGTACGT GTACCGCCCG CCGTTCATGC TGCGGCTGCG GCTGGAGTAC
CCCGACGCCG GGATCACCAA CACGATCCTG TTCTGCCTGC AGCCCGAGGA GGCCGCCGCG
ACCCGCGTCT ACACCCGCAT CCTGCGCGAC GACCTGGGCG GTGATCCCGC CCGGCTGGCC
GAGGCCGTCC GCTTCGAGCA GGCGGTGCTC GACGAGGACC TCGCCCTGCA GGAGCGCTTC
ACCATCGACG GACTCCCGCT GATCTCCGGG GACGGTGGGA CGGCCGCGGA GGTCAGCATC
CGCGCGGACG CGGCGGGCGT GGCGCTGCGC CGGGTGCTGG CCGCTGTCGT CGCCAGGGCA
GCCCGCAGCT CTCCCACCAG GGCAGCGGAT ATCCGACACT GA
 
Protein sequence
MTGRLTNTDP ALRHGWHPVA RSPELADEPI AVRLLGEPWV LARLDDQVAA FADWCPHRLA 
PLSAGRVEGH ELVCGYHGWR FVASGECTAV PALGPGIPAP RRARAIPPWG VTERHGLVWI
APAEPFADII ELPEAAEDGF DDAWLPAART TACAGLLADN FLDTAHFPFV HAATIGAGEE
TVVAPYRVDA DGDGFLVRMD QEVANPEDPG VAAGLRPLIQ RRTSTYVYRP PFMLRLRLEY
PDAGITNTIL FCLQPEEAAA TRVYTRILRD DLGGDPARLA EAVRFEQAVL DEDLALQERF
TIDGLPLISG DGGTAAEVSI RADAAGVALR RVLAAVVARA ARSSPTRAAD IRH
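
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It builds the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard table for internal codons, and translates the first 60 bases of the gene sequence. The function name `translate` and the constant names are hypothetical, and alternative start codons permitted by table 11 are not handled (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base
# varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks used in the record can be
    pasted in directly. Trailing bases that do not fill a codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE[seq[i:i + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCGGTC GCCTGACCAA CACCGACCCG GCGCTGCGGC ACGGCTGGCA CCCGGTCGCC"
print(translate(first_line))
```

Passing the full gene sequence to the same function should reproduce the full protein sequence, since the final codon TGA is a stop codon.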