Gene Franean1_4158 details


Gene Information

Locus tag: Franean1_4158
Symbol:
ID: 5672513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4938710
End bp: 4941040
Gene Length: 2331 bp
Protein Length: 776 aa
Translation table: 11
GC content: 78%
IMG OID: 641243031
Product: magnesium chelatase
Protein accession: YP_001508448
Protein GI: 158315940
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1239] Mg-chelatase subunit ChlI; [COG1240] Mg-chelatase subunit ChlD
TIGRFAM ID:
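
The start, end, and length fields above agree with each other under simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length counts one terminal stop codon; both are assumptions, not statements from the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 4938710
end_bp = 4941040

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon,
# which does not contribute an amino acid to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2331 776
```

Both results match the listed Gene Length (2331 bp) and Protein Length (776 aa).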


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0127951
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.72272
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGTGACGT TCCCGTTCTC CGCGCTGGTC GGCCAGGATG ATCTGCGCTT GGCGTTGTTG 
CTCAATGCGG TGTCCCCGAC GGTGGGCGGA GTGCTGGTCC GCGGGGAGAA GGGCACGGCG
AAGTCGACGG CGGTCCGCGC GCTGGCGGGG TTGCTGCCGC CGGTGGCGGT GGTGCCGGGC
TGCCGGTTCT CCTGCGATCC GGCGGCGCCG GTGGTGCACT GCCCGGATGG CCCGCACCCG
ACCAGCCTCG CCGACGGTCA CGGTGCCGGC GGGGTGGGGG TGTCGCGGCC GGCGCGGCTG
GTGGAGCTGC CGGTCGGCGC GTCGGAGGAC CGGGTGACCG GCGCGTTGGA CCTGGACCGG
GCACTGGCCG AAGGGGTGAC CGCGCTGCGC CCGGGGCTGC TCGCGGCCGC GCACCGCGGG
GTGCTCTACG TCGACGAGGT GAACCTGCTC GCCGACCATC TGGTGGATCT GCTGTTGGAC
GCGGCCGCGC TCGGGGTGGC GCAGGTGGAA CGCGACGGGG TGTCGGTCAG CCATCCGGCG
CGGTTCTGGC TGGTGGGGAC GATGAACCCG GAGGAGGGTG AGCTGCGCCC GCAGCTGCTG
GACCGGTTCG GGTTGACGGT GCAGGTGGCG GCCAGCCGTG ACCCGCGGAT ACGCGCGCAG
GTGGTACGCC GCCGGTTGAC GTTCGAAGCC GACCCGGTCG GGTTCGCCGC GCTCTGGGCC
GGCGCGGAGG CGGAGCTGGC GGGGCGGATC GCGGCCGCGC AGGCCCGGTT GGGCCGGGTG
GAGCTCTCCG ACGCCGCGTT GGACGCGGTC GCCGCGGTGT GCGCGGCGTG TGACGTCGAC
GGGATGCGCG CCGACGTGGT GCTGGCGAAG ACGGCGATGG CGCGCGCGGC GTGGGAGGGT
CGGGCGGAGG TCAGCCCGGA TGACATTCGG GTCTCGGCCC GGTTCGTGCT GCCGCATCGG
CGCCGCCGTG GCCCGTTCGA CGCCCCGGGT GGGGACGGCC GCGCGCTCGA CGACACGGTC
GAGGAGGTGC TGGCCGACCG GGTACGCCCC GGCGACGGCC CGGACGGGGA CGGGGACGGG
GACGGGGACG GGTCGGATGA TTCTCCTGGT GGCGGGTCGC CGGGTGGTGG CCAGCGGTGG
CGTGACGACA CCGGCGGCGC CGGCGGGTGG GACGGGTCTG GCGGGGATGG GTCGGGGGTG
GATGTTCCGG CTGGTGGTAC CGGTGACCTC ACCCCCGCTT CGGCGAGGGC CGGTGGCGAG
TCCGCTGATC TGAGCGGGCC GGCGGCGGGT GGGCCTGAGG GTGTCGGGCC GGCGGGTCCT
CCCCCGCCCG GTGCCGCCCT GTCGTCCGCG GCGCCGGCGG GCGGCGGTGG GCCGGCAGCG
GAGGCGGGCG GCCCGGCGGG CATGGCCCGA GCAGCCCAGA CAGCGCAGAC GCATGTGGAC
GGCGGCGCGG CGCGCGGCGG TGGGCCGGCG CCGGCGGCTG GGGTTCCCGG TATGCCGTTC
CGGCCGCGGC TGCTGTCCGT CGCCGGGCTG GGCGCGGGGG CGGCCGGGCG TCGTTCGGCG
GCACGCACCT CCCGTGGGGC GGTCGCCCGG ACCGGCGCGG ACGCACCTGG GTTGCACCTG
CCGGCGACGC TGCTGGCCGC TGCCCCCCAC CAGCACGGCC GGGGCCGCAC CGGCCCGGCG
CTGGTGCTCG ACCCCGCCGA CCGGCGCGGC GCGCAGCGGC GCGGCCGGGA GGGGAACCTG
GTGCTGTTCG TCGTCGACGC CAGCGGGTCG ATGGCCGCCC GCACCCGGCT GCGCCGGGTC
AGCACCGCGG TGCTGTCTCT GCTTGTCGAC GCCTACCAGC GCCGCGACCG CATCGGCATG
ATCACGTTCC GGGGGGTGGG TGCGCAGGTG GTGCTTGCGC CGACGTCCAG TGTGGAGGTG
GGGGCGGCGC GGCTGGTCGG GTTGCGGACC GGGGGGCGGA CCCCGATCGC GGCCGGGTTG
GAATGCGCCG GGGTGGTGCT GCGCGCCGAG GCGCGCCGCG ACCCTGACCG CCGTCCCCTG
CTGGTGCTGG TCACCGACGG GCGGGCCACC GCCGGGGGTG ATCCGGGTGC GGCGGCGCGG
GGGTTGCTGC GCGCGGCTGG TGGGGTCGCC GGGCAGGGCG GCGGCGCACG CGGTGGGGCC
GGGGCGGCTG GCCGGGTGCG CCGGGGTGGG TTGGCGAGTG TGGTCGTGGA CTGTGAGACC
GGCCCGGTGC GCCTCGGGCT GGCCGGGCGG CTCGCCGCCG TGCTCGGCGC CGACCTGGTC
GGCCTCGACG CCCTCCCGCA GGCCGGCGGC GTCCCCGGGC GGGTCGCCTG A
 
Protein sequence
MVTFPFSALV GQDDLRLALL LNAVSPTVGG VLVRGEKGTA KSTAVRALAG LLPPVAVVPG 
CRFSCDPAAP VVHCPDGPHP TSLADGHGAG GVGVSRPARL VELPVGASED RVTGALDLDR
ALAEGVTALR PGLLAAAHRG VLYVDEVNLL ADHLVDLLLD AAALGVAQVE RDGVSVSHPA
RFWLVGTMNP EEGELRPQLL DRFGLTVQVA ASRDPRIRAQ VVRRRLTFEA DPVGFAALWA
GAEAELAGRI AAAQARLGRV ELSDAALDAV AAVCAACDVD GMRADVVLAK TAMARAAWEG
RAEVSPDDIR VSARFVLPHR RRRGPFDAPG GDGRALDDTV EEVLADRVRP GDGPDGDGDG
DGDGSDDSPG GGSPGGGQRW RDDTGGAGGW DGSGGDGSGV DVPAGGTGDL TPASARAGGE
SADLSGPAAG GPEGVGPAGP PPPGAALSSA APAGGGGPAA EAGGPAGMAR AAQTAQTHVD
GGAARGGGPA PAAGVPGMPF RPRLLSVAGL GAGAAGRRSA ARTSRGAVAR TGADAPGLHL
PATLLAAAPH QHGRGRTGPA LVLDPADRRG AQRRGREGNL VLFVVDASGS MAARTRLRRV
STAVLSLLVD AYQRRDRIGM ITFRGVGAQV VLAPTSSVEV GAARLVGLRT GGRTPIAAGL
ECAGVVLRAE ARRDPDRRPL LVLVTDGRAT AGGDPGAAAR GLLRAAGGVA GQGGGARGGA
GAAGRVRRGG LASVVVDCET GPVRLGLAGR LAAVLGADLV GLDALPQAGG VPGRVA
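
Both sequences are displayed in blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below shows one way to do that, using only the first display line of the gene sequence. It assumes that translation table 11 uses the standard codon-to-amino-acid assignments and that the initiator codon (here GTG) is read as methionine. The helper names `join_blocks`, `gc_fraction`, and `translate` are illustrative and not part of any database tool.

```python
# First display line of the gene sequence, copied verbatim.
first_line = "GTGGTGACGT TCCCGTTCTC CGCGCTGGTC GGCCAGGATG ATCTGCGCTT GGCGTTGTTG"


def join_blocks(text):
    """Remove the spaces and line breaks that are used only for display."""
    return "".join(text.split())


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# Codon table written in the conventional T, C, A, G base order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
TABLE = dict(zip(CODONS, AMINO))


def translate(seq):
    """Translate complete codons; the first codon is treated as the start (Met)."""
    usable = len(seq) - len(seq) % 3
    protein = "".join(TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return "M" + protein[1:] if protein else protein


seq = join_blocks(first_line)
print(len(seq))             # 60
print(translate(seq[:30]))  # MVTFPFSALV
print(round(gc_fraction(seq), 2))
```

The translated fragment matches the first ten residues of the protein sequence listed above. The GC fraction of this single line need not equal the 78% reported for the whole gene.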