Gene Franean1_1205 details


Gene Information

Locus tag: Franean1_1205
Symbol:
ID: 5669618
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1440503
End bp: 1442290
Gene Length: 1788 bp
Protein Length: 595 aa
Translation table: 11
GC content: 75%
IMG OID: 641240137
Product: RNA modification protein
Protein accession: YP_001505565
Protein GI: 158313057
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0621] 2-methylthioadenine synthetase
TIGRFAM ID: [TIGR00089] RNA modification enzyme, MiaB family
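
The coordinate and length fields in this record are mutually consistent, which a short sketch can check. It assumes 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. The function and variable names are illustrative and not part of the record.

```python
# Values copied from the record; the names are illustrative only.
START_BP = 1440503
END_BP = 1442290


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a single terminal stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1788 595
```

Both results match the Gene Length (1788 bp) and Protein Length (595 aa) fields.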


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00267373
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.652733
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCTCCTC GTCCTGCACG CCGGGTAGCT CTCGTCACGC TCGGGTGTTC CCGTAACGAG 
GTGGATTCCG AGGAGCTTGC CGCCCGCCTC GCCGCCGACG GCTGGGACCT GGTCGACGAC
GCCGCCGACG CGGACGCCGT CCTGGTCAAC ACCTGCGGTT TCGTCGAGGC GGCGAAGAAG
GACTCGATCG ACGCCCTGCT CGCCGCCGAC GCACTGCGCG GGCCCGGCGG CAACGGGCCC
GGTGACGGTG AGTCCGGCGA TGACGGGCCG GGCGGCGGCG CGACCACCGG GCGGGCGACC
GCGTCGGGCG GGCCGCGCGC GGTCGTGGCC GTGGGCTGCA TGGCCGAGCG GTACGGCCGG
GAGCTCGCCG ACACGCTGCC CGAGGCGGAC GCCGTCCTCG GCTTCGACGC ATATCCGCGG
ATCTCGGCGC ACCTCGACGC GGCGCTCGCC GGCTCGGCGC CGGCCTCGCA CACCCCGCGT
GACCGGCGCA CGCTGCTGCC GATCTCTCCG GTGGAGCGGG GCGACGCCTC CCGCGCGGCG
CACTCGCCGC ACATCCCTGG TCACATCCAG CTCCCCGGTG GTGGACGTGT CCTTCCGTCG
CCGGACGGCC GACCGGCGGC GAACAGCCGC CCGGCAACGG ATCTGGCGGA CCTGGCCGGC
CTGGCGGAGC CGGCGGGGGA GGGCGCGGCC GGTACCGGCC GGCGGCGGCT CACCTCCAGC
CCGGTCGTGC CGCTGAAGCT GTCGAGCGGT TGCGACCGCC GTTGCGCCTT TTGCGCCATC
CCGTCCTTCC GTGGCTCGCA CGTGTCCCGC CGGCCGGAGG AGGTACTGGC CGAGGCCGAG
TGGCTGGCTG GGCAGGGCGC CCGCGAGCTT GTCCTGGTCA GCGAGAACTC GACGTCCTAC
GGCAAGGATC TCGGCGATCT GCGGGCGCTC GAGAAGCTGC TGCCCCTGCT GGCCGCCGTT
CCGGGGATTG TCCGTGTCCG CACTGTGTAT CTCCAGCCGG CCGAGCTGCG GCCGTCCCTG
CTCGAGGTGC TGCTGACGAC GCCGGGCCTG GCGCCCTACC TTGACCTGTC GTTCCAGCAC
GCCAGCCCGG CGGTGCTGCG GCGGATGCGC CGGTTCGGCG GCTCGACCGA CTTCCTGGAC
CTGCTGCGGC GGGCCCGCGC GCTGCTCCCC GACCTGGGCG CCCGCTCGAA CGTGATCGTC
GGCTTCCCCG GTGAGACCGA CGAGGACGTC GACATCCTGG TGAATTTTCT CGAGCGCGCC
GACCTCGACG CTGTCGGGGT GTTCGGCTAC TCCGACGAGG AGGGGACGGA GGCCGCCGGG
ATGGCGGGCC ACGTCGACCC CGAGGAGATC GAGAGCCGGC GGGCCGAGGT CACCGACCTC
GTCGAGCAGC TCACCGCAGC CCGCGCCGAG CGCCGCATCG GCACGACCGT CGAGGTGCTC
GTCGAGGAGG TGGCCGGCGG TCTCGGGTAC GGCTGCGCCG GGCACCAGCA GGCCGACGCC
GACGGCTCCT GCACGGTCCG CCTGCCCGCG GGCGGGCCAC CGGGCGGGGT GTCCGTCGGG
GACCTCGTCG AGGCCCGGGT CGTGGCGGCC GAGGGCGTCG ACCTGATCGC GGAGTTCACC
GGCGTGCTCG ACCGTGCCGG CGCCGGGCTG GCCAGCGCTG GGTCAGCCGG TGCCGGGTCG
GTCGGTGCCG GGTCGGTCGG TTCGGCCGGG GCGGGTTCCG TGCTGCCGCC GATCCCGGAC
GGGGTCGGCC CGATGGACGC GGTGGGCCAC CCTTCGGGCG TGGCGTGA
 
Protein sequence
MSPRPARRVA LVTLGCSRNE VDSEELAARL AADGWDLVDD AADADAVLVN TCGFVEAAKK 
DSIDALLAAD ALRGPGGNGP GDGESGDDGP GGGATTGRAT ASGGPRAVVA VGCMAERYGR
ELADTLPEAD AVLGFDAYPR ISAHLDAALA GSAPASHTPR DRRTLLPISP VERGDASRAA
HSPHIPGHIQ LPGGGRVLPS PDGRPAANSR PATDLADLAG LAEPAGEGAA GTGRRRLTSS
PVVPLKLSSG CDRRCAFCAI PSFRGSHVSR RPEEVLAEAE WLAGQGAREL VLVSENSTSY
GKDLGDLRAL EKLLPLLAAV PGIVRVRTVY LQPAELRPSL LEVLLTTPGL APYLDLSFQH
ASPAVLRRMR RFGGSTDFLD LLRRARALLP DLGARSNVIV GFPGETDEDV DILVNFLERA
DLDAVGVFGY SDEEGTEAAG MAGHVDPEEI ESRRAEVTDL VEQLTAARAE RRIGTTVEVL
VEEVAGGLGY GCAGHQQADA DGSCTVRLPA GGPPGGVSVG DLVEARVVAA EGVDLIAEFT
GVLDRAGAGL ASAGSAGAGS VGAGSVGSAG AGSVLPPIPD GVGPMDAVGH PSGVA
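
The sequences above are printed in space-separated blocks of ten characters, and the gene starts with GTG while the protein starts with M. The sketch below shows one way to strip the block formatting, compute a GC fraction, and translate a coding sequence. It assumes that translation table 11 uses the standard amino-acid assignments and reads an initiating GTG (or another listed start codon) as methionine. All names are hypothetical helpers, not part of the database.

```python
BASES = "TCAG"
# Standard amino-acid assignments in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for table 11; an initiating codon is read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiating codon is read as Met
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
first_blocks = "GTGTCTCCTC GTCCTGCACG CCGGGTAGCT"
print(translate(clean(first_blocks)))  # MSPRPARRVA
```

The output matches the first block of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` gives a value to compare with the GC content field.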