Gene Franean1_0458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0458 
Symbol 
ID5668879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp541777 
End bp543945 
Gene Length2169 bp 
Protein Length722 aa 
Translation table11 
GC content71% 
IMG OID641239389 
Productresolvase domain-containing protein 
Protein accessionYP_001504827 
Protein GI158312319 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGG CGGGGTCTGT GCTGGGCAGC AGGTCGGAGA GCCGGATCCG GCCGGAGCAC 
ACTGATCGCG CGGCGGTGGT CTATGTGCGG CAGTCCAGCA GGCAGCAGGT CGTCGAGCAC
GCCGAGTCGA CGCGGGTGCA GTATGCGCTG GTCGAGCGGG CCGTGACTCT CGGTTGGGCC
CGGTCGCGGG TGACGGTCAT TGATGATGAC CTGGGGGTGT CGGCGGCGGT CGCGGCGTCC
CGGCCGGGGT TCGCGCGGCT GGTCACCGAG GTGACGATGG GCCGGGTCGG TCTGGTGCTG
GGGGTGGAGA TGTCCCGGTT GGCGCGCACC GGCCGGGACT GGCACCAGCT GATCGAGTTG
TGTTCGTTGG CGGGGACGTT GCTCGCCGAT CTCGACGGGG TCTACGACCC CGGTGTCTAC
AACGACCGTC TGCTGCTGGG GTTGAAGGGC ACGATGTCCG AGGCCGAGCT GTGGCTGATC
CGGCAGCGGA TGTGGGGCGG GAAGCTGGCC AAGGCCGAAC GCGGCGAGCT GGCGTTCGCG
CTGCCGATCG GCTACTGGCG CGACCGCGGC GGGCAGGTGG TGTTCGACCC GGACGAGCAG
GCCCGGACCG TGGTCGGGCT GGTCTTCGAC CTGTTCGACC GGCTCGGGAC GCTCAACGGC
GTGCTGCGCT GGCTGGTCGA CCACCAGGTG CAGCTGCCGG TGCGCTCCCA CAGCGGGGTG
GACAAGGGTG AGCTGACCTG GCGGCGACCC AACCGGGAAA CCCTGCAGGT CATGCTGCAC
AACCCCATCT ATGCCGGGTA CCACGCCTAC GGTCGGCGCC GCGTGGATGC GCGGCGCAGG
AAGGCGGGCC GGCCCAGCAC GGGGCGGGTG GTGCGATCAA TGGACGACTG GCATGTGCTG
CTACCCGACC GGATGCCGGC CTACATCGGC ACCGACCGGT ATGCCGCGAA CCTGGCCCGT
TTGGAGGCCA ACCGGCAGAC CGCCGCCTCA CCCGGAGCGC CCAGGCCAGG ATCGGCACTG
CTGGCGGGCC TGGTGCGCTG CGGACGGTGC GGACATCGGA TGACGGTCAG CTACCACACC
CCGGCCAGCC GGTTCCCGTC GCACAACTAC CACTGCGGCT ACCTGCTCGC CACCTACGGC
ACCGGCCGGA CCTGCCAGCA CCTCGCCGGC CCGGCACTGG ACCGCTACGT GACCGCCCAG
CTGCTCGACG CCGTCGCCCC CGCCGCCCTG GAGGTCTCGC TGGCCGCCGC TGCCCACGCC
GAATCAGACC GGGCCGAGCT GGACACCCTG TGGCGTCAGC GGCTGGAACG CGCCCGCTAC
GCCGCCGGCC GCGCCCGGCG CCAGTACCAG CTCGCCGAAC CGGAAAACCG GCTGGTCACC
CGCCAGCTGG AAACCGACTG GGAGACGGCG CTGGCCGACC TCGACCGGCT CGAAACCGAC
TACCAGCGGT TCGTCGAGGC CCGCCCGCAG ACGCTCACCG CCGCCGAACG GGCGGCCATC
ACCGCGCTCG CCCACGACCT GCCCGCGCTC TGGACAGCGC CGACCACCAG CCAGACCGAC
CGCAAACAGC TCCTGCGCAC CCTGATCGAC GAGATCACCG TGACGGTCGT CGGCACCAGC
GAACTCGTCG ACGTCACGAT CACCTGGGCC GGCGGGCATC AGACCCACGG CCGCACCACC
CGCCCGGTCG CCCGCCTCGA CCAGCTCTCC TACTACCACC ACCTGGTCGA GCGGGTCAGC
GAACTGGCCA GCGCCGGCCA CTCCAGCCGC CAGATCGCCG ACCAGCTCAA CACCGAGGGA
CTACGCCCAC CCAAACGCAC CACCCGCTTC GGCCCCGACC AGATCCTCAC CCTCACCCGC
CGACTTGGCA TCGGGGTCCA CCACCCCCGC GACACCCGCA CCGCCCTGGC CAACCCCGGC
CCCGGCCGCT GGTCGGTCGC CGGCCTCGCT GTCGCCCTGA ACATGCCGAC CGCCACTCTC
TACACCTGGA TCTACCGCGG ATGGATCACG GCGGAACGCC ATCCGGACGG CAGATCCTGG
ATCATCCTCG CCGACGACGT CGAGATCAGG CAGCTCCGCG AACGCCGTGA CCGCCCACCC
GGCTACTACA CCCGAGCCCG CTGGACCCGA CCCCACCTGG ACCACAGCAC GAACGGAACC
CGGACATGA
 
Protein sequence
MSMAGSVLGS RSESRIRPEH TDRAAVVYVR QSSRQQVVEH AESTRVQYAL VERAVTLGWA 
RSRVTVIDDD LGVSAAVAAS RPGFARLVTE VTMGRVGLVL GVEMSRLART GRDWHQLIEL
CSLAGTLLAD LDGVYDPGVY NDRLLLGLKG TMSEAELWLI RQRMWGGKLA KAERGELAFA
LPIGYWRDRG GQVVFDPDEQ ARTVVGLVFD LFDRLGTLNG VLRWLVDHQV QLPVRSHSGV
DKGELTWRRP NRETLQVMLH NPIYAGYHAY GRRRVDARRR KAGRPSTGRV VRSMDDWHVL
LPDRMPAYIG TDRYAANLAR LEANRQTAAS PGAPRPGSAL LAGLVRCGRC GHRMTVSYHT
PASRFPSHNY HCGYLLATYG TGRTCQHLAG PALDRYVTAQ LLDAVAPAAL EVSLAAAAHA
ESDRAELDTL WRQRLERARY AAGRARRQYQ LAEPENRLVT RQLETDWETA LADLDRLETD
YQRFVEARPQ TLTAAERAAI TALAHDLPAL WTAPTTSQTD RKQLLRTLID EITVTVVGTS
ELVDVTITWA GGHQTHGRTT RPVARLDQLS YYHHLVERVS ELASAGHSSR QIADQLNTEG
LRPPKRTTRF GPDQILTLTR RLGIGVHHPR DTRTALANPG PGRWSVAGLA VALNMPTATL
YTWIYRGWIT AERHPDGRSW IILADDVEIR QLRERRDRPP GYYTRARWTR PHLDHSTNGT
RT