Gene Franean1_1645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1645 
Symbol 
ID5670047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1964442 
End bp1966430 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content78% 
IMG OID641240563 
Productmembrane-bound lytic murein transglycosylase B-like protein 
Protein accessionYP_001505989 
Protein GI158313481 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2951] Membrane-bound lytic murein transglycosylase B 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0719412 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCGG CGGACGGCGA CACGCCGAGC GGCCGCCACC GGCGGCCCCG GCGGCAGCCC 
CGTGACGGTC GCGTCCGGAC GGCGTTGCCA GTGGTCGCCA CGGCGGGGCT GCTGCTGCTC
TGCTCCGGCG CGGCGGCCCA GCCGGAGTCG GAGCCGTTGC CATCCGATCA GGCCGGCCGG
TGGAACCTGG CGGATTCCAG GCCGCCGGGC ACGGCGCCGA CCGACGTGGC GAGCCTCACC
GAGATCCCCG CGGGCCTCGA CCCGCTCGGA GAGGACTCTG GTGCGGCCGG CGATGGCGGT
ACCGACGCGA CGACCGCCCC GGGGGACGAG ACCGGCGGGG ACGGGAGCGC CGGCAGCGGG
AGCGGGAGCG ACGACGGCTC GGTGCCGGCG GGCACTGTCA TCACGGGTGG GCGGGGCGTC
CCCGACCGCG TCCTCGACGC CTACCGCCAG GCCGCCGGGC GCGTCGAACG GGAGCTGCCC
GGCTGCCACC TCCCGTGGGA ACTCCTCGCC GCCATCGGGA AGATCGAGTC CGGCCACGCG
GCGGGCCGGC CGATGGCCGA CGACGGCACG GTGACCCGGC CGATCCTCGG TCCCGTCCTC
GACGGCCGGG ATGGGCGCGC ACTGATCCGC GACTCGGACG ACGGCGTGTT CGACGGCGAC
GCCACGCTGG ACCGCGCGGT CGGCCCGATG CAGTTCATCC CGACCACCTG GCGGACGTCG
GGCCGGGACG GGTCGGGCGA CGGCCGCCGC GATCCGCAGA ACATCCACGA CGCGACGCTC
GCCGCGGGCG GCTACCTCTG CGCCCACGGG CGGGACGTGA GCCGGCCGGA CCAGCTGCGC
GCGGCGATCT TCTCCTACAA CCCGTCGGCG TCCTATGTGA ACGCCGTGCT GACCTGGATG
ACCGCCTATC GGGAGAAGGG CGCGACCGCG CTGCCCGGCG AGCCGGCGCA GGCCGGTGAC
CCGGAGCCGG CTATCCCGGA GCTGACCGCC CCGGAGCCGG CTGGCGACCC CCTGGTCCCG
GCCGAGCCCA CGACGCCAGC TGAGCCCACC ACGCCGGCTG AGCCCACCGC GCCGGCCGAA
CCGGGCACGC CGGCCCCCGG TGAGCCGCCG GCCGGGCCGG GGCCGTCGCT GCCTGCCGGC
GGGGACGGGG GGCCCCGCGT CGAGGTCGTC CCGGAGGAGC ACCGGCCGGG TGACGGGGGC
GAGGTCGGCG AGGAGCCGGT GTCCGCGGCG CTGCGCGACC TTGGCATCGC GGTCACCGAT
CTCGCCGTCA CACCGGTCGA GCTCGACGGC GCCGCCGCCG GCTTCGAGGC GCTGGACCTG
ACCTCGGCGC GCCGCCCGGC TCCGGCCGGA CCGCTGCGGG TGGTGGCCAC CGCCGCGACG
GAGGCCGGGC GCCCGATCGC GAGAAGCGAG ATCACGATTC CCGCCGAGCC GGACCGGCCC
GCGCAGGCAG CGCCCGGGGG CACGGGAGCG GCGTCCGGGG CCGCTGGAAC GACGCCCGGG
GAGGAGAAGC CGACCCTGCT CGCGCGGCTC GCCGGCGGGG ACCTCGCCGC GGCGGGCCTG
CCCGCGGGAC GGTTCGTCCT CACCCTCGAG GCGCATGCCG AGTCCGGGCG CGTGTACACG
GTGCGGCTGC TGGTCAGCCA GGTGAACGTG AAAGCCTTCG TCGCCCGCCC GGCGGCCACC
GGCACTCCGC CCACCGGGCA CGCGCCGTCC CCGAAGCCGC CGGCGTCCGC ACCCGGGCCG
TCTACCGGAT CGCCCGCCGG CGCACCGAGA ACTCCCGTCC GGCCCGCCAC TCCACCGGCC
GCCTCGCCGC CAGCCACGTC GACACCGGCC ACCTCAAAGC CAGCCACGTC TCCACCGGCC
GCCTCGCAGC CGGCCGCCTC GCAGCCGGCC ACCACGCAGC CGGCCACCAC GGCGCCAGCC
GCGGTGGGGT CCGGCCCCGC GACGCCCACG GCGACTCGGA CGGGTGGCCC GGCGTCCGCG
GCGCCGTAG
 
Protein sequence
MTPADGDTPS GRHRRPRRQP RDGRVRTALP VVATAGLLLL CSGAAAQPES EPLPSDQAGR 
WNLADSRPPG TAPTDVASLT EIPAGLDPLG EDSGAAGDGG TDATTAPGDE TGGDGSAGSG
SGSDDGSVPA GTVITGGRGV PDRVLDAYRQ AAGRVERELP GCHLPWELLA AIGKIESGHA
AGRPMADDGT VTRPILGPVL DGRDGRALIR DSDDGVFDGD ATLDRAVGPM QFIPTTWRTS
GRDGSGDGRR DPQNIHDATL AAGGYLCAHG RDVSRPDQLR AAIFSYNPSA SYVNAVLTWM
TAYREKGATA LPGEPAQAGD PEPAIPELTA PEPAGDPLVP AEPTTPAEPT TPAEPTAPAE
PGTPAPGEPP AGPGPSLPAG GDGGPRVEVV PEEHRPGDGG EVGEEPVSAA LRDLGIAVTD
LAVTPVELDG AAAGFEALDL TSARRPAPAG PLRVVATAAT EAGRPIARSE ITIPAEPDRP
AQAAPGGTGA ASGAAGTTPG EEKPTLLARL AGGDLAAAGL PAGRFVLTLE AHAESGRVYT
VRLLVSQVNV KAFVARPAAT GTPPTGHAPS PKPPASAPGP STGSPAGAPR TPVRPATPPA
ASPPATSTPA TSKPATSPPA ASQPAASQPA TTQPATTAPA AVGSGPATPT ATRTGGPASA
AP