Gene Franean1_6731 details


Gene Information

Locus tag: Franean1_6731
Symbol:
ID: 5675044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8186498
End bp: 8187424
Gene Length: 927 bp
Protein Length: 308 aa
Translation table: 11
GC content: 69%
IMG OID: 641245580
Product: NLP/P60 protein
Protein accession: YP_001510971
Protein GI: 158318463
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0791] Cell wall-associated hydrolases (invasion-associated proteins)
        [COG2951] Membrane-bound lytic murein transglycosylase B
TIGRFAM ID:
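
The gene and protein lengths listed above follow from the start and end coordinates. The short check below is only a sketch: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 8186498
end_bp = 8187424

# Inclusive interval: both end points count as bases of the gene.
gene_length = end_bp - start_bp + 1

# One amino acid per codon, minus the terminal stop codon (assumption).
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.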


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00865661
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.270738
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCGGC TCGTCCTCGG ACTGTGCGCG CTGCTGCTCG CCGTACCGAT CCTCGCCGGC 
GGCGTCGCGG CCGGACTCCT CGGCGGCGAG GCCGGCGGCG GCGAGCCGGC ATCCGCCGCG
GCCGCCGCGG GGGAGATCCC TGTCGACTAC CAGCGGCTCT ACGTCACCGC TGCGGCCACC
TGCCCGGGGC TGCCGTGGAC GGTGCTGGCC GCGGTCGGGA AAGTCGAGAC CGACCACGGG
CAGAACCCGG ACTGGACCTC GCTGGCCGGC GCCCAGGGGC CGATGCAGTT TCTGCCTACC
ACCTTCGCCG CCTATGGGGT CGACGGCGAC GCCGACGGCA GCACCGACAT CAACAATCCC
GCCGACGCTG TCTACTCCGC CGCCCATTAT CTGTGTGCCT CCGGCGCACA GAACGGCGCG
AATATTCCCG GGGCGCTCTA CACCTATAAT CATGATAATT CTTACGTGAC GCGGGTTCTC
ACCCAGGCCG ACGTTTACAC CACCTCCGAC CTCACCACCA GCAGCGGCCC GTCGGACGCG
GCGCTGACCG CGGTGGACTA CGCCACCGCG CAGATCGGCC TGCCGTATCT GTGGGGTGGG
GACGGCCCCG ATTATGGCGA GAAAGGCTTT GATTGTTCGG GGCTTACCCG GGCCGCCTAC
GCCGCCGCCG GAGTCACCAT CCCCCGCGTC GCGCAGGCCC AGTTCAACGC CGGACCACGA
CTACCCCCAG GGGCCCCACT GGAAATCGGA GACCTCGTAT TCTACGGCCC GTCCGACATT
GACATCACCC ACGTGGGAAT TTACCTTGGC AGCGGGGAAA TGGTGAACGC GCCCCGGCGC
GGGGCCCCGG TCCGGACCGA AACCTACGTC CGACCCAGCT ACCGCGGAGC CACCCGACCC
GTCCCCGCGT CGGCCGGCTT TCCCTGA
 
Protein sequence
MTRLVLGLCA LLLAVPILAG GVAAGLLGGE AGGGEPASAA AAAGEIPVDY QRLYVTAAAT 
CPGLPWTVLA AVGKVETDHG QNPDWTSLAG AQGPMQFLPT TFAAYGVDGD ADGSTDINNP
ADAVYSAAHY LCASGAQNGA NIPGALYTYN HDNSYVTRVL TQADVYTTSD LTTSSGPSDA
ALTAVDYATA QIGLPYLWGG DGPDYGEKGF DCSGLTRAAY AAAGVTIPRV AQAQFNAGPR
LPPGAPLEIG DLVFYGPSDI DITHVGIYLG SGEMVNAPRR GAPVRTETYV RPSYRGATRP
VPASAGFP
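
The protein sequence can be related to the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal illustration, not the database's own pipeline: `translate`, `CODON_TABLE` and `START_CODONS` are hypothetical names, the codon assignments are those of the standard code (which table 11 shares), and the only table 11 feature modelled is that an alternative start codon such as GTG is read as methionine at the first position.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping and differs only in its start codons.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # A valid start codon in first position is read as Met.
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GTGACCCGGC TCGTCCTCGG ACTGTGCGCG"))
```

Applied to the first 30 bases, this gives the first ten residues of the listed protein, with the leading GTG read as M. The same function can be run on the full gene sequence to compare against the protein sequence.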