Gene Franean1_1751 details

Gene Information

Locus tag: Franean1_1751
Symbol:
ID: 5670153
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2100152
End bp: 2101141
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 76%
IMG OID: 641240672
Product: hemolysin A
Protein accession: YP_001506095
Protein GI: 158313587
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1189] Predicted rRNA methylase
TIGRFAM ID: [TIGR00478] hemolysin TlyA family protein
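
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2100152, 2101141)
print(length, protein_length_aa(length))  # 990 329
```

Under these assumptions the result agrees with the listed Gene Length (990 bp) and Protein Length (329 aa).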


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00952938
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000670381
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCCCGCA GGATCCGTCT GGACGCGGAG CTCGTCCGGC GCCGGCTCGT CCCCTCCCGG 
GAACGGGCCG TCGAGGCGAT CGCCGCGGGC CGCGTGCGGG TCGGCGGCGT CACCGCGACG
AAGCCGGCGA CGGTCGTCGA CGGTGCGACG TCGATCGTGC TCGCCGTGGA CGACGACCCC
GGCTATGCCT CCCGGGGAGC GCACAAGCTG GTCGGCGCGT TCGAGGCGTT CGGCGTCCCA
GCGCCCGGGA CGCCCGACCT GTCCGGCGGG CCCGGCCAGC CTGATGTGCC GGATCGGCCT
GGTTCGCCCG CGGCGGTGGC GGGGCCGCCG GCGCTCGTGG TCGCCGGGCG GAGGTGCCTC
GACGCCGGTG CGTCGACCGG CGGGTTCACC GACGTGCTCC TCCGGTACGG CGCGGCGCGG
GTGGTGGCCG TCGACGTCGG ATACGGGCAG CTCGTCTGGC GGCTGCGCTC GGATCCGCGG
GTGCGCGTGC TGGACCGGAC GAACGTCCGC AACCTCACGC CCGAGCAGGT CGGGGAGCCG
GTGGAGCTGG TCGTGGGCGA CCTCTCGTTC ATCTCGTTGG TCCTGGTGCT GCCCGCGCTG
CGCGCGTGCG CCGCGCCGGA CGCCGACTTC GTCCTGCTGG TCAAGCCGCA GTTCGAGGTG
GGCCGGGAGT TACTCGGCTC CGGTGGTGTG GTCCGTGATG TGGCCCTGCA CGCTCGGGCG
GTGCGCACCG TCGTGACCGC CGCCGAGGGG CTCGGGCTGG GGGTGCGCGG CGTGGCGGCC
AGCCCGCTGC CGGGGCCCGC CGGCAACGTC GAGTACCTCG CCTGGCTGCG CGCGGACGTC
CGCCCGACTC CAGACGAGGT CGAGGCGATG ATCACCACGG CGATCGAGGC GGGGCCCGCG
GGCACGGCGG CCCCCGCCCC AGCGCCGCCG CCGTCGTCCC AGGACGCCGG CCAGGACGCC
CCAGCCGACG GCGAAAGGAC CGGCCGATGA
 
Protein sequence
MARRIRLDAE LVRRRLVPSR ERAVEAIAAG RVRVGGVTAT KPATVVDGAT SIVLAVDDDP 
GYASRGAHKL VGAFEAFGVP APGTPDLSGG PGQPDVPDRP GSPAAVAGPP ALVVAGRRCL
DAGASTGGFT DVLLRYGAAR VVAVDVGYGQ LVWRLRSDPR VRVLDRTNVR NLTPEQVGEP
VELVVGDLSF ISLVLVLPAL RACAAPDADF VLLVKPQFEV GRELLGSGGV VRDVALHARA
VRTVVTAAEG LGLGVRGVAA SPLPGPAGNV EYLAWLRADV RPTPDEVEAM ITTAIEAGPA
GTAAPAPAPP PSSQDAGQDA PADGERTGR
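
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus two basic checks (GC fraction and a terminal stop codon); the helper names are hypothetical, and the stop codon set assumes the standard stops used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons assumed for translation table 11


def normalize_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence shown in this record.
first_line = "GTGGCCCGCA GGATCCGTCT GGACGCGGAG CTCGTCCGGC GCCGGCTCGT CCCCTCCCGG"
last_line = "CCAGCCGACG GCGAAAGGAC CGGCCGATGA"

print(normalize_sequence(first_line)[:3])            # first codon of the CDS
print(ends_with_stop(normalize_sequence(last_line)))  # stop codon check
```

Applying `gc_fraction` to the full normalized gene sequence gives a value that can be compared with the GC content field. The first codon here is GTG, which table 11 allows as an alternative start codon, consistent with the protein sequence beginning with M.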