Gene Franean1_0066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0066 
Symbol 
ID5668491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp81495 
End bp82970 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content55% 
IMG OID641238994 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001504439 
Protein GI158311931 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.592117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCTTAG GCGGCAGTAT GGGAGGACTT GGCGTGGTCG ACGTGATGGT GAAGATGCAA 
CCCTCGGACG GGGGCTCGGT TGCCGACTCT CTTGATCAAG CTGATGAGGT CATGAGGTCC
GCCGACATTA CCCTTGACAG CCACTTTGAG CCCATTCGAA TGCAGGGAAA GAGCCGGGTC
GGGAGAGATG AAATTCTGCC GTACGGTGGC GACCGATTCA GGCCTTCTTC GAGCGCTGGG
GAAGATGCGG AACTGCGAAC CGAGAAAGGC GGTCCGGCAT GCGTTATAGT GCGTGCTTCG
GTAGATACTT GCGATCCTCG GGAGATTGAG GGGTCGCTCA TCGAATTGAA TCGGAGATCT
GAGGTCATCT CAATCTACTC GGACCCAGAG ATCGCACCAT ATTGGAAATG CTGGAGAGAA
AATCAGCCCG AAAATTTGTC GGATATGCTT TCGGTGCTGA ACCTGCAAGA ACTTTCCGCT
GTTGGCATGG ATGGCGATGG TGTTGATGTC GCGGTTGTTG ATGGTGGAAT TGACGCGGAT
TACCTTGTGC AACGCTCGCG CGATCTCAAG CCCCTAGAGG GCTGGCACCC AGACAACCTA
CAGAATACCC CAGGTCAATA CGCCGTAAAT AATGATCGCG ATGCGGCGCA TGGAACAATG
TGTGCCCATG AAGTTCTACT GGCCTCACCC CGGGCGCGAA TTCTAGACTA CGCGCTCCTG
CGTCGAGCTG CTACGGTGAA CAACAAAGCC ACCATGTCGG GGCTCGACAT TAGGTTCTCC
CACGCAATAG CTGCGTACCA CGCTCTCGCT AGTCGGCTAA GGAAGGATCG ACGTAGTAAT
GGAGGAAGCC TCAGCAGGCC ACTGGTTGTT ACGAACTCGT GGGGACTTGG CTCAGTAGCC
TCAGATGAGG TCACAAACCG CCTCGGCCGG TATCGTGATC AGTTTGAACA CCCCTTCAAC
CTCGCGGTCG AAGAACTATC ATTGGCCGGA GCCGATATCG TTTTTGCGGC TGGAAACAAC
GGGCAGCCTC ACCCGGACGA TTCCACTTGG CCACAGGATG AACTTCCCAT TACAGGCGCA
AATTCACATC CTCTCGCGCT ATGCGTTGGG GCTGTTACGG TAGGTGGCGA ACGGATATGC
TACTCTTCCC AAGGTCCAGG TAGACTGTTC TGGGGAAAGC CAGATGTCAT GGGTTATTCA
GAATATGTGG GATCGGAGGT GCTAGGTTCG GATACGCCAG ACGTGGGAAC ATCTGCGGCA
TGTCCGCTTG TTGCAGGAGT CATCGCGGCA GTTCGTAGCA AAATCGGGAC GGATGTGTTG
TCTCCCGTGA AGTTGCGAGA GGCAGTGAGG TGTAGCGCCT GGATGCCTTC CGTTGCAGGA
CACTGCAAGC CGAATAGCGA ATACGGGTGG GGGATCATAG ATCCGAGTGC CCTCCTGGCT
GGTGTCCGAG AGCATCTAAC TCAATCGCGA GAGTAA
 
Protein sequence
MPLGGSMGGL GVVDVMVKMQ PSDGGSVADS LDQADEVMRS ADITLDSHFE PIRMQGKSRV 
GRDEILPYGG DRFRPSSSAG EDAELRTEKG GPACVIVRAS VDTCDPREIE GSLIELNRRS
EVISIYSDPE IAPYWKCWRE NQPENLSDML SVLNLQELSA VGMDGDGVDV AVVDGGIDAD
YLVQRSRDLK PLEGWHPDNL QNTPGQYAVN NDRDAAHGTM CAHEVLLASP RARILDYALL
RRAATVNNKA TMSGLDIRFS HAIAAYHALA SRLRKDRRSN GGSLSRPLVV TNSWGLGSVA
SDEVTNRLGR YRDQFEHPFN LAVEELSLAG ADIVFAAGNN GQPHPDDSTW PQDELPITGA
NSHPLALCVG AVTVGGERIC YSSQGPGRLF WGKPDVMGYS EYVGSEVLGS DTPDVGTSAA
CPLVAGVIAA VRSKIGTDVL SPVKLREAVR CSAWMPSVAG HCKPNSEYGW GIIDPSALLA
GVREHLTQSR E