Gene Franean1_0886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0886 
Symbol 
ID5669300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1031357 
End bp1034107 
Gene Length2751 bp 
Protein Length916 aa 
Translation table11 
GC content76% 
IMG OID641239813 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_001505248 
Protein GI158312740 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.421013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACACCA CCGGCGACGA CCAGACCACC GATGCCCGTC CTGGCACGGA CAAGGCAAAA 
GCCCCGTCAG TGCCCGTGCC CGCGCCCCGC TCGCGGGCGG CGGAGGGCAC CGTCCCGTCG
TCAGGAGGCG TGAGCCGACC GGTCCCGGCG TCCCTGCCGC TGCACGGGCC GGAACGTCGC
CCCGCGACGG AGCGTCGTCC CGGCCCCGAA GCTCGGGCGG ACACGCCGCG CCGCCCCGCG
TCGGCGGCGG AACGGCCACC CTTCGTCGAG CCGGTCCGGG CCACCACACG CGGGAGCGTG
CCGACCACAG GCCGCCCGTC CAGCACGGGG AGCTCGTCCA CGACGGGTAA CGCCGGGGCC
ACCGGCGGCA CACGCGACGA GACGACCAGG CGCGTCGGCG GGAAACGGGA CGGTGCCGGC
CCACGGGGCC CGGCCGCTCG CGCGACATCC GGCCGGGAAC CAGCCGATCG GGAGGCATCC
AGCCGGGAGG CGCCCAGCCG GGCTACGGCC GCACGGGAGG CAGCCGCCGC GCCGAAGGGG
CCGGCGGGCC GGCAGGGCGC CGTCCCGCCG GGGGCGTCGG ATGGTGCCGC TGCCGCGAAG
ACTCCGCTCG TCCGGGAAGG CCGTGCCGCC CAGAACGGCT CGTCTTCTCA GAACGGTTCC
GCTACCCGGG ATGCTGCCGG GCGGAATGGC TCGTCCTCTC AGAACGGCGC CGCCAGCCGG
GACGCCGCCG CGCGGACTGG CTCCATGGGC CCCGAAAGCG TGGCCCGCAC GCCGATCGCG
TCGCCTACCG GGGATCCGGG TCGGCGTGAC ACCGCGTCCC GCGCGGGCGC GCCGGAGCGT
GACGGTGGCG GGCGTGGCGT TCAGGGCACG GGCGCCGCTT CCGGCCCGGG GCCCGCGCCA
GGAGCGGGAT CGGCCCCTGG CGCGCCGGCT GCCCGGGGAG GCTCCGATCC GGGCAGCGCC
GCGAAGGCGC GTGTGACGGC GCAGCCGCGG CCAGCGGTTG CGCGGGAGAG CTCGCGGCCC
TCGGTTCCGC CACCGCAGGG CCCGCTGCCG TCGGCTTCGC GGGAGGCCGT CCCACGGGAG
AGCAGCCGTG AGGCCGTCTC GCGGGAGGGC ACCCGGGGGA GCGTCTCGCG GGAGGTCAGC
CGGGAGGCGG TCTCGCGGGA GAGTGCTCGG GGGAGTGTTC CGCGGGAGGT CAGCCGGGAG
GCGGTCTCGC GGGAGACGGG CCGGGCACCG GTCTCGCGGG AGACCAGCCG TGAGGCCGTC
TCCAGGGACG GTGCCCGGGG GAGCGTCCCG CGGGAGGGCC TCCGTGAGAC GACCTCGCGG
GACGCCGACC AGCGCGGGGC CGGCCGGCCG TTGGCCGACC CGCTGGTGGA TCTGGTGGGG
CCGCGGCGCG CCCCGCGCCC GCCCACGCCG CCGCCCTCGG AGCTTGCGCC ATCCGGTAGG
TCTGCGCCCT CCTCGGCCAC GCCCGCCCCA TCCGCCCCAT CTGTACCAGC GACCTCGGCG
GCTGGGACGT CGGGCGCTGG GAAATCGGCA CCCGCCTCCG CAGCAGGGGG GTCGACGTCT
GGTGTGTCCA CGACCGATGC GGCCAGAGCA ACGGCCGACG CGGCGAGAGC CGCGAGCGCG
GCGACGCACC ACCGCGCGGC CGCTCCGAAG ATCCGCCATG GCCGGCGGGC GCTCGTGATC
ACTGCTGTGA TCGCCGGACT GCTGGGGGGA GCGACCGGTG GGTGGGTGAC GAGCCTGATA
CTGGGTGACT CCGAGGGGAC GTCGTCGCCG ACAGCGTCCG CCGAGGCCGC GCCGACGGTC
ATCGACCCCG GCTCGGTGGC CGGGGTCGTC GCGCGAGTGC TGCCGTCCGT TGTGACCATC
GACGTGACGG CGGGGGCCGA GGGCGGGAAC GGCTCCGGGG TGATCATTCG GTCTGAGGGC
TACGTACTCA CCAACAACCA TGTCATCGCG CCGGCGGCGA ACGCGGGTGG CCAGGTAATG
ATCACGATGA GTGATGGCGC CGAGCCCGTG CTCGCCGAGA TCGCCGGACG GGACGCCTCT
TCAGATCTTG CGGTGCTGCG CATCCCCGGG GCCTCCGGCC TGCCGGCGGC GACGCTGGGA
CGGTCCGGTT CGCTGGTCGC CGGCGCTCCG GTGATCGCGA TCGGTGCGCC CTTCGGACTC
TCGGGGACGG TCACCACGGG GATCGTCAGC GCGCTCGACC GGAACCCGAC CGTGCCCGCC
GAGGGCGGCG GGGCGTCCGT GATCATCGGA GCGATCCAGA TCGACGCGGC GATCAATCCC
GGGAACTCCG GTGGCCCGCT GCTCGACGCC CGTGGCCAGG TCGTCGGCCT GAACACGGCG
ATCGCGACGG CGCCGGGCGG GCAGGCGCCG TCGGGCAGCG TCGGCGTCGG GTTCGCGATC
CCCATCGACT ACGCCGCGTC GGTGGCGGAC GAGATCATCC GCACCGGGCG GGCCACCCAC
CCCTACACCG GAGTGTCGGC CGCGACGGTC ACCGCCGCCG AGGCCCGGGC GCGCGGCACC
ACCCCGGGCG CGATCATCCG TGACGTCGAG CCGGCGGGCC CCGCGGCCGC GGCCGGGCTG
CTGCCGGGCG ACATCATCAC CCGGGTCGAC GACACGGTCG TCACCAGCAC GAACGATCTC
ACCGCGGCCA CCCGGCTGCA CCACGTCGGC GACACGGTGA CCGTGACCTT CCAGCGCAAC
GGAGTGGAGA GCACAGCGCG GGTGGTCCTC CAGGAACAGT CGCCCGGCTG A
 
Protein sequence
MDTTGDDQTT DARPGTDKAK APSVPVPAPR SRAAEGTVPS SGGVSRPVPA SLPLHGPERR 
PATERRPGPE ARADTPRRPA SAAERPPFVE PVRATTRGSV PTTGRPSSTG SSSTTGNAGA
TGGTRDETTR RVGGKRDGAG PRGPAARATS GREPADREAS SREAPSRATA AREAAAAPKG
PAGRQGAVPP GASDGAAAAK TPLVREGRAA QNGSSSQNGS ATRDAAGRNG SSSQNGAASR
DAAARTGSMG PESVARTPIA SPTGDPGRRD TASRAGAPER DGGGRGVQGT GAASGPGPAP
GAGSAPGAPA ARGGSDPGSA AKARVTAQPR PAVARESSRP SVPPPQGPLP SASREAVPRE
SSREAVSREG TRGSVSREVS REAVSRESAR GSVPREVSRE AVSRETGRAP VSRETSREAV
SRDGARGSVP REGLRETTSR DADQRGAGRP LADPLVDLVG PRRAPRPPTP PPSELAPSGR
SAPSSATPAP SAPSVPATSA AGTSGAGKSA PASAAGGSTS GVSTTDAARA TADAARAASA
ATHHRAAAPK IRHGRRALVI TAVIAGLLGG ATGGWVTSLI LGDSEGTSSP TASAEAAPTV
IDPGSVAGVV ARVLPSVVTI DVTAGAEGGN GSGVIIRSEG YVLTNNHVIA PAANAGGQVM
ITMSDGAEPV LAEIAGRDAS SDLAVLRIPG ASGLPAATLG RSGSLVAGAP VIAIGAPFGL
SGTVTTGIVS ALDRNPTVPA EGGGASVIIG AIQIDAAINP GNSGGPLLDA RGQVVGLNTA
IATAPGGQAP SGSVGVGFAI PIDYAASVAD EIIRTGRATH PYTGVSAATV TAAEARARGT
TPGAIIRDVE PAGPAAAAGL LPGDIITRVD DTVVTSTNDL TAATRLHHVG DTVTVTFQRN
GVESTARVVL QEQSPG