Gene Franean1_2368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2368 
Symbol 
ID5670764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2811528 
End bp2812703 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content70% 
IMG OID641241285 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001506706 
Protein GI158314198 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.352881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.118221 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGAGAC ATCTACTCCG GGCCGCGGCT ACCTGTGGAT TCATCGGCGC AGTGGCGATG 
ACCCAGTTAA CTGCTCAGGG TGCCGCGTCC GCGGCGCCTC CCGACCTCAC CGGCCGTTAC
ATCGTGGTGC TGAAATCCGC GCCCTCTGCG GCCGCCTCCG CGGCGGCCGC GACCCGGGCT
CGGGATCTCG GCGCCCAGGT GACCCGCGAG TTCCAGCACA CGCTGAACGG GTACTCCGCG
CAGCTCGACC CGGCGCAGCT TGCCGCCGTC CGGGCGGATC CCGAGGTCGC CTACGTGGAA
CCCGACCAGG TGGTGCGGGC CGATACCGAG CAGCGAACGG CAGACTGGGG CCTGGACCGC
ATCGACCAGC GCAAGCTCCC ACTGAACCGG GCGTACACGT ACGCCTCGAC CGGCGCCAGG
GTCACCGCCT ACATTGTCGA CACCGGTATC CGCACCAGCC ACCGGGATTT CGGTGGCCGC
GCCTCCGGCG GTTTCTCCGT CATCGATGAC GGTTACGGAA CCGAGGACTG CAACGGCCAC
GGCACGCACG TCGCCGGAAC GACCGGGGGA ACGGCGCACG GCGTCGCCAA GTCGGTGCGG
CTCGTCTCGG TGCGTGTTCT GGACTGCGCC GCGTTCGGCA CCGTCAGCGG CGTCATAGCC
GGCGTCGAAT GGGTCACCGC CCATCACGGC AGTGGCCCCG CCGTGGTGAA CATGAGCCTG
ACCGGCGGCG CGTCGCGGGC GTTCGACCAG GCGGTGCGGC AGTCGATCGC CTCCGGCCTG
GTGTACTCGG TGGCCGCGGG AAACAGCAAC GGCGACGCCT GCGCCATCTC GCCCGCGCGG
GTGCCCCGGG CGATCACCGT CGGGGCGACG ACGACCGCCG ACAGCCGGGA CACCACGTAC
TCCAACTTCG GCTCCTGCGT GGACGTCTTC GCTCCGGGGA CCGGGATCAC CTCGGACTGG
AACACCTCCG ACACCGCCAC CAACACCATC AGCGGGACGT CGATGGCGAC CCCGCACGTC
ACCGGTGTGG CGGCGCTCTA CCTGCAGCAG CATCCCGGCG CCGGGCCCAA CAAGGTGCGC
GACGCGATCG TGGACACCGC GACCCGCGGC GCGCTGACCA ACATCGGCGC CGGCTCGCCG
AACCTGCTGC TCTACTCACG CGGGTCGGGC TTCTGA
 
Protein sequence
MKRHLLRAAA TCGFIGAVAM TQLTAQGAAS AAPPDLTGRY IVVLKSAPSA AASAAAATRA 
RDLGAQVTRE FQHTLNGYSA QLDPAQLAAV RADPEVAYVE PDQVVRADTE QRTADWGLDR
IDQRKLPLNR AYTYASTGAR VTAYIVDTGI RTSHRDFGGR ASGGFSVIDD GYGTEDCNGH
GTHVAGTTGG TAHGVAKSVR LVSVRVLDCA AFGTVSGVIA GVEWVTAHHG SGPAVVNMSL
TGGASRAFDQ AVRQSIASGL VYSVAAGNSN GDACAISPAR VPRAITVGAT TTADSRDTTY
SNFGSCVDVF APGTGITSDW NTSDTATNTI SGTSMATPHV TGVAALYLQQ HPGAGPNKVR
DAIVDTATRG ALTNIGAGSP NLLLYSRGSG F