Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2368 |
Symbol | |
ID | 5670764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2811528 |
End bp | 2812703 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241285 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001506706 |
Protein GI | 158314198 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.352881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.118221 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGAGAC ATCTACTCCG GGCCGCGGCT ACCTGTGGAT TCATCGGCGC AGTGGCGATG ACCCAGTTAA CTGCTCAGGG TGCCGCGTCC GCGGCGCCTC CCGACCTCAC CGGCCGTTAC ATCGTGGTGC TGAAATCCGC GCCCTCTGCG GCCGCCTCCG CGGCGGCCGC GACCCGGGCT CGGGATCTCG GCGCCCAGGT GACCCGCGAG TTCCAGCACA CGCTGAACGG GTACTCCGCG CAGCTCGACC CGGCGCAGCT TGCCGCCGTC CGGGCGGATC CCGAGGTCGC CTACGTGGAA CCCGACCAGG TGGTGCGGGC CGATACCGAG CAGCGAACGG CAGACTGGGG CCTGGACCGC ATCGACCAGC GCAAGCTCCC ACTGAACCGG GCGTACACGT ACGCCTCGAC CGGCGCCAGG GTCACCGCCT ACATTGTCGA CACCGGTATC CGCACCAGCC ACCGGGATTT CGGTGGCCGC GCCTCCGGCG GTTTCTCCGT CATCGATGAC GGTTACGGAA CCGAGGACTG CAACGGCCAC GGCACGCACG TCGCCGGAAC GACCGGGGGA ACGGCGCACG GCGTCGCCAA GTCGGTGCGG CTCGTCTCGG TGCGTGTTCT GGACTGCGCC GCGTTCGGCA CCGTCAGCGG CGTCATAGCC GGCGTCGAAT GGGTCACCGC CCATCACGGC AGTGGCCCCG CCGTGGTGAA CATGAGCCTG ACCGGCGGCG CGTCGCGGGC GTTCGACCAG GCGGTGCGGC AGTCGATCGC CTCCGGCCTG GTGTACTCGG TGGCCGCGGG AAACAGCAAC GGCGACGCCT GCGCCATCTC GCCCGCGCGG GTGCCCCGGG CGATCACCGT CGGGGCGACG ACGACCGCCG ACAGCCGGGA CACCACGTAC TCCAACTTCG GCTCCTGCGT GGACGTCTTC GCTCCGGGGA CCGGGATCAC CTCGGACTGG AACACCTCCG ACACCGCCAC CAACACCATC AGCGGGACGT CGATGGCGAC CCCGCACGTC ACCGGTGTGG CGGCGCTCTA CCTGCAGCAG CATCCCGGCG CCGGGCCCAA CAAGGTGCGC GACGCGATCG TGGACACCGC GACCCGCGGC GCGCTGACCA ACATCGGCGC CGGCTCGCCG AACCTGCTGC TCTACTCACG CGGGTCGGGC TTCTGA
|
Protein sequence | MKRHLLRAAA TCGFIGAVAM TQLTAQGAAS AAPPDLTGRY IVVLKSAPSA AASAAAATRA RDLGAQVTRE FQHTLNGYSA QLDPAQLAAV RADPEVAYVE PDQVVRADTE QRTADWGLDR IDQRKLPLNR AYTYASTGAR VTAYIVDTGI RTSHRDFGGR ASGGFSVIDD GYGTEDCNGH GTHVAGTTGG TAHGVAKSVR LVSVRVLDCA AFGTVSGVIA GVEWVTAHHG SGPAVVNMSL TGGASRAFDQ AVRQSIASGL VYSVAAGNSN GDACAISPAR VPRAITVGAT TTADSRDTTY SNFGSCVDVF APGTGITSDW NTSDTATNTI SGTSMATPHV TGVAALYLQQ HPGAGPNKVR DAIVDTATRG ALTNIGAGSP NLLLYSRGSG F
|
| |