Gene Information |
Locus tag | Franean1_1984 |
Symbol | |
ID | 5670385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2383614 |
End bp | 2385593 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641240905 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_001506327 |
Protein GI | 158313819 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
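The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a few lines of arithmetic. This is a minimal sketch that assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the stated 1980 bp length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2383614
end_bp = 2385593

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both endpoints.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop
# codon and so does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1980
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 659
```

Because the gene is on the minus strand, the reported sequence would be the reverse complement of the replicon between these coordinates; the length arithmetic is the same for either strand.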
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.614445 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCTCCC GTGTGAGCAT CGGCCATATT CATGCCTCCG CGCCGGCGAA ATCCGGGCCG CCGAAGGACA CTCCCCCGAA ACCCGCGCCG CCGCCGATGC CGGGCTGGCG CAGGTTCCTC ATCCCGGTCG GGCTGCTCGT CACGGTGATG CTGCTGATCG CTCCGAGCCT GTTCGCCACC CAGCCGGATT CGCTGACCTA CTCCGATTTC GTGGGTCGGG TCGACAGCGG CGTGGTGCGC TCGGTCACCA TCGATGACCG CGGTGGTGTC GACGGCACGC TGACCGACGG CACTGACTTC ACCACCCAGA TCCCCACCGC ACTCGACACA ACCGCGCTCG AACGTCAGCT CGCCGCCAAG AAGGTGCAGA TCACGGCCAC CAGGACGGGC ACGTCGTTCT GGTCGGTGGT GCTGAGCTTC CTGCCGCTGC TGCTCCTGAT CGGTTTCTTC GTGTGGTCCG GCCGCATGGC CCGGCGCCAG CTCTCCGGCG GCGGCGCGCT CGGAATGTTC GGCCGCTCCC GGGCGAAGAT CACCGAGGCG GACCGGCCTG ACACTCGTTT CAGCGACGTC GCGGGCTACG AGGGCGCGAA ACAGGAGATC AGCGAGGTCG TCGCCTTCCT GCGCAATCCC GACCAGTATC TTGAAGTCGG CGCACACGGC CCGCGCGGCG TGCTGATGGT CGGCCCGCCC GGCACCGGCA AGACCCTGCT CGCGCGCGCC GTCGCCGGCG AGGCCGAGGT GCCGTTCCTT TCGATCACCG GCTCGGGCTT CGTCGAGATG TTCGTCGGTG TCGGCGCATC CCGGGTGCGG GACCTCTTCA CCGAGGCTCG CAAGCGGGCG CCGTCCATCA TCTTCATCGA CGAGATCGAC GCCATCGGCG GCCGGCGCGG CTCCAGCGCG TTCGGCGGGT CCAACGACGA GCGCGAGCAG ACGCTGAACC AGCTCCTCGC CGAGATGGAC GGGTTCGAGT CCACGTCGGG CGTGGTCGTC CTCGCCGCCA CGAACCGCCC CGAGACCCTT GACCACGCGC TGCTGCGCCC GGGTCGGTTC GACCGCCAGG TCACAGTGCC GCTGCCGACC CAGTCCGAAC GGGCCGAGAT CCTCGCCGTG CACACCCGCG GCAAGGCACT CACCGATGAC GCCGACCTCA CCCGGATCGC CCGGGGCACG CCCGGATTCT CCGGCGCGGA CCTGGCCAGC CTCGTCAACG AGGCGGCGAT CAACGCCGTC CGCGACGGAC GTTCGGTGGT CAGCGCCGCC GATCTCGACG CCGCGCGTGA CCGCATTCTC CTCGGACGGC GCGATGCCTC GAACGCCCTG CTCCCGGACG AGAAGCGGTC CGTCGCCGTG CACGAGTCGG GCCATGCCCT GGTGGCGGCA CTCTGCGACG ACGCCGACCC GGTCGCGAAG GTGACCATCC TCCCCTCGGG CATGGCGCTC GGCGTCACCC AGCAGCTCCC CGAGGCCGAG CGGCACCTCT ACTCCGAGGC TTATCTGCTG GACAGCCTGG CCGTGCGGCT CGGCGGTCGG GCGGCCGAGC TGGTGGTGTT CGGCCACGGC TCCACCGGTG CCTCGAACGA CCTGGCCGGC GCGACCCAAC TCGCCACCCG GATGGTGCGT GAGTTCGGGC TGTCGGAGGA GATCGGACCG GTGGGCTACT CGTCCGACGG GCCCAACTTC CTCGGCGGGG ACGACCTCAT GGCCCGCCCC TACTCGGAGC AGACGCAACG AGTGATCGAC GCCGAGGTGG CACGGCTGCT GCGGGAGGCT CAGGCGCGAG CCGTCGACCT GCTGCGCATG 
CATCGGAACG CACTCGACGC CCTGACCGCG CGCCTGCTGG AACGGGAGAC CGTCGACGGC ACGGTGGTAG AGGAGCTCGC CGCCGCGTCG ATGGCGAGCT TCACGCGCAG CCCGAACGGA GACGGCTCGG GGGAAGGTCC GGACGGCGGC ATCCCACCGC AGATCTCCCT GCAGACCTGA
|
Protein sequence | MISRVSIGHI HASAPAKSGP PKDTPPKPAP PPMPGWRRFL IPVGLLVTVM LLIAPSLFAT QPDSLTYSDF VGRVDSGVVR SVTIDDRGGV DGTLTDGTDF TTQIPTALDT TALERQLAAK KVQITATRTG TSFWSVVLSF LPLLLLIGFF VWSGRMARRQ LSGGGALGMF GRSRAKITEA DRPDTRFSDV AGYEGAKQEI SEVVAFLRNP DQYLEVGAHG PRGVLMVGPP GTGKTLLARA VAGEAEVPFL SITGSGFVEM FVGVGASRVR DLFTEARKRA PSIIFIDEID AIGGRRGSSA FGGSNDEREQ TLNQLLAEMD GFESTSGVVV LAATNRPETL DHALLRPGRF DRQVTVPLPT QSERAEILAV HTRGKALTDD ADLTRIARGT PGFSGADLAS LVNEAAINAV RDGRSVVSAA DLDAARDRIL LGRRDASNAL LPDEKRSVAV HESGHALVAA LCDDADPVAK VTILPSGMAL GVTQQLPEAE RHLYSEAYLL DSLAVRLGGR AAELVVFGHG STGASNDLAG ATQLATRMVR EFGLSEEIGP VGYSSDGPNF LGGDDLMARP YSEQTQRVID AEVARLLREA QARAVDLLRM HRNALDALTA RLLERETVDG TVVEELAAAS MASFTRSPNG DGSGEGPDGG IPPQISLQT
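The relationship between the gene sequence and the protein sequence above can be sketched in plain Python. The gene starts with GTG rather than ATG; under translation table 11 (the bacterial, archaeal, and plant plastid code), GTG is an accepted start codon and the initiator position is read as methionine, which is why the protein begins with M. The helper names below (`clean_sequence`, `translate_cds`, `gc_fraction`) are hypothetical and written only for this illustration. The sketch assumes the input is a complete CDS that begins at a valid start codon, and it relies on table 11 using the same amino acid assignments as the standard code for non-initiator codons.

```python
# Codon table built from the conventional TCAG ordering. Table 11 shares
# these amino acid assignments with the standard code; it differs in
# which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(text.split()).upper()


def translate_cds(dna):
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is a valid start codon, so it is
    always reported as M (this is what turns GTG into methionine).
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGATCTCCC GTGTGAGCAT CGGCCATATT"))  # MISRVSIGHI
```

Passing the full gene sequence to `translate_cds` should reproduce the listed protein sequence if the record is internally consistent, and `gc_fraction` on the same input can be compared against the GC content field.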