Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4423 |
Symbol | |
ID | 5901884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4790977 |
End bp | 4792857 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641564941 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_001686041 |
Protein GI | 167648378 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.765362 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTCA AGACCCTGGG CATCTGGCTG GCGATCGCGG TGGCGGTGTT GGCCGCCTAT GTCGTGACTC AGAGCGGCAA GGCCGGCGGC GGCAACGGCG GCGAGATGAG CTACTCGCAG CTGCTGAAGA ACATCGACAG CGGCGACGTC AAGAAGGCCG ACATCAACGG CGATGTGGTC AAGATCGAGC CGCGCACGGG CAAGACCTAC GCGGTCAATG TCCCGCCCAA TTCCGAGGAC CTGGTCAAGC GTCTCGAGGC GCGCAACGCC GAGATCGTCT ACCAGCGCAA CAGCATCAGC CTGCTGGGCA TCCTGTTCCA GATGCTGCCG ATCCTGCTGC TGATCGGCGT GTGGATCTTC TTCATGCGCC AGATGCAGGG CGGCACCAAG GGCGCCATGG GCTTTGGCAA GTCCAAGGCC CGGCTGCTGA CCGAGAACAA GAACCGCGTG CTGTTCGACG ACGTCGCCGG CGTCGATGAG GCCAAGGAAG AGCTGCAGGA AGTGGTCGAG TTCCTCAAGG ACCCGGCCAA GTTCCAGCGC CTGGGCGGCA AGATTCCCAA GGGCGCCCTG CTGGTCGGCC CGCCCGGCAC CGGCAAGACC CTGATCGCTC GCGCCGTCGC GGGTGAGGCC GGCGTGCCGT TCTTCACCAT CTCGGGTTCG GACTTCGTCG AGATGTTCGT TGGCGTCGGC GCCAGCCGCG TGCGCGACAT GTTCGAGCAA GCCAAGAAGA ACGCCCCCTG CATCATCTTC ATCGACGAAA TCGACGCCGT CGGCCGCCAC CGTGGCGCGG GCCTGGGCGG CGGCAACGAC GAGCGCGAGC AGACGCTGAA TCAGCTGCTG GTCGAGATGG ACGGCTTCGA GGCCAACGAA GGCATCATCC TGATCGCCGC CACCAACCGT CCAGACGTGC TGGACCCGGC CCTGCTGCGT CCGGGCCGCT TCGACCGCCA GGTCGTGGTG CCCAATCCCG ACGTCATGGG CCGCGAGAAG ATCATCCGCG TGCACATGAA GAACGTGCCG CTGGCCGCCG ACGTCGACGT CAAGACCCTG GCCCGCGGCA CCCCCGGCTT CTCGGGCGCC GACCTGGCCA ACCTGGTCAA CGAGGCGGCC CTGACCGCCG CGCGCAAGAA CCGTCGCATG GTCACCATGC ACGACTTCGA ATACGCCAAG GACAAGGTGA TGATGGGTGC CGAGCGTCGC TCGATGGCCA TGAGCGAGGA TGAAAAGCGC AACACCGCCT ATCACGAGGG CGGTCACGCC CTGGTGGCCC TCAGCGTCCC GGTCGCCGAC CCGGTGCACA AGGCCACCAT CGTGCCGCGC GGTCGCGCCT TGGGCATGGT CATGCAGTTG CCGGAGGGCG ATCGCTATTC CATGAACTTC ACCCAGATGA CCTCGCGCCT GGCCATCATG ATGGCCGGCC GCGTGGCCGA GGAGCTGATC TTCGGCAAGG AGAACATCAC GTCCGGCGCC TCCAGCGACA TCAGCGCCGC CACCAGCCTG GCCCGCAACA TGGTCACCCG CTGGGGCTTC TCCGACGAGC TGGGCACCGT GGCCTATGGC GACAACCAGG ACGAGGTGTT CCTGGGCCAT TCGGTGGCCC GCACCCAGAA CGTCTCGCCC GAGACCATGA TCAAGATCGA CAGCGAAGTG CGTCGCCTGG TCAAGGGCGG CGAGGACGAG GCCCGCCGGA TCCTGACCGA GAAGCTGGAA CAGCTGCACT CGATCGCCAA GGCGCTGCTG GAGTTCGAGA CCCTGTCGGG CGACGAGATC ATCGGCGTGA TGAAGGGCGT CCAGCCCACC CGCGAGGAAG ACGAGACCAA CAAGATGCCG ACCGGCCCGA CGGCCTCGGT GCCGGTCTCG CCCACCGGCG TGACGGCGTA G
|
Protein sequence | MNFKTLGIWL AIAVAVLAAY VVTQSGKAGG GNGGEMSYSQ LLKNIDSGDV KKADINGDVV KIEPRTGKTY AVNVPPNSED LVKRLEARNA EIVYQRNSIS LLGILFQMLP ILLLIGVWIF FMRQMQGGTK GAMGFGKSKA RLLTENKNRV LFDDVAGVDE AKEELQEVVE FLKDPAKFQR LGGKIPKGAL LVGPPGTGKT LIARAVAGEA GVPFFTISGS DFVEMFVGVG ASRVRDMFEQ AKKNAPCIIF IDEIDAVGRH RGAGLGGGND EREQTLNQLL VEMDGFEANE GIILIAATNR PDVLDPALLR PGRFDRQVVV PNPDVMGREK IIRVHMKNVP LAADVDVKTL ARGTPGFSGA DLANLVNEAA LTAARKNRRM VTMHDFEYAK DKVMMGAERR SMAMSEDEKR NTAYHEGGHA LVALSVPVAD PVHKATIVPR GRALGMVMQL PEGDRYSMNF TQMTSRLAIM MAGRVAEELI FGKENITSGA SSDISAATSL ARNMVTRWGF SDELGTVAYG DNQDEVFLGH SVARTQNVSP ETMIKIDSEV RRLVKGGEDE ARRILTEKLE QLHSIAKALL EFETLSGDEI IGVMKGVQPT REEDETNKMP TGPTASVPVS PTGVTA
|
| |