Gene Caul_4423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4423 
Symbol 
ID5901884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4790977 
End bp4792857 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content66% 
IMG OID641564941 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_001686041 
Protein GI167648378 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.765362 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTCA AGACCCTGGG CATCTGGCTG GCGATCGCGG TGGCGGTGTT GGCCGCCTAT 
GTCGTGACTC AGAGCGGCAA GGCCGGCGGC GGCAACGGCG GCGAGATGAG CTACTCGCAG
CTGCTGAAGA ACATCGACAG CGGCGACGTC AAGAAGGCCG ACATCAACGG CGATGTGGTC
AAGATCGAGC CGCGCACGGG CAAGACCTAC GCGGTCAATG TCCCGCCCAA TTCCGAGGAC
CTGGTCAAGC GTCTCGAGGC GCGCAACGCC GAGATCGTCT ACCAGCGCAA CAGCATCAGC
CTGCTGGGCA TCCTGTTCCA GATGCTGCCG ATCCTGCTGC TGATCGGCGT GTGGATCTTC
TTCATGCGCC AGATGCAGGG CGGCACCAAG GGCGCCATGG GCTTTGGCAA GTCCAAGGCC
CGGCTGCTGA CCGAGAACAA GAACCGCGTG CTGTTCGACG ACGTCGCCGG CGTCGATGAG
GCCAAGGAAG AGCTGCAGGA AGTGGTCGAG TTCCTCAAGG ACCCGGCCAA GTTCCAGCGC
CTGGGCGGCA AGATTCCCAA GGGCGCCCTG CTGGTCGGCC CGCCCGGCAC CGGCAAGACC
CTGATCGCTC GCGCCGTCGC GGGTGAGGCC GGCGTGCCGT TCTTCACCAT CTCGGGTTCG
GACTTCGTCG AGATGTTCGT TGGCGTCGGC GCCAGCCGCG TGCGCGACAT GTTCGAGCAA
GCCAAGAAGA ACGCCCCCTG CATCATCTTC ATCGACGAAA TCGACGCCGT CGGCCGCCAC
CGTGGCGCGG GCCTGGGCGG CGGCAACGAC GAGCGCGAGC AGACGCTGAA TCAGCTGCTG
GTCGAGATGG ACGGCTTCGA GGCCAACGAA GGCATCATCC TGATCGCCGC CACCAACCGT
CCAGACGTGC TGGACCCGGC CCTGCTGCGT CCGGGCCGCT TCGACCGCCA GGTCGTGGTG
CCCAATCCCG ACGTCATGGG CCGCGAGAAG ATCATCCGCG TGCACATGAA GAACGTGCCG
CTGGCCGCCG ACGTCGACGT CAAGACCCTG GCCCGCGGCA CCCCCGGCTT CTCGGGCGCC
GACCTGGCCA ACCTGGTCAA CGAGGCGGCC CTGACCGCCG CGCGCAAGAA CCGTCGCATG
GTCACCATGC ACGACTTCGA ATACGCCAAG GACAAGGTGA TGATGGGTGC CGAGCGTCGC
TCGATGGCCA TGAGCGAGGA TGAAAAGCGC AACACCGCCT ATCACGAGGG CGGTCACGCC
CTGGTGGCCC TCAGCGTCCC GGTCGCCGAC CCGGTGCACA AGGCCACCAT CGTGCCGCGC
GGTCGCGCCT TGGGCATGGT CATGCAGTTG CCGGAGGGCG ATCGCTATTC CATGAACTTC
ACCCAGATGA CCTCGCGCCT GGCCATCATG ATGGCCGGCC GCGTGGCCGA GGAGCTGATC
TTCGGCAAGG AGAACATCAC GTCCGGCGCC TCCAGCGACA TCAGCGCCGC CACCAGCCTG
GCCCGCAACA TGGTCACCCG CTGGGGCTTC TCCGACGAGC TGGGCACCGT GGCCTATGGC
GACAACCAGG ACGAGGTGTT CCTGGGCCAT TCGGTGGCCC GCACCCAGAA CGTCTCGCCC
GAGACCATGA TCAAGATCGA CAGCGAAGTG CGTCGCCTGG TCAAGGGCGG CGAGGACGAG
GCCCGCCGGA TCCTGACCGA GAAGCTGGAA CAGCTGCACT CGATCGCCAA GGCGCTGCTG
GAGTTCGAGA CCCTGTCGGG CGACGAGATC ATCGGCGTGA TGAAGGGCGT CCAGCCCACC
CGCGAGGAAG ACGAGACCAA CAAGATGCCG ACCGGCCCGA CGGCCTCGGT GCCGGTCTCG
CCCACCGGCG TGACGGCGTA G
 
Protein sequence
MNFKTLGIWL AIAVAVLAAY VVTQSGKAGG GNGGEMSYSQ LLKNIDSGDV KKADINGDVV 
KIEPRTGKTY AVNVPPNSED LVKRLEARNA EIVYQRNSIS LLGILFQMLP ILLLIGVWIF
FMRQMQGGTK GAMGFGKSKA RLLTENKNRV LFDDVAGVDE AKEELQEVVE FLKDPAKFQR
LGGKIPKGAL LVGPPGTGKT LIARAVAGEA GVPFFTISGS DFVEMFVGVG ASRVRDMFEQ
AKKNAPCIIF IDEIDAVGRH RGAGLGGGND EREQTLNQLL VEMDGFEANE GIILIAATNR
PDVLDPALLR PGRFDRQVVV PNPDVMGREK IIRVHMKNVP LAADVDVKTL ARGTPGFSGA
DLANLVNEAA LTAARKNRRM VTMHDFEYAK DKVMMGAERR SMAMSEDEKR NTAYHEGGHA
LVALSVPVAD PVHKATIVPR GRALGMVMQL PEGDRYSMNF TQMTSRLAIM MAGRVAEELI
FGKENITSGA SSDISAATSL ARNMVTRWGF SDELGTVAYG DNQDEVFLGH SVARTQNVSP
ETMIKIDSEV RRLVKGGEDE ARRILTEKLE QLHSIAKALL EFETLSGDEI IGVMKGVQPT
REEDETNKMP TGPTASVPVS PTGVTA