Gene Cag_0391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0391 
Symbol 
ID3747559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp453655 
End bp455643 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content49% 
IMG OID637772919 
Productpeptidase M41, FtsH 
Protein accessionYP_378707 
Protein GI78188369 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0150338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGAAA AACCAACCAA AAAATCATCC CCCAATAATT CGCGCAACCC CTTTAAACCT 
GTTGATGACG ATAATGGCGG TGGCATGGGT AACTCGGGCA ATGGCAGCCC ACTGCCTCGC
TTTCCCCGTA TGCTCATTAT TGTTATGATT GGTATGCTGG TGCTTTTTAC AGGGCAGCGC
TTTTTTGGCA CCGCAGCAAA TCCTGAAATT AGCTACAACG AGTACAAGTC GCTTTTAGAG
CGCTCGCTTA TTGCTGAAAT TACCATTAGC AGTGGCGAAG AGCGTTCGAC GCTGCTCAAT
GGACGGCTTA CCGCACCAAC AAAATTGCAG CTTGTTAACC AAGCGCTCCA ACAAAGCGAT
CGCTTTTCGG TACGAGTGCC CTCGGTGTCG CTGGAGCAAA CCGATGCGTT GGCAGCCAAA
GGCATTCGCG TTAAGGTGGA AGAGAATTCA GGCGGTCTTA AAACCTTTTT AATCCTTTTT
GCGCCTTGGC TCATTTTTGG GCTGATTTAC TTTTTTGTGA TGCGCAATAT GAATGGAGCC
AACAATGCGC AGGCAAAAAA CATGTTTAAC TTTGGCAAAA GTCGTGCTAA AATGGCGAGC
GAGTTTGATG TTAAAGTTAC CTTTAAAGAT GTAGCTGGCG TTGACGAAGC CATTGAGGAG
TTGAAGGAAA CCGTGGAGTT TTTAGTAAAT CCTGAAAAGT TCCAGAAAAT AGGCGGCAAA
ATTCCTAAAG GCGTTTTGTT GCTGGGTCCT CCAGGTACCG GTAAAACCTT GCTTGCTAAG
GCAATTGCAG GTGAAGCCAA AGTGCCATTT TTCTCCATGT CGGGTGCCGA TTTTGTTGAA
ATGTTTGTGG GTGTAGGAGC TTCTCGTGTG CGCGATTTGT TTGAGCAAGC GAAGAAAAAC
GCGCCTTGCA TTATTTTTAT TGATGAAATT GATGCCGTTG GTCGCAGCCG TGGCGCTGGG
CTTGGCGGTG GACACGATGA GCGTGAGCAA ACGCTTAACC AGTTGTTGGT GGAAATGGAT
GGTTTTGGTA CCACCGATAA TGTGATTTTA ATTGCCGCAA CTAACCGTCC CGACGTGTTG
GATTCAGCAC TTTTACGTCC CGGACGCTTT GATCGTCAAA TCACCATTGA TAAACCCGAC
ATTCGTGGAC GTGAAGCCAT TTTAGCCATT CACACCCAAA AAACACCGCT TGATGAGAGC
GTTACCCTAA CGGTGTTGGC AAAAAGTACC CCTGGTTTTT CAGGTGCCGA CTTAGCCAAT
TTGGTGAACG AAGCGGCACT TTTAGCTGCA CGTCAAGAAG CCGAGCGCAT TACCGCAACC
CATTTTGAGC AAGCACGCGA CCGCATTTTA ATGGGTCCCG AGCGCCGAAG CATTTACATT
TCGGACGAGC AAAAAAAGCT TACCGCATAC CACGAAGCAG GGCATGTGTT GGTTGCACTT
TTTACTCCGG GTTCCGACCC CGTGCACAAG GTTACCATTA TTCCGCGTGG ACGTAGCCTT
GGCTTAACCT CGTACCTGCC GTTAGAAGAT CGCTACACGC AAAATCGTGA ATATTTAGTG
GCAATGATTT CCTACGCACT TGGTGGACGT GCGGCTGAAG AGCTGATTTT TAACGAAGTA
AGCACGGGTG CCTCAAACGA TATTGAACGC GCAACCGATA TTGCACGCCG TATGGTGCGC
CAGTGGGGCA TGAGCGAAAA GCTGGGTCCC GTCAATTACG ACAGCGGCAC CCATCGCGAA
GTGTTCCTTG GCAAAGATTA TTCACATGTT CGTGAATACA GTGAAACAAC GGCGCTGCAT
ATTGATAACG AAGTACACGC CATTATTAGC GGTTGCATGG AGCAAGCCAA AACAATTCTT
ACCACAAAGC AAGAGTTGTT GCACCGCCTT GCCTTGCAGT TAATTGAGAA GGAATCGCTC
AGCGCCGCAG AAATTGCCGA GCTTACGGGC ACGGAGCTGC CGACTTCCAC CCCAACGCTG
AAGCAGTAA
 
Protein sequence
MAEKPTKKSS PNNSRNPFKP VDDDNGGGMG NSGNGSPLPR FPRMLIIVMI GMLVLFTGQR 
FFGTAANPEI SYNEYKSLLE RSLIAEITIS SGEERSTLLN GRLTAPTKLQ LVNQALQQSD
RFSVRVPSVS LEQTDALAAK GIRVKVEENS GGLKTFLILF APWLIFGLIY FFVMRNMNGA
NNAQAKNMFN FGKSRAKMAS EFDVKVTFKD VAGVDEAIEE LKETVEFLVN PEKFQKIGGK
IPKGVLLLGP PGTGKTLLAK AIAGEAKVPF FSMSGADFVE MFVGVGASRV RDLFEQAKKN
APCIIFIDEI DAVGRSRGAG LGGGHDEREQ TLNQLLVEMD GFGTTDNVIL IAATNRPDVL
DSALLRPGRF DRQITIDKPD IRGREAILAI HTQKTPLDES VTLTVLAKST PGFSGADLAN
LVNEAALLAA RQEAERITAT HFEQARDRIL MGPERRSIYI SDEQKKLTAY HEAGHVLVAL
FTPGSDPVHK VTIIPRGRSL GLTSYLPLED RYTQNREYLV AMISYALGGR AAEELIFNEV
STGASNDIER ATDIARRMVR QWGMSEKLGP VNYDSGTHRE VFLGKDYSHV REYSETTALH
IDNEVHAIIS GCMEQAKTIL TTKQELLHRL ALQLIEKESL SAAEIAELTG TELPTSTPTL
KQ