Gene Plut_0158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlut_0158 
Symbol 
ID3744589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium luteolum DSM 273 
KingdomBacteria 
Replicon accessionNC_007512 
Strand
Start bp166244 
End bp167290 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content61% 
IMG OID637768197 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_374091 
Protein GI78186048 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.396319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAATAC TCGGCCTGGA AACCAGCTGT GACGAAACCT CTGGCGCAGT CCTTGTCGAT 
GGGGAGGTAC GCTCTAACGT CGTCAGTTCG CAACTCTGCC ACAAAGGGTT CGGCGGCGTC
GTTCCCGAAC TGGCCTCAAG GGAGCATGAA CGGCTCATCG TTCCGATCAC CGAAGCCGCC
CTCGCCGAAG CAAATATAAC AAAAAAGGAT ATCGATGTCA TAGCCGCCAC CGCCGGACCG
GGACTCATCG GTGCTGTGAT GGTGGGACTC TCTTTCGCCC AGTCGATGGC CTGGGCACTC
GGCGTGCCGT TCGTGGCGGT CAACCATGTC GAAGCCCATA TGTTCTCTCC GTTCATCGAC
CAAGAGACTG CCGGCGGAGG TCCAATAGGG CCGTTCATCT CGCTCACGGT ATCGGGTGGA
CATACGCTGC TGGCCATCGT CCGGGAGGAT CTCACCTACC GGATCATCGG CCGCACCCTC
GACGATGCGG CCGGAGAAGC CTTTGACAAG ACCGGCAAGA TGCTCGGACT CCCCTATCCG
GCAGGACCGG CCATCGACCG GCTCGCCAAA GAGGGCGATG CCGGCTTCCA CCGGTTCCCG
CGGGCGCTCA CAAGTCAGTC GCAGACCAGC AGAAGCTACC GCGACAACTT CGACTTCAGC
TTTTCCGGTC TGAAAACATC CGTCCTCACC TGGCTCAGGA GCCAGAAAGA GGAGTTCATC
CACGAGCACC GGGCAGACAT TGCGGCATCC ATCCAGGATG CAATCGTCGG CGTGCTCGTC
GAAAAAGCGG TCGGAGCAGC ACGCCGCCAC AACATCGGGG CCATCGCCGT TGCCGGCGGC
GTGAGCGCCA ACTCGGAACT CCGACGAGCC ATGGATGCGG CCTGCCGGAA GCACGGCATT
GCGCTCTTCA TCCCTTCAGC GACCTACTCG ACAGACAACG CCGCCATGAT TGCGACGCTC
GCCGGACTGA AACTCTCCCG TGGGCTCCAG CCCCTCTGCC GGTACGACAC GGCACCCTTT
GCATCGTTCA GTGCGGCAGG GAACTAA
 
Protein sequence
MIILGLETSC DETSGAVLVD GEVRSNVVSS QLCHKGFGGV VPELASREHE RLIVPITEAA 
LAEANITKKD IDVIAATAGP GLIGAVMVGL SFAQSMAWAL GVPFVAVNHV EAHMFSPFID
QETAGGGPIG PFISLTVSGG HTLLAIVRED LTYRIIGRTL DDAAGEAFDK TGKMLGLPYP
AGPAIDRLAK EGDAGFHRFP RALTSQSQTS RSYRDNFDFS FSGLKTSVLT WLRSQKEEFI
HEHRADIAAS IQDAIVGVLV EKAVGAARRH NIGAIAVAGG VSANSELRRA MDAACRKHGI
ALFIPSATYS TDNAAMIATL AGLKLSRGLQ PLCRYDTAPF ASFSAAGN