Gene Cphamn1_1169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1169 
Symbol 
ID6374844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1257409 
End bp1258713 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content51% 
IMG OID642683667 
Productprotein of unknown function DUF107 
Protein accessionYP_001959584 
Protein GI189500114 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1030] Membrane-bound serine protease (ClpP class) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0123968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGTTT TGCTGCTTTT TGCGGCAGTG ACGGTTTTTT CATCAACTTT ACGCGCTGAA 
GAGGCAAAAG GCAACAAGAC AGTTCTTTTT CTCTCTCTTC AGGGTACGGT GAATCCTGGA
AGCGCGGATT TTTTTGAGCG TGCGATTGAC CAGGCTGAAA AAGAGAAGGT TCACGCTATC
CTTGTTGAAC TTGATACACC TGGCGGGCTT GTCTCCTCAT TGCGTGCAAT GGTACAGAGC
GTGCTTGCTT CGCCTGTTCC TGTGATTGTT TATGTGGCGC CTCAGGGAGC TCAGGCTGCA
TCTGCAGGGG CTCTGTTGAC ACTATCCGCA CATGTCGCTG CCATGTCTCC GGGCACGGAG
ATCGGAGCGG CACATCCTGT CGGTCTCGGT GGAGGAGGTG ATGGTGATGA AACCATGAGT
AAAAAGGCCG AGAATGATCT TGCCGCTTTT GCCCGGAGCA TAGCGGAAGA AAGGGGAAGA
AATGCTGAGT GGGCGGAAAA CGCGGTACGG GAAAGTATTG CTTCAACCGC AAACGAAGCA
CTCAAGGCTG GAGTTATCGA TTTTGTCGCC GCCGATCGTG CGGAACTTTT CAGGATGCTT
GACGGCAGGA CGGTCGAAAC GATCGATGGC AGTCTGACGC TTGATTTGAC GGGAGCAGTT
ATTGAAGAAT TTTCTCCGAC CTTGCAGGAA CAGATCCTTA TTAAGCTTGC CGACCCCAAT
CTGGCATATA TTTTTATCAT GGTCGGGCTT GCAGGGCTCT ATTTCGAGTT AGCAAATCCG
GGCTCTATTT TCCCCGGAGT ACTGGGCGCA ATATCGCTTC TTCTTGCTCT TTTCGCTCTT
CAGGCTTTGC CTGTCAATGT CGTCGGTGTG TTGCTCATTG TTCTGGCGGT GGTATTTTTC
GGGCTGGAAC TCTTTGTCGC TAGCGGCGGT ATACTGGCTC TGGCGGGCCT GGTAGCTCTT
TTTGTCGGCT CTCTTATGCT TTTCAATACG GCTGAAACAG GGATTTCCAT TTCCATGACG
GTTTTCCTTC CCGTATTTAT CATGGTGTCA GTATCCCTTT TGGCTATTGT CTGGCTCGTT
ACCAAATCCT CAAGGCTGAA GCTTTCTTCC GGACCCGAAC AGCTGATCGG GGAGGAGGGC
AGTGTGATTC ATGCCATTTT GCCCGGTCAG CCCGGAAAGG TGTTTGTTCA TGGCGAGCTT
TGGGACGCGG AAAGCGGCGA AGAGATCCCT GAAAAGGGAG TCGCGATCGT GAAAGGTTTG
AAAGGACTTA TTTTGCAGGT AACCAAAAAA CAGGAGAACG TATAA
 
Protein sequence
MIVLLLFAAV TVFSSTLRAE EAKGNKTVLF LSLQGTVNPG SADFFERAID QAEKEKVHAI 
LVELDTPGGL VSSLRAMVQS VLASPVPVIV YVAPQGAQAA SAGALLTLSA HVAAMSPGTE
IGAAHPVGLG GGGDGDETMS KKAENDLAAF ARSIAEERGR NAEWAENAVR ESIASTANEA
LKAGVIDFVA ADRAELFRML DGRTVETIDG SLTLDLTGAV IEEFSPTLQE QILIKLADPN
LAYIFIMVGL AGLYFELANP GSIFPGVLGA ISLLLALFAL QALPVNVVGV LLIVLAVVFF
GLELFVASGG ILALAGLVAL FVGSLMLFNT AETGISISMT VFLPVFIMVS VSLLAIVWLV
TKSSRLKLSS GPEQLIGEEG SVIHAILPGQ PGKVFVHGEL WDAESGEEIP EKGVAIVKGL
KGLILQVTKK QENV