Gene Cpha266_0709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0709 
Symbol 
ID4569883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp808188 
End bp809585 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content47% 
IMG OID639765307 
ProductTPR repeat-containing protein 
Protein accessionYP_911188 
Protein GI119356544 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTGT CTGATTTTTT TGATGAAAAC CGTCATGAAT CATCCGGTTT ATCACAAAAA 
GAGCCGCCTG ATCTTGACGA TCTTGAAAGC ATGTATGATG CTGAGGAGCT GATTGATCTT
ATCAGCCAGC TCAATGAAGA CGGATTTATC CAGGAGGCCC TTGCCGTTGC GCAAAGGCTT
GAAGCCGTTT CGCCATACAA TGCCGAGACC TGGTTTCATC TTGGTAACTG TCTCACCGTT
AACGGGTATT TCAACGATGC CCTTGAGGCA TTCAATCAGG CAGTGCTGCT CAGTCCCGCT
GACAGCGAAA TGCGCCTCAA CTACGCTCTT GCGCACTTCA ATACGGGATC TCTTGACGAA
GCTCTCGAAA TCCTTGAGGA TATGTATGTT GACTCATCGA TTGAACGGGA GTACTCCTAC
TACCGGGGTA TCATTCTGCA GCGGCTTGAA CGGTTCACTG AATCAGAAAA AGATTTTGAA
CACTGCCTCG AACTCGATCC TGATTTTTCT GACGCATGGT ATGAACTTGC CTACGGAAAA
GATCTCCTGG GAAAACTCGA AGAGAGCACG GCCTGTTACA ACAAAGCACT GGACCACGAT
CCCTACAATA TCAATGCATG GTACAACAAC GGTCTGGTAC TCAGTAAACT GAAACGATAC
GATGAAGCGC TTCAGTGCTA CGATATGTCT CTCGCTCTTG CCGATGATTT CAGTTCCGCA
TGGTATAACC GGGCCAATGT TCTTGCCATA ACGGGAAAAA TCGAAGAGGC GGCAGAAAGC
TATGTGAAAA CTCTTGAGTT TGAACCTGAC GACCTCAATG CCCTGTACAA TCTTGGTATT
GCCTACGAAG AACTTGAGGA GTACAGTGAA GCTATTCTCT GCTATCGACG CTGCATCGAG
CTCAATAACG ATTTCCATGA TGCATGGTTT GCGCTTGCAT GCTGTTATGA AGCCATCGAA
CAGTATAATG AGGGATCACT TGCCATTATT GAAGCTCTGA AGGCAATCCC TGACAGCATC
GAGTTTCTGC TGCTTAAAGC TGAAATAGAG TATAATCTCA ATGAGCTCGA ACACTCTCTT
GAAACCTATC GACATATCAT CACCCTTGAC CCTGAAAGTC CGCAGATATG GGTTGATTAT
GCCATGGTCC TTCGCGAAGC AGGCTATAAC AACGAATCCA TTGAGGCCCT TCATCAGTCG
CTGAAACTTC AGCCGCATTC GGCTGATGCC CATTTCGAGA TTGCTGCCGC CTATTTTGCC
ATGGGGGACA AACTCAGCAC CCTGAAAGCG CTGAGCAAGG CATTCAAAAT CGACCCTGAT
AAAAAACAAC TTTTTCAAAG CACCTTCCCG GAACTTTATC AGCAGGATTC CGTTAGAAAA
ATGCTTGGCA TTTCCTGA
 
Protein sequence
MSLSDFFDEN RHESSGLSQK EPPDLDDLES MYDAEELIDL ISQLNEDGFI QEALAVAQRL 
EAVSPYNAET WFHLGNCLTV NGYFNDALEA FNQAVLLSPA DSEMRLNYAL AHFNTGSLDE
ALEILEDMYV DSSIEREYSY YRGIILQRLE RFTESEKDFE HCLELDPDFS DAWYELAYGK
DLLGKLEEST ACYNKALDHD PYNINAWYNN GLVLSKLKRY DEALQCYDMS LALADDFSSA
WYNRANVLAI TGKIEEAAES YVKTLEFEPD DLNALYNLGI AYEELEEYSE AILCYRRCIE
LNNDFHDAWF ALACCYEAIE QYNEGSLAII EALKAIPDSI EFLLLKAEIE YNLNELEHSL
ETYRHIITLD PESPQIWVDY AMVLREAGYN NESIEALHQS LKLQPHSADA HFEIAAAYFA
MGDKLSTLKA LSKAFKIDPD KKQLFQSTFP ELYQQDSVRK MLGIS