Gene Cpha266_1902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1902 
Symbol 
ID4570861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2208815 
End bp2210074 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content49% 
IMG OID639766484 
ProductSte24 endopeptidase 
Protein accessionYP_912342 
Protein GI119357698 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0130115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAATG GTTTCGGACA GGTTGTTTTG TTTACCCTTG TATTGACTTT TTTCCTCAAG 
CTTATTGCTG ATCTGCTGAA CCTCCGGGCT TCCGAGAGCG GGCTTCCGCC GGAGTTTCAG
GGGGTGTATG AAGAGGATGC CTACAGGAAA TCCCAGGACT ATCTGCGGGC AACAACCCGT
TTTTCGCTTA TCGGGGCTTT TGTCGATCTT CTTTTTCTGC TTGTTTTCTG GTTTGCCGGA
GGGTTCAATA TGCTCGACCA GCTTTTGCGC GCACAGGGAT ATAACACGGT GCTTACAGGC
GTGCTCTATA TCGGCGCTCT CTTGCTCCTG CAGGGGATTC TCGGCCTTCC CTTTACCCTT
TACAGGACAT TTGTTATCGA GGAGAGGTTT GGATTCAACA AAACCACACC GAAAGTTTTT
GTTGCTGATC TCCTGAAAAC CCTTTTTCTT GCCCTGCTCA TCGGTACTCC CGTTCTTGCC
GCTCTGCTCT GGTTTTTTGA ACAGGCAGGC CCGTTTGGAT GGCTCTGGGC CTGGGGCGGG
TTGACGCTCT TCACCCTTCT CTTGCAGTAT GTCGCTCCTG CCTGGATCAT GCCGATTTTC
AACAAGTTTG TTCCGCTTGA AGAAGGCGAG CTGAACAATG CCATTATGCA ATATGCCCGA
ACGGTCGGAT TTCCGCTAAC CGGTATTTAC GTGATTGATG GGTCGAAGCG ATCATCGAAA
GCAAATGCGT TTTTTACCGG ATTCGGCAAA CGCAAGAGAA TTGCCCTGTT TGATACGCTT
GTCAGCAACC ATAGCGTCAG TGAGCTTGTT GCTGTGCTTG CGCACGAAAT AGGTCATTAC
AAGAAAAAGC ATGTGCTCAT CAATATGGTG CTCAGCATGG TGAATCTCGG TGTTGTCTTT
TATCTCCTCT CGGTGTTCAT GAACAATCCT GATCTCTTCA GTGCTTTTTT CATGCAGGAT
ATTTCAGTCT ACGGCAGCCT TGTTTTTTTC CTTCTGCTCT ACAGTCCGGT TGAGTTCGTT
CTTTCCATTC TGCTTCAGGC GCTGTCGCGC AAGCATGAGT ATGAGGCCGA CAGCTTTGCC
GTATCAACAT ACAGCGACGG ATTCGCGCTC GGAGAGGCTC TTAAAAAGCT TTCGCGCAGC
AATCTTTCAA ACCTGACGCC TCATGCGCTC TATGTTTTTC TCAACTATTC GCATCCTCCG
GTTGTGCAGC GTATCAGACG AATAAATGAA CATCCTGCCC CCGGTCATCT CAACCATTGA
 
Protein sequence
MMNGFGQVVL FTLVLTFFLK LIADLLNLRA SESGLPPEFQ GVYEEDAYRK SQDYLRATTR 
FSLIGAFVDL LFLLVFWFAG GFNMLDQLLR AQGYNTVLTG VLYIGALLLL QGILGLPFTL
YRTFVIEERF GFNKTTPKVF VADLLKTLFL ALLIGTPVLA ALLWFFEQAG PFGWLWAWGG
LTLFTLLLQY VAPAWIMPIF NKFVPLEEGE LNNAIMQYAR TVGFPLTGIY VIDGSKRSSK
ANAFFTGFGK RKRIALFDTL VSNHSVSELV AVLAHEIGHY KKKHVLINMV LSMVNLGVVF
YLLSVFMNNP DLFSAFFMQD ISVYGSLVFF LLLYSPVEFV LSILLQALSR KHEYEADSFA
VSTYSDGFAL GEALKKLSRS NLSNLTPHAL YVFLNYSHPP VVQRIRRINE HPAPGHLNH