Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2457 |
Symbol | |
ID | 4568885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2831068 |
End bp | 2832129 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639767016 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_912869 |
Protein GI | 119358225 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.347475 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAT TAGGAATCGA GACCAGTTGC GACGAAACGT CAGCGGCAGT ACTTTCCAAC GGAAGCGTAT GCTCGAACAT TGTCAGTTCA CAGCTATGCC ATACCAGCTT CGGAGGCGTT GTTCCCGAGT TGGCGTCAAG AGAACACGAG CGGCTGATCG TTTCGATTGT CGATAGCGCA CTCAGTGAAG CCAATATAAC AAAAAACGAT CTCGATGTCA TAGCAGCCAC CGCCGGCCCG GGGCTTATAG GCGCCGTGAT GGTCGGGTTG TGTTTCGGCC AGGCAATGGC TTATGCGCTT GCTATACCGT TTGTACCCGT CAATCATATT GAAGCGCATA TTTTTTCCGC TTTCATTCAG GAAACCCCTC ATCATCAGGC TCCCGAAGGT GATTTTATCT CCCTGACGGT CTCCGGCGGC CACACGCTGC TGTCGCATGT CCATAAAGAC TTCACCTATG AGGTTATCGG CAGAACCCTT GACGATGCAG CAGGGGAAGC TTTCGATAAA ACAGGGAAAA TGCTTGGCCT GCCTTATCCG GCAGGACCGG TGATTGACCG CCTTGCAAAA AACGGTGACC CCTTTTTTCA CGAATTTCCC CGAGCGCTGA CCGCTCACTC GCAAACCAGT AAAAACTATC GCGGCAATTC TGATTTCAGT TTTTCAGGCC TTAAGACCTC CGTATTGACC TTTCTGAAAA AACAGTCGCC GGAATTCATC GAAAAACACC TTCCTGACAT TGCTGCCTCT GTCCAGAAGG CAATCGTCAG CGTGCTGGTT GAAAAAACCG TTTCCGCGGC CCTTGCCGGA AACGTTAAAG CGATATCGAT TGCAGGGGGA GTCAGTGCCA ATTCAGCGCT GAGAACCTCC ATGAAAAAGG CTTGCGAACA GCATGGAATA GCCTTTCATG TTCCCAATGC CGAGTACTCG ACCGACAATG CCGCCATGAT CGCAACTCTC GCAGGACTCC TGCTTGCGCA TGACCTGGTG CCCCGAAACC GGTATAACAT AGCTCCGTTT GCAAGTTTTG CCGCCGGTCG CCGAAAGGCT TCATTGACAT AA
|
Protein sequence | MKILGIETSC DETSAAVLSN GSVCSNIVSS QLCHTSFGGV VPELASREHE RLIVSIVDSA LSEANITKND LDVIAATAGP GLIGAVMVGL CFGQAMAYAL AIPFVPVNHI EAHIFSAFIQ ETPHHQAPEG DFISLTVSGG HTLLSHVHKD FTYEVIGRTL DDAAGEAFDK TGKMLGLPYP AGPVIDRLAK NGDPFFHEFP RALTAHSQTS KNYRGNSDFS FSGLKTSVLT FLKKQSPEFI EKHLPDIAAS VQKAIVSVLV EKTVSAALAG NVKAISIAGG VSANSALRTS MKKACEQHGI AFHVPNAEYS TDNAAMIATL AGLLLAHDLV PRNRYNIAPF ASFAAGRRKA SLT
|
| |