Gene Information |
Locus tag | Cpha266_1520 |
Symbol | |
ID | 4569570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1729391 |
End bp | 1730287 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639766102 |
Product | HhH-GPD family protein |
Protein accession | YP_911966 |
Protein GI | 119357322 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
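The coordinates, gene length and protein length reported above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the coordinates are 1-based and inclusive (which the reported length of 897 bp implies) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1729391
end_bp = 1730287

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 897 0 298
```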
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG CGATAACTCC GAATGTGGTT TCATCCGCTG AACTCGAATA CTATCGACGT CCTGTTGTAC AGAAGGACGT CGATATCGAG TTGTTCCATC AGAAAATCCT TGGATTCCAT AAAGACAATC GTCGATCTTT TCCCTGGAGG GAAACAACAG ACCGTTATGC CATCATGGTC AGCGAGATCA TGCTTCAGCA GACACAGGCT GATCGTGTTA CCGAAAAATA TCAGGCCTGG ATGAGGCGGT TTCCTGATAT CAGAACACTT GCAGATGCTT CGCTCAGGGA TGTGCTTGCT CTCTGGAGCG GACTTGGCTA TAACTCCCGC GGACAGCGGT TACAGAACTG CGCAAGGGAG ATCGAAGATC GTTTTAACGG GGTAGTGCCT TCACTTCCGA CAGAGCTTAA AACTCTTCCT GGTATTGGCG ATTACACCTG CCGATCCATT CCTGTATTTG CCGATAACCT CGATGTTGCC GCAGTCGATA CCAATATCCG AAGAATCATC ATTCACGAGT TCGCCCTTCC GGAAGATATT TCCAAATCGC AGATTCAGGC GGTTGCGGAG CAGCTTCTGC CGATAGGCCG CAGCAGACTG TGGCATAACG CTCTCATGGA CTACGGTGCA CTCTTTCTCA CCAGTCGGAA CACCGGCATT CGCCCCTTGA CGAAGCAGTC GAAATTCGAG GGGTCAAAAC GCTGGTATCG CGGCAGGCTG CTCAAAGAGC TTGTCGCCAG GGATTGTGTC TTTGTTGAAG AGATTCATGA AAAATATGGC TCCTGCCCAT GGGGTTTGCA GGAGATCCTC GATGATCTCC TGCGCGAAGG TCTTGTCGAA GAGGCCGACT GGTCGAATCG GCAAGGAGGA AGAGTGTTGC GCATAAGAGC TCGATGA
|
Protein sequence | MKKAITPNVV SSAELEYYRR PVVQKDVDIE LFHQKILGFH KDNRRSFPWR ETTDRYAIMV SEIMLQQTQA DRVTEKYQAW MRRFPDIRTL ADASLRDVLA LWSGLGYNSR GQRLQNCARE IEDRFNGVVP SLPTELKTLP GIGDYTCRSI PVFADNLDVA AVDTNIRRII IHEFALPEDI SKSQIQAVAE QLLPIGRSRL WHNALMDYGA LFLTSRNTGI RPLTKQSKFE GSKRWYRGRL LKELVARDCV FVEEIHEKYG SCPWGLQEIL DDLLREGLVE EADWSNRQGG RVLRIRAR
|
| |
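The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in the permitted start codons. The following is a minimal sketch, not the database's own method. It strips the 10-base spacing, translates the first 30 bases of the gene sequence and computes their GC fraction. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (T, C, A, G). Translation table 11 uses the same
# codon-to-amino-acid assignments as the standard code; it differs only in
# which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record.
prefix = clean("ATGAAAAAGG CGATAACTCC GAATGTGGTT")
print(translate(prefix))    # MKKAITPNVV
print(gc_fraction(prefix))  # 0.4
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence through the same functions should, if the record is internally consistent, reproduce the 298-residue protein and a GC fraction close to the reported 51%.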