Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2346 |
Symbol | |
ID | 6376041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2516241 |
End bp | 2517284 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642684830 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_001960728 |
Protein GI | 189501258 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATCC TCGGCATTGA AACCAGTTGT GACGAAACAT CAGCCTCCGT TGTGCAGAAC GGACGTGTGA CATCGAACAT CATCAGTTCA CAGCTCATCC ACACGTCCTA TGGGGGTGTT GTTCCTGAAC TCGCGTCACG AGAACATGAG CGTCTGATTG TATCGGTTGT CGATGCTGCG GTAAATGAGG CTAATATACA AAAAAACGAT CTCGATGTCA TAGCAGCGAC CGCTGGTCCT GGGCTCATCG GTGCCGTTAT GGTCGGGCTC TGTTTCGCAC AGGGGCTCGC CTATGTGCTT GATAAACCGC TTGTCCCGGT TAACCATATC GAAGCCCATA TATTTTCAGG TTTTATTCAT GAGGGCCCTG ATCACGACCC CCCGAAAGAA GCGTTCATCT CCCTGACCGT TTCCGGCGGA CACACGATGC TCAGCGTGGT GCAACAGGAT CTGACCTATC AGGTCATCGG CCGGACAATT GATGACGCGG CAGGAGAAGC GTTCGACAAA ACCGGCAAAA TGCTCGGACT GGACTATCCT GCAGGACCGG TCATCGACCG GCTTGCTGCA GACGGAGATC CTGGATTTCA CGAGTTTCCG CGTGCTTTGA CATCGCAGTC CCGAACCAGC AAAAGCTATC GGAACAACTT CGACTTCAGT TTTTCAGGAC TGAAAACCTC GGTGCTGCAC TATATCGGCA AACAGGACCC GTCATATATC GAACGCCACC TGCAGGATAT AGCGGCATCG GTTCAGGAGG CGATCACGAG CGTACTGGTG GAGAAAACTG TCGCCGCCGC GAAGAAATAC CGCATAAACG CCATATCGGT TGCAGGCGGC GTCAGCGCCA ACTCCGGCCT CAGACAGAAA ATGGCTGTCG CGTGTGAGGC AAACGGCCTC CGCCTCTACA TCCCCAAGCC GGTCTATTCA ACAGACAACG CCGCCATGAT CGCCACATTC GCCCACCTCA AGCTGTCCCG GGGCACAACA ACACCCAACA CGTACGATAT TGCCCCGTTC GCGAGTTTTG AGACGCAAGG GTAA
|
Protein sequence | MNILGIETSC DETSASVVQN GRVTSNIISS QLIHTSYGGV VPELASREHE RLIVSVVDAA VNEANIQKND LDVIAATAGP GLIGAVMVGL CFAQGLAYVL DKPLVPVNHI EAHIFSGFIH EGPDHDPPKE AFISLTVSGG HTMLSVVQQD LTYQVIGRTI DDAAGEAFDK TGKMLGLDYP AGPVIDRLAA DGDPGFHEFP RALTSQSRTS KSYRNNFDFS FSGLKTSVLH YIGKQDPSYI ERHLQDIAAS VQEAITSVLV EKTVAAAKKY RINAISVAGG VSANSGLRQK MAVACEANGL RLYIPKPVYS TDNAAMIATF AHLKLSRGTT TPNTYDIAPF ASFETQG
|
| |