Gene Cphamn1_2346 details

Gene Information

Locus tag: Cphamn1_2346
Symbol:
ID: 6376041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 2516241
End bp: 2517284
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 54%
IMG OID: 642684830
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_001960728
Protein GI: 189501258
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
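
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2516241, 2517284)
print(length)                  # 1044
print(protein_length(length))  # 347
```

Both values match the Gene Length (1044 bp) and Protein Length (347 aa) listed in the record.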


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATCC TCGGCATTGA AACCAGTTGT GACGAAACAT CAGCCTCCGT TGTGCAGAAC 
GGACGTGTGA CATCGAACAT CATCAGTTCA CAGCTCATCC ACACGTCCTA TGGGGGTGTT
GTTCCTGAAC TCGCGTCACG AGAACATGAG CGTCTGATTG TATCGGTTGT CGATGCTGCG
GTAAATGAGG CTAATATACA AAAAAACGAT CTCGATGTCA TAGCAGCGAC CGCTGGTCCT
GGGCTCATCG GTGCCGTTAT GGTCGGGCTC TGTTTCGCAC AGGGGCTCGC CTATGTGCTT
GATAAACCGC TTGTCCCGGT TAACCATATC GAAGCCCATA TATTTTCAGG TTTTATTCAT
GAGGGCCCTG ATCACGACCC CCCGAAAGAA GCGTTCATCT CCCTGACCGT TTCCGGCGGA
CACACGATGC TCAGCGTGGT GCAACAGGAT CTGACCTATC AGGTCATCGG CCGGACAATT
GATGACGCGG CAGGAGAAGC GTTCGACAAA ACCGGCAAAA TGCTCGGACT GGACTATCCT
GCAGGACCGG TCATCGACCG GCTTGCTGCA GACGGAGATC CTGGATTTCA CGAGTTTCCG
CGTGCTTTGA CATCGCAGTC CCGAACCAGC AAAAGCTATC GGAACAACTT CGACTTCAGT
TTTTCAGGAC TGAAAACCTC GGTGCTGCAC TATATCGGCA AACAGGACCC GTCATATATC
GAACGCCACC TGCAGGATAT AGCGGCATCG GTTCAGGAGG CGATCACGAG CGTACTGGTG
GAGAAAACTG TCGCCGCCGC GAAGAAATAC CGCATAAACG CCATATCGGT TGCAGGCGGC
GTCAGCGCCA ACTCCGGCCT CAGACAGAAA ATGGCTGTCG CGTGTGAGGC AAACGGCCTC
CGCCTCTACA TCCCCAAGCC GGTCTATTCA ACAGACAACG CCGCCATGAT CGCCACATTC
GCCCACCTCA AGCTGTCCCG GGGCACAACA ACACCCAACA CGTACGATAT TGCCCCGTTC
GCGAGTTTTG AGACGCAAGG GTAA
 
Protein sequence
MNILGIETSC DETSASVVQN GRVTSNIISS QLIHTSYGGV VPELASREHE RLIVSVVDAA 
VNEANIQKND LDVIAATAGP GLIGAVMVGL CFAQGLAYVL DKPLVPVNHI EAHIFSGFIH
EGPDHDPPKE AFISLTVSGG HTMLSVVQQD LTYQVIGRTI DDAAGEAFDK TGKMLGLDYP
AGPVIDRLAA DGDPGFHEFP RALTSQSRTS KSYRNNFDFS FSGLKTSVLH YIGKQDPSYI
ERHLQDIAAS VQEAITSVLV EKTVAAAKKY RINAISVAGG VSANSGLRQK MAVACEANGL
RLYIPKPVYS TDNAAMIATF AHLKLSRGTT TPNTYDIAPF ASFETQG
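
The sequences above are printed in blocks of 10 characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that step together with a simple GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listed above, used as example input.
first_line = "ATGAATATCC TCGGCATTGA AACCAGTTGT GACGAAACAT CAGCCTCCGT TGTGCAGAAC"
seq = clean_sequence(first_line)
print(len(seq))   # 60
print(seq[:3])    # ATG
print(gc_percent(seq))
```

Applying `gc_percent` to the full cleaned gene sequence would give a value to compare against the GC content field of the record.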