Gene Clim_2297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2297 
Symbol 
ID6355642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2529945 
End bp2531006 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content55% 
IMG OID642669888 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_001944299 
Protein GI189347770 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0294499 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTT TAGGAATAGA AACCAGCTGC GATGAAACAT CGGCAGCGGT ACTGCTCGAC 
GGCAGAATCG GTTCAAACGT TATCAGTTCA CAGCGCTGTC ATACCAGCTT CGGAGGAGTC
GTTCCGGAAC TTGCATCGAG AGAACACGAA CGGACAATAG TGTCCATTGT CAACAGCGCG
GTAACTGAAG CCAATATAAC GAAAAATGAA CTCGATTGCA TAGCCGCCAC CGCCGGCCCG
GGTCTTATCG GCGCGGTTAT GGTAGGACTC TGCTTCGCCG AAGGCATGGC GTTCGCTCTC
GGCATTCCGT TCGTTCCGGT GAACCATATC GAGGCGCATA TGTTTTCGGC CTTCATTCCC
GAATCGCCGG AACACAAGTC TCCTGAAGGC CCCTTTATCT CGCTGACCGT ATCCGGAGGC
CATACGCTTC TTTCGCTTGT CCGCGAAGAT CTCTCCTATG ACGTGATCGG AAAAACGCTC
GATGACGCCG CAGGGGAGGC TTTCGATAAA ACCGGCAAGA TGCTCGGCCT CGCATATCCC
GCGGGGCCGG TTATCGACCG CCTTGCGGCA TCGGGGAATC CTCACTTCCA TGCTTTTCCC
AAAGCCCTGA CGTCGAGTTC GCAAACCAGC AGAAGCTATC GGGGCAACTT CGATTTCAGC
TTTTCGGGCC TGAAAACCTC GGTGCTGACC TGGCTGCAGA AGCACCCGGC AGAGTTCATA
CAAACCCATC TGCATGATAT CGCCGCATCG ATACAATACG CCATTGTAAG CGTTCTGACA
GAAAAAGCCG TTGCGGCTGC GCGGTATTTC CGTACCGACG CCATCTCCGT AGCCGGAGGG
GTCAGCGCCA ATTCGGCATT GAGAACGGCG ATGCAGGAAG CCTGTCGGCA CCACGGTATC
CGATTGTATA TACCCGGCAC GGTATATTCG ACCGACAATG CCGCCATGAT AGCCTCGCTT
GCCGGTCTCA TGCTCTCGAA AGGCGCCGTG CGGAAAAACA ATTATGACGT CGCTCCATTC
GCAAGCTTTG CCGCGGGAGC GATCAAGGCA TCATTGAAAT AA
 
Protein sequence
MNILGIETSC DETSAAVLLD GRIGSNVISS QRCHTSFGGV VPELASREHE RTIVSIVNSA 
VTEANITKNE LDCIAATAGP GLIGAVMVGL CFAEGMAFAL GIPFVPVNHI EAHMFSAFIP
ESPEHKSPEG PFISLTVSGG HTLLSLVRED LSYDVIGKTL DDAAGEAFDK TGKMLGLAYP
AGPVIDRLAA SGNPHFHAFP KALTSSSQTS RSYRGNFDFS FSGLKTSVLT WLQKHPAEFI
QTHLHDIAAS IQYAIVSVLT EKAVAAARYF RTDAISVAGG VSANSALRTA MQEACRHHGI
RLYIPGTVYS TDNAAMIASL AGLMLSKGAV RKNNYDVAPF ASFAAGAIKA SLK