Gene Cag_0109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0109 
Symbol 
ID3747597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp123241 
End bp124293 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content50% 
IMG OID637772635 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_378430 
Protein GI78188092 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAATAC TTGGAATTGA AACCAGTTGC GATGAAACCT CAGCCTCCGT GCTCCATAAC 
GGCGTGGTGC TGTCGAACAT TGTAAGCTCG CAACATTGCC ACACTTCGTT TGGCGGCGTG
GTGCCCGAAC TTGCTTCTCG CGAACATGAG AGGCTTATTA CCGCCATTAC GGAGACGGCA
ATAAATGAGG CAAATATACA AAAAGATGCG CTTGATGTTA TAGCGGCAAC GGCTGGACCG
GGGTTAATTG GGGCAATTAT GGTGGGCTTG TGCTTTGCGC AAGGCATGGC GTGCGCCTTA
AACATTCCCT TTGTGCCCAT TAATCATATT GAAGCGCACA TCTTTTCCCC CTTTATTAAT
AGCGGCGCAA ACAGCCCGCT TCCCAAAGAG GGCTACATTT CTCTGACGGT ATCGGGTGGG
CACACCTTGC TTGCCCTTGT AAAACCCGAT CTTTCCTACA CGATTGTTGG AAAAACGCTG
GATGATGCCG CTGGTGAGGC GTTTGATAAA ACGGGAAAAA TGATCGGGCT TCCCTATCCT
GCTGGACCCG TTATTGATAA ACTTGCCGAA AATGGTAATC CCAATTTTTA TCACTTCCCT
CGCGCCTTAA CGTCGCGCTC AAAGAGCCGC AAAAGCTGGG AAGGCAACCT CGACTTTAGC
TTTTCGGGCA TGAAAACCTC TGTGCTTACA TGGTTGCAGC AGCAAAGCCC AGAGAGCGTT
GCTTCCAACC TCCCCGATAT TGCCGCCTCC ATTCAAGCAG CTATTGTGGA TGTATTAGTA
GAAAAAAGCA TTGCCGCAGC TAAGCACTAC AACGTAAGCA CCATTGCCAT TGCAGGCGGC
GTTAGTGCTA ACCGAGGATT ACGCAGCTCC ATGCAAGCCG CCTGCCAGCA ACACGGCATT
ACCCTCTGCC TACCTGAAAC CATCTACTCA ACCGATAACG CCGCTATGAT TGCAAGCATT
GCTGCACTCA AGCTCTCGCA TGGTATGGAA CCACTGTACC GCTATAACGT GGCACCCTAT
GCAAGCTTTT TACACAAAGA CAACTTTTCG TAG
 
Protein sequence
MIILGIETSC DETSASVLHN GVVLSNIVSS QHCHTSFGGV VPELASREHE RLITAITETA 
INEANIQKDA LDVIAATAGP GLIGAIMVGL CFAQGMACAL NIPFVPINHI EAHIFSPFIN
SGANSPLPKE GYISLTVSGG HTLLALVKPD LSYTIVGKTL DDAAGEAFDK TGKMIGLPYP
AGPVIDKLAE NGNPNFYHFP RALTSRSKSR KSWEGNLDFS FSGMKTSVLT WLQQQSPESV
ASNLPDIAAS IQAAIVDVLV EKSIAAAKHY NVSTIAIAGG VSANRGLRSS MQAACQQHGI
TLCLPETIYS TDNAAMIASI AALKLSHGME PLYRYNVAPY ASFLHKDNFS