Gene Cag_0547 details

Gene Information

Locus tag: Cag_0547
Symbol:
ID: 3747051
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 651268
End bp: 652536
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 47%
IMG OID: 637773081
Product: collagenase
Protein accession: YP_378863
Protein GI: 78188525
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
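
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 651268
end_bp = 652536
gene_length_bp = 1269
protein_length_aa = 422

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons (3 bp each).
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # coordinates agree with Gene Length
print(remainder == 0)                                # length is a multiple of 3
print(expected_protein_length == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions the three fields are mutually consistent: 1269 bp is 423 codons, and 423 minus the stop codon gives 422 amino acids.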


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACCCT TGAAAAGTCA ACCGAACCAT TCAATTCCAC AAGCTGAACT TATTGCCCCC 
GCAGGCGATA TGACGGCGCT TGTAACTGCT TTACAGGCAG GTGCCGATGC CGTCTATTTT
GGTGCCGAAG GCTATAACAT GCGAGCAGGC AGCAACAACT TTACAAATAG CGACTTTGCT
ACGGTGCGTG CCCTCTGTTC CAAGCACAAC GCAAAAGCCT ATCTTGCACT TAACACCATT
ATTTACGATA GCGAGCTCAA GCAGATGCGC CAAAGCGTTG AAAGCGCAAA AGCCGCTGGT
ATTGACGCTA TTATTTGCTC CGATATGGCA GTTATTGAAG CGTGTCGCCA AGCCGAAATG
CCCATCCACC TCTCCACCCA AGCCTCTGTA AGCAACTACA ACACACTCCG CTTTTTTGCA
GAGCAGGGTG CCGCCATGGT TGTGCTTGCT CGCGAGCTTA CCATCGAGCA AGTGCGCCAC
ATTACACGCA ACATTCAGCA CGACAGCTTA CCTGTTCGCA TTGAGTGCTT TGTGCACGGA
GCCATGTGCG TTGCGGTGTC GGGGCGTTGC TTTCTTTCAC AGGAGTTATT TGGGCGATCA
GCAAATCGAG GACAATGTGT TCAGCCATGT CGCCGTAGCT ACATTATTAC CGATCCCGAA
GAGAACGAAG AGCTTGAGCT TGGCGCTGAT TACGTGATGA GCCCAAAAGA TTTATGCGCC
ATTGAATTTC TTGATGTATT GCTTGATGCT GGCATTAGCG CCTTTAAAAT TGAAGGAAGG
AGCCGTAGCC CCGAATACGT TCACACTACT ACTACGGCAT ATCGCCAAGC ACTCAATATG
TGCATGCAGC AGCGCCACCA AGCCGATTTT AGAAACCGTT ACAGCGCCTT AACAGCTTCG
TTAAAGCACG ATTTAGCAAC AGTTTACAAT CGAGGATTTT CTAACGGCTT TTATTTCGGC
AAGCCTATGG AGGCATGGGC ACAAACGTAT GGATCACAAG CAACGGAGAA AAAAACCTAT
ATAGGCGACA TCAATAAATA CTTTCCAAAA GCAGGAATTG CTGAATTACA CATTCGAGCA
CGAGGTTTAA AGCAAGGCGA TAAACTTTCT ATTCTTGGTG TAAAAAGTGG GATGGTAACG
GTTATAGCTG ATTCATTTCT TACCAACGAT CAACCAAATA CGGAAGCAAT AAAAGGGGAT
AGCGTTACCT TTAAATGCCC TCCCGTTCGC AAAAATGATA AAGTATATGT TTTAGAGGAG
AGAAAGTAA
 
Protein sequence
MTPLKSQPNH SIPQAELIAP AGDMTALVTA LQAGADAVYF GAEGYNMRAG SNNFTNSDFA 
TVRALCSKHN AKAYLALNTI IYDSELKQMR QSVESAKAAG IDAIICSDMA VIEACRQAEM
PIHLSTQASV SNYNTLRFFA EQGAAMVVLA RELTIEQVRH ITRNIQHDSL PVRIECFVHG
AMCVAVSGRC FLSQELFGRS ANRGQCVQPC RRSYIITDPE ENEELELGAD YVMSPKDLCA
IEFLDVLLDA GISAFKIEGR SRSPEYVHTT TTAYRQALNM CMQQRHQADF RNRYSALTAS
LKHDLATVYN RGFSNGFYFG KPMEAWAQTY GSQATEKKTY IGDINKYFPK AGIAELHIRA
RGLKQGDKLS ILGVKSGMVT VIADSFLTND QPNTEAIKGD SVTFKCPPVR KNDKVYVLEE
RK
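
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a hedged illustration of how the two relate, the sketch below strips that formatting, translates DNA codon by codon, and computes a GC fraction. It is not the database's own method. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon-to-amino-acid assignments used are those of the standard genetic code, which translation table 11 shares; table 11's alternative start codons are not modelled here.

```python
from itertools import product

# Standard codon assignments in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 9 bases of the gene sequence in this record.
first_bases = clean("ATGACACCCT TGAAAAGTCA ACCGAACCAT")
last_bases = clean("AGAAAGTAA")

print(translate(first_bases))  # MTPLKSQPNH
print(translate(last_bases))   # RK
```

The two printed fragments match the start (MTPLKSQPNH) and end (RK) of the protein sequence listed in this record. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.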