Gene Clim_2093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2093 
Symbol 
ID6355071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2308897 
End bp2310138 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content53% 
IMG OID642669688 
Productpeptidase U32 
Protein accessionYP_001944100 
Protein GI189347571 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGAACA ACGCCATGGA ACTCATCTCC CCGGCAGGCG ACTGGACCTG CCTTCGCACC 
GCACTGAACG CCGGAGCCGA TGCCGTCTAT TTCGGCGCTG AAGGCTATAA CATGCGCGCG
GGCAGCCGCA ATTTCACGCT GGAGGAGTTT CCCGCCGTCA TGGCTCTCTG CAGAGAGTTC
AGCGCCAAAG GGTATCTGGC GCTGAACACC ATCGTCTATG ACGGGGAGCT GAAAAAGATG
CATCGAACGG TTTCCGCTGC CAGAGCGGCA GGTATCGACG CCATCATATG CTCGGACATG
GCCGTCATCG AGAGCTGCCG GAAAATCGGC ATGCCGTTTC ATATTTCGAC CCAGGCTTCG
GTCAGCAACT ACAGCGCGGT AACGTTCTAT GCCAACCTTG GCGCCAAAAT GGTGGTACTG
GCAAGAGAGC TGACCATCGA ACAGGTGCGC CATATCACCT CTAAAATAAA AGCGGACAAT
CTCGACCTGC GCATCGAATG CTTTGTTCAC GGAGCGATGT GCGTCGCCGT TTCAGGCCGC
TGCTTCATGT CACAGGAGTT GTTCGGGCGA TCAGCCAACC GGGGGCAATG CGTTCAACCC
TGCAGGAGGC AGTATATCGT CACCGATCCC GAAGAGAACC GCGAACTTGA GCTTGGTTCC
GACTACGTCA TGAGTCCGCA AGACCTGTGC GCCATAGAAT TTCTCGACGT TCTCATGGAT
GCAGGAATCG GCGCATTCAA AATCGAAGGA AGAAGCCGCA GTCCCGAATA TGTCCATACC
GCAACTTCCG CTTACCGGAA CGCCATCGAC TTTTGTACAA CCAACCGGAA CACTCCGGCT
TTCGGCGATG GATATAATGC TTTATCACAA AAACTTAAAG AAAAACTCGC CCTGGTTTAC
AACCGGGGAT TTTCGGAAGG ATTTTATTTC GGAAAACCCA TGGATGCATG GACCCGGGAG
TATGGCTCTC TGGCAGGGGA GAAAAAAATC TACATAGGGG ATGTGAAAAA ATATTATCCG
AAGGCGGGAG TTGCCGAAAT TATCATCTTT GCCAGAGGAC TCCGCAGCGG TGACAAGCTT
TCGGTTCTCG GGCCTAAAAC AGGGGTCGCA ACCATCATGG CAGACAGCTT TTTCACCAAC
GATATACCCT CAGAAGAGGC CGGCAAGGGA GACAGCGTCA CCATTAAATG TGCACAGGTG
AGAAAAAACG ACAAGGTTTA CGTGCTTGAA AAAAGGAGAT GA
 
Protein sequence
MQNNAMELIS PAGDWTCLRT ALNAGADAVY FGAEGYNMRA GSRNFTLEEF PAVMALCREF 
SAKGYLALNT IVYDGELKKM HRTVSAARAA GIDAIICSDM AVIESCRKIG MPFHISTQAS
VSNYSAVTFY ANLGAKMVVL ARELTIEQVR HITSKIKADN LDLRIECFVH GAMCVAVSGR
CFMSQELFGR SANRGQCVQP CRRQYIVTDP EENRELELGS DYVMSPQDLC AIEFLDVLMD
AGIGAFKIEG RSRSPEYVHT ATSAYRNAID FCTTNRNTPA FGDGYNALSQ KLKEKLALVY
NRGFSEGFYF GKPMDAWTRE YGSLAGEKKI YIGDVKKYYP KAGVAEIIIF ARGLRSGDKL
SVLGPKTGVA TIMADSFFTN DIPSEEAGKG DSVTIKCAQV RKNDKVYVLE KRR